Show simple item record

dc.contributor.authorLu, Jing
dc.contributor.authorWang, Cuiqing
dc.contributor.authorKeech, Malcolm
dc.date.accessioned2020-09-11T10:04:19Z
dc.date.available2020-09-11T10:04:19Z
dc.date.issued2017-09-25
dc.identifier.citationLu J, Wang C, Keech M (2017) 'A novel approach to knowledge discovery and representation in biological databases', International Journal of Bioinformatics Research and Applications, 13 (4), pp.352-375.en_US
dc.identifier.issn1744-5485
dc.identifier.doi10.1504/IJBRA.2017.087384
dc.identifier.urihttp://hdl.handle.net/10547/624500
dc.description.abstractExtraction of motifs from biological sequences is among the frontier research issues in bioinformatics, with sequential patterns mining becoming one of the most important computational techniques in this area. A number of applications motivate the search for more structured patterns and concurrent protein motif mining is considered here. This paper builds on the concept of structural relation patterns and applies the concurrent sequential patterns (ConSP) mining approach to biological databases. Specifically, an original method is presented using support vectors as the data structure for the extraction of novel patterns in protein sequences. Data modelling is pursued to represent the more interesting concurrent patterns visually. Experiments with real-world protein datasets from the UniProt and NCBI databases highlight the applicability of the ConSP methodology in protein data mining and modelling. The results show the potential for knowledge discovery in the field of protein structure identification. A pilot experiment extends the methodology to DNA sequences to indicate a future direction.en_US
dc.language.isoenen_US
dc.publisherInderscienceen_US
dc.relation.urlhttp://www.inderscience.com/offer.php?id=87384en_US
dc.rightsYellow - can archive pre-print (ie pre-refereeing)
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/*
dc.subjectbioinformaticsen_US
dc.subjectdata analyticsen_US
dc.subjectstructural relationsen_US
dc.subjectbiological databasesen_US
dc.subjectconcurrent vector methoden_US
dc.subjectgraphical modelingen_US
dc.subjectprotein motif miningen_US
dc.subjectsequential patterns post-processingen_US
dc.subjectknowledge discoveryen_US
dc.titleA novel approach to knowledge discovery and representation in biological databasesen_US
dc.typeArticleen_US
dc.identifier.eissn1744-5493
dc.identifier.journalInternational Journal of Bioinformatics Research and Applicationsen_US
dc.date.updated2020-09-11T09:55:00Z
dc.description.note
refterms.dateFOA2020-09-11T10:04:19Z


Files in this item

Thumbnail
Name:
296Lu_ANovelApproachToKnowledg ...
Size:
501.6Kb
Format:
PDF
Description:
author's accepted version

This item appears in the following Collection(s)

Show simple item record

Yellow - can archive pre-print (ie pre-refereeing)
Except where otherwise noted, this item's license is described as Yellow - can archive pre-print (ie pre-refereeing)