Enhanced cluster merging and deep learning techniques for entity name identification from biomedical corpus
| creativeworkseries.issn | 1508-2806 | |
| dc.contributor.author | Das, Nilanjana | |
| dc.contributor.author | Dutta, Rakesh | |
| dc.contributor.author | Mondal, Uttam Kumar | |
| dc.contributor.author | Majumder, Mukta | |
| dc.contributor.author | Mandal, Jyotsna Kumar | |
| dc.date.available | 2025-09-25T05:28:40Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | For mining biomedical information identifying names is the prime task. Complex and uncertain naming styles of biomedical entities are the major setbacks here. Thus, state-of-the-art accuracy of biomedical name identification is reasonably inferior compared to general domain. This study includes Machine Learning and Deep Learning techniques to recognize names from biomedical corpus. In supervised classification, a classifier is built by finding required statistics from training corpus. Accordingly, performance of the system is primarily dependent on quantity and quality of training corpus. But manually preparing a large training dataset with enriched feature samples is laborious and time-taking. Therefore, various techniques were adopted in the literature to make effective use of raw corpora. We have incorporated a novel Cluster Merging technique and Attention Mechanism with BERT embedding for boosting Machine Learning and Deep Learning classifiers respectively. The suggested results outpour that profound techniques are competent and delineate signifying improvement over surviving methods. | en |
| dc.description.placeOfPublication | Kraków | |
| dc.description.version | wersja wydawnicza | |
| dc.identifier.doi | https://doi.org/10.7494/csci.2025.26.1.5600 | |
| dc.identifier.eissn | 2300-7036 | |
| dc.identifier.issn | 1508-2806 | |
| dc.identifier.uri | https://repo.agh.edu.pl/handle/AGH/114955 | |
| dc.language.iso | eng | |
| dc.publisher | Wydawnictwa AGH | |
| dc.relation.ispartof | Computer Science | |
| dc.rights | Attribution 4.0 International | |
| dc.rights.access | otwarty dostęp | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/legalcode | |
| dc.subject | biomedical named entity recognition | en |
| dc.subject | conditional random field | en |
| dc.subject | support vector machine | en |
| dc.subject | cluster merging | en |
| dc.subject | BERT | en |
| dc.subject | bidirectional GRU | en |
| dc.subject | attention mechanism | en |
| dc.title | Enhanced cluster merging and deep learning techniques for entity name identification from biomedical corpus | en |
| dc.title.related | Computer Science | en |
| dc.type | artykuł | |
| dspace.entity.type | Publication | |
| publicationissue.issueNumber | No. 1 | |
| publicationissue.pagination | pp. 49-75 | |
| publicationvolume.volumeNumber | Vol. 26 | |
| relation.isJournalIssueOfPublication | 1c9dc1de-09ac-408b-ae17-fb1ca607a65f | |
| relation.isJournalIssueOfPublication.latestForDiscovery | 1c9dc1de-09ac-408b-ae17-fb1ca607a65f | |
| relation.isJournalOfPublication | 020291ee-249b-4dcf-98a3-276a2f7981aa |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- csci.2025.26.1.49.pdf
- Size:
- 924.17 KB
- Format:
- Adobe Portable Document Format
