Towards textual data augmentation for neural networks: synonyms and maximum loss
| creativeworkseries.issn | 1508-2806 | |
| dc.contributor.author | Jungiewicz, Michał | |
| dc.contributor.author | Smywiński-Pohl, Aleksander | |
| dc.date.available | 2025-06-17T07:37:34Z | |
| dc.date.issued | 2019 | |
| dc.description | Bibliogr. s. 79-83. | |
| dc.description.abstract | Data augmentation is one of the ways to deal with labeled data scarcity and overfitting. Both of these problems are crucial for modern deep-learning algorithms, which require massive amounts of data. The problem is better explored in the context of image analysis than for text, this work is a step forward to help close this gap. We propose a method for augmenting textual data when training convolutional neural networks for sentence classification. The augmentation is based on the substitution of words using a thesaurus as well as Princeton University's WordNet. Our method improves upon the baseline in most of the cases. In terms of accuracy, the best of the variants is 1.2% (pp.) better than the baseline. | en |
| dc.description.placeOfPublication | Kraków | |
| dc.description.version | wersja wydawnicza | |
| dc.identifier.doi | https://doi.org/10.7494/csci.2019.20.1.3023 | |
| dc.identifier.eissn | 2300-7036 | |
| dc.identifier.issn | 1508-2806 | |
| dc.identifier.uri | https://repo.agh.edu.pl/handle/AGH/113221 | |
| dc.language.iso | eng | |
| dc.publisher | Wydawnictwa AGH | |
| dc.relation.ispartof | Computer Science | |
| dc.rights | Attribution 4.0 International | |
| dc.rights.access | otwarty dostęp | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/legalcode | |
| dc.subject | deep learning | en |
| dc.subject | data augmentation | en |
| dc.subject | neural networks | en |
| dc.subject | natural language processing | en |
| dc.subject | sentence classification | en |
| dc.title | Towards textual data augmentation for neural networks: synonyms and maximum loss | en |
| dc.title.related | Computer Science | en |
| dc.type | artykuł | |
| dspace.entity.type | Publication | |
| publicationissue.issueNumber | No. 1 | |
| publicationissue.pagination | pp. 57-83 | |
| publicationvolume.volumeNumber | Vol. 20 | |
| relation.isAuthorOfPublication | 0c190561-f05e-48d1-91f1-f6be571d5572 | |
| relation.isAuthorOfPublication.latestForDiscovery | 0c190561-f05e-48d1-91f1-f6be571d5572 | |
| relation.isJournalIssueOfPublication | 0a53592a-d344-44ab-a173-c9ab7912b51d | |
| relation.isJournalIssueOfPublication.latestForDiscovery | 0a53592a-d344-44ab-a173-c9ab7912b51d | |
| relation.isJournalOfPublication | 020291ee-249b-4dcf-98a3-276a2f7981aa |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- csci.2019.20.1.57.pdf
- Size:
- 777.46 KB
- Format:
- Adobe Portable Document Format
