Repository logo
Article

Towards textual data augmentation for neural networks: synonyms and maximum loss

creativeworkseries.issn1508-2806
dc.contributor.authorJungiewicz, Michał
dc.contributor.authorSmywiński-Pohl, Aleksander
dc.date.available2025-06-17T07:37:34Z
dc.date.issued2019
dc.descriptionBibliogr. s. 79-83.
dc.description.abstractData augmentation is one of the ways to deal with labeled data scarcity and overfitting. Both of these problems are crucial for modern deep-learning algorithms, which require massive amounts of data. The problem is better explored in the context of image analysis than for text, this work is a step forward to help close this gap. We propose a method for augmenting textual data when training convolutional neural networks for sentence classification. The augmentation is based on the substitution of words using a thesaurus as well as Princeton University's WordNet. Our method improves upon the baseline in most of the cases. In terms of accuracy, the best of the variants is 1.2% (pp.) better than the baseline.en
dc.description.placeOfPublicationKraków
dc.description.versionwersja wydawnicza
dc.identifier.doihttps://doi.org/10.7494/csci.2019.20.1.3023
dc.identifier.eissn2300-7036
dc.identifier.issn1508-2806
dc.identifier.urihttps://repo.agh.edu.pl/handle/AGH/113221
dc.language.isoeng
dc.publisherWydawnictwa AGH
dc.relation.ispartofComputer Science
dc.rightsAttribution 4.0 International
dc.rights.accessotwarty dostęp
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/legalcode
dc.subjectdeep learningen
dc.subjectdata augmentationen
dc.subjectneural networksen
dc.subjectnatural language processingen
dc.subjectsentence classificationen
dc.titleTowards textual data augmentation for neural networks: synonyms and maximum lossen
dc.title.relatedComputer Scienceen
dc.typeartykuł
dspace.entity.typePublication
publicationissue.issueNumberNo. 1
publicationissue.paginationpp. 57-83
publicationvolume.volumeNumberVol. 20
relation.isAuthorOfPublication0c190561-f05e-48d1-91f1-f6be571d5572
relation.isAuthorOfPublication.latestForDiscovery0c190561-f05e-48d1-91f1-f6be571d5572
relation.isJournalIssueOfPublication0a53592a-d344-44ab-a173-c9ab7912b51d
relation.isJournalIssueOfPublication.latestForDiscovery0a53592a-d344-44ab-a173-c9ab7912b51d
relation.isJournalOfPublication020291ee-249b-4dcf-98a3-276a2f7981aa

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
csci.2019.20.1.57.pdf
Size:
777.46 KB
Format:
Adobe Portable Document Format