Computer Science

Numer czasopisma

Computer Science

ISSN: 1508-2806

Pliki

csci.2015.16.2.red.pdf (119.08 KB)

csci.2015.16.2.tytul.pdf (29.72 KB)

csci.2015.16.2.contents.pdf (58.95 KB)

Data wydania

2015

Rocznik

Vol. 16

Numer

No. 2

Prawa dostępu

Dostęp: otwarty dostęp

Prawa: CC BY 4.0

Uznanie autorstwa 4.0 Międzynarodowe (CC BY 4.0)

Rocznik czasopisma (rel.)

Rocznik czasopisma

Computer Science

Vol. 16 (2015)

Artykuły numeru (rel.)

Artykuł

Dostęp ograniczony

Pre-trained Deep Neural Network using Sparse Autoencoders and Scattering Wavelet Transform for musical genre recognition

(2015) Kleć, Mariusz; Koržinek, Danijel

Research described in this paper tries to combine the approach of Deep Neural Networks (DNN) with the novel audio features extracted using the Scattering Wavelet Transform (SWT) for classifying musical genres. The SWT uses a sequence of Wavelet Transforms to compute the modulation spectrum coefficients of multiple orders, which has already shown to be promising for this task. The DNN in this work uses pre-trained layers using Sparse Autoencoders (SAE). Data obtained from the Creative Commons website jamendo.com is used to boost the well-known GTZAN database, which is a standard benchmark for this task. The final classifier is tested using a 10-fold cross validation to achieve results similar to other state-of-the-art approaches.

Artykuł

Dostęp ograniczony

Application of linguistic cues in the analysis of language of hate groups

(2015) Balcerzak, Bartłomiej; Jaworski, Wojciech

Hate speech and fringe ideologies are social phenomena that thrive on-line. Members of the political and religious fringe are able to propagate their ideas via the Internet with less effort than in traditional media. In this article, we attempt to use linguistic cues such as the occurrence of certain parts of speech in order to distinguish the language of fringe groups from strictly informative sources. The aim of this research is to provide a preliminary model for identifying deceptive materials online. Examples of these would include aggressive marketing and hate speech. For the sake of this paper, we aim to focus on the political aspect. Our research has shown that information about sentence length and the occurrence of adjectives and adverbs can provide information for the identification of differences between the language of fringe political groups and mainstream media.

Artykuł

Dostęp ograniczony

Automated credibility assessment on Twitter

(2015) Lorek, Krzysztof; Wiciński, Jacek; Jankowski-Lorek, Michał; Gupta, Amit

In this paper, we make a practical approach to automated credibility assessment on Twitter. We describe the process behind the design of an automated classifier for information credibility assessment. As an addition, we propose practical implementation of TwitterBOT, a tool which is able to score submitted tweets while working in the native Twitter interface.

Artykuł

Dostęp ograniczony

Noisy-parallel and comparable corpora filtering methodology for the extraction of bi-lingual equivalent data at sentence level

(2015) Wołk, Krzysztof

Text alignment and text quality are critical to the accuracy of Machine Translation (MT) systems, some NLP tools, and any other text processing tasks requiring bilingual data. This research proposes a language-independent bisentence filtering approach based on Polish (not a position-sensitive language) to English experiments. This cleaning approach was developed on the TED Talks corpus and also initially tested on the Wikipedia comparable corpus, but it can be used for any text domain or language pair. The proposed approach implements various heuristics for sentence comparison. Some of the heuristics leverage synonyms as well as semantic and structural analysis of text as additional information. Minimization of data loss has been? ensured. An improvement in MT system scores with text processed using this tool is discussed.

Artykuł

Dostęp ograniczony

Document controversy classification based on the Wikipedia category structure

(2015) Jankowski-Lorek, Michał; Zieliński, Kazimierz

Dispute and controversy are parts of our culture and cannot be omitted on the Internet (where it becomes more anonymous). There have been many studies on controversy, especially on social networks such as Wikipedia. This free on-line encyclopedia has become a very popular data source among many researchers studying behavior or natural language processing. This paper presents using the category structure of Wikipedia to determine the controversy of a single article. This is the first part of the proposed system for classification of topic controversy score for any given text.

Artykuł

Dostęp ograniczony

Data mining and neural network simulations can help to improve deep brain stimulation effects in Parkinson’s Disease

(2015) Szymański, Artur; Kubis, Anna; Przybyszewski, Andrzej Wojciech

Parkinson’s Disease (PD) is primary related to substantia nigra degeneration and, thus, dopamine insufficiency. L-DOPA as a precursor of dopamine is the standard medication in PD. However, disease progression causes L-DOPA therapy efficiency decay (on-off symptom fluctuation), and neurologists often decide to classify patients for DBS (Deep Brain Stimulation) surgery. DBS treatment is based on stimulating the specific subthalamic structure: subthalamic nucleus (STN) in our case. As STN consists of parts with different physiological functions, finding the appropriate placement of the DBS electrode contacts is challenging. In order to predict the neurological effects related to different electrode-contact stimulations, we have tracked connections between the stimulated part of STN and the cortex with the help of diffusion tensor imaging (DTI). By changing a contacts number and amplitude of stimulus (proportional in size to stimulated area), we have determined connections to cortical areas and related neurological effects. We have applied data mining methods to predict which contact (and at what amplitude) should be stimulated in order to improve a particular symptom. We have compared different data mining methods: Wekas Random Forest classifier and Rough Set Exploration System (RSES). We have demonstrated that the Weka classifier was more accurate when predicting the effects of stimulations on general neurological improvements, while RSES was more accurate when using specific neurological symptoms. We have simulated other effects of stimulation related to the interruption of pathological oscillation in the basal ganglia found in PD. Our model represents possible STN neural population with inhibitory and excitatory connections that have pathologically synchronized oscillations. High-frequency electrical stimulation has interrupted synchronization. Something that is also observed in PD patients.

Pełny opis