XII edycja - abstrakty




Główna | Organizacja | Abstrakty | Program | Ankieta |


Prezentujemy listę referentów oraz tytułów wystąpień wraz ze streszczeniami. Zapraszamy do lektury!
Przypominamy, że język, w którym podany jest tytuł, jest równoznaczny z językiem całego wystąpienia.
Pod każdym streszczeniem widoczny jest skrócony program (aktualny na 19.10.2020), dzięki któremu można dowiedzieć się, jakie inne referaty będą wygłaszane podczas danej sekcji.

poprzednie streszczenie | lista streszczeń | następne streszczenie

Karolina Jankowska (Uniwersytet im. Adama Mickiewicza w Poznaniu)

Trial use of the WordNet and NLTK as tools for keywords' synonyms based information search



As with the current information overload it is a great challenge to find the information that a user actually needs. There are numerous ideas how to overcome the problem with getting right to the useful sources without spending time reading huge volumes of data i.e. on the Internet. Among the most popular and relatively easy to conduct is context and synonyms-based searching. However, in order to perform such task a very good quality synonyms, collocation corpus is necessary. In fact there are many such tool available online, however not all of them are free to use or can be downloaded or synchronised with any software. A good example of ready to use and integrate English corpus is WordNet. On the other hand, there are many Natural Language Processing (NLP) tools available for linguistics in order to make their job easier and more effective such as Natural Language Toolkit (NLTK).
The purpose of this paper is to present the trial use of WordNet as a source linguistic data and the use of NTLK as a tool for effective NLP tasks. A simple code in Python programming language has been developed that performs searching text by a keyword and the keyword's synonym from WordNet. The lists of synonyms found via WordNet have been compared to results from Synonyms.com. Furthermore, for this very trial, a text corpus has been created. It contains 100 articles in the English language copied from available online sources, normalised and stored in a set of .txt files containing crucial metadata. The general conclusion from the experiment is that the results are not satisfying and searching based on synonyms only is of low value. 



Gdzie i kiedy:

W tym samym czasie ...
sekcja 5
sekcja 6
moderacja: Julia Augustyniak
14:40-15:00Krystian Kamiński (Uniwersytet Marii Curie-Skłodowskiej w Lublinie) - To Talk or to Sign? - How Foreign Language Teachers Communicate With Deaf Learners
15:00-15:20(okienko)
15:20-15:40Karolina Jankowska (Uniwersytet im. Adama Mickiewicza w Poznaniu) - Trial use of the WordNet and NLTK as tools for keywords' synonyms based information search
15:40-16:10dyskusja