In the context of this workshop, 3 talks will be done. First, we will present an overview of NLP (Natural Language Processing) approaches in order to mine textual data [PART 1]. The second part will present two NLP tasks dedicated to Health domain: (i) acquisition of tweets, (ii) terminology extraction in (social) media [PART 2]. The last part of this workshop is dedicated to textual classification issues based on machine learning techniques with Weka [PART 3]. A part of this presentation will use a public dataset: https://dataverse.cirad.fr/dataset.xhtml?persistentId=doi:10.18167/DVN1/POIZMA.
显示更多 [+] 显示较少 [-]