Natural Language Processing for Social Media. Diana Inkpen. Читать онлайн. Newlib. NEWLIB.NET

Автор: Diana Inkpen
Издательство: Ingram
Серия: Synthesis Lectures on Human Language Technologies
Жанр произведения: Программы
Год издания: 0
isbn: 9781681738147
Скачать книгу
4.10 NLP-based Information Visualization for Social Media

       4.11 Government Communication

       4.12 Rumor Detection

       4.13 Recommender systems

       4.14 Preventing Sexual Harassment

       4.15 Summary

       5 Data Collection, Annotation, and Evaluation

       5.1 Introduction

       5.2 Discussion on Data Collection and Annotation

       5.3 Spam and Noise Detection

       5.4 Privacy and Democracy in Social Media

       5.5 Evaluation Benchmarks

       5.6 Summary

       6 Conclusion and Perspectives

       6.1 Conclusion

       6.2 Perspectives

       A TRANSLI: a Case Study for Social Media Analytics and Monitoring

       A.1 TRANSLI architecture

       A.2 User Interface

       Glossary

       Bibliography

       Authors’ Biographies

       Index

       List of Figures

       1.1 Social networks ranked by the number of active users as of January 2014 (in millions) provided by Statista

       1.2 Number of monthly active Facebook users from the third quarter of 2008 to the first quarter of 2014 (in millions) provided by Statista

       1.3 Number of LinkedIn members from the first quarter of 2009 to the first quarter of 2014 (in millions) provided by Statista

       1.4 A framework for semantic analysis in social media, where NLP tools transform the data into intelligence

       2.1 Methodology for tweet normalization. The dotted horizontal line separates the two steps (detecting the text to be normalized and applying normalization rules) [Akhtar et al., 2015]

       2.2 Taxonomy of normalization edits [Baldwin and Li, 2015]

       2.3 Arabic dialects distribution and variation across Asia and Africa [Sadat et al., 2014a]

       2.4 Division of Arabic dialects in six groups/divisions [Sadat et al., 2014a]

       2.5 Accuracies on the character-based n-gram Markov language models for 18 countries [Sadat et al., 2014a]

       2.6 Accuracies on the character-based n-gram Markov language models for the six divisions/groups [Sadat et al., 2014a]

       2.7 Accuracies on the character-based n-gram Naïve Bayes classifiers for 18 countries [Sadat et al., 2014a]

       2.8 Accuracies on the character-based n-gram Naïve Bayes classifiers for the six divisions/groups [Sadat et al., 2014a]

       3.1 Example of a pair of tweets extracted from the bilingual feed pair Health Canada/Santé Canada, after tokenization

       3.2 An original tweet with hashtags in its three possible regions

       4.1 Examples of annotated social media posts discussing ADRs [Nikfarjam et al., 2015]

       4.2 The DeepHealthMiner neural net architecture [Nikfarjam, 2016]

       4.3 SVM-based text mining procedure for impact management [Schniederjans et al., 2013]

       A.1 TRANSLI Social Media Analytics and monitoring module architecture

       A.2 TRANSLI user interface for event creation module

       A.3 TRANSLI user interface for event browsing module

       A.4 TRANSLI user interface to present an event. Components are identified with their IDs

       List of Tables

       1.1 Social media platforms and their characteristics

       2.1 Three examples of Twitter texts

       2.2 Examples of tokenization

       2.3 Penn TreeBank tagset

       2.4 POS tagset from Gimpel et al. [2011]

       2.5 Example of