Natural Language Processing for Online Applications: Text Retrieval, Extraction, and Categorization

John Benjamins Pub., 2002 - Computers - 225 pages

This text covers the emerging technologies of document retrieval, information extraction, and text categorization in a way which highlights commonalities in terms of both general principles and practical issues. It seeks to satisfy a need on the part of technology practitioners in the Internet space, faced with having to make difficult decisions as to what research has been done an what the best practices are. It is not intended as a vendor guide (such things are quickly out of date), or as a recipe for building applications (such recipes are very context-dependent). But it does identify the key technologies, the issues involved, and the strengths and weaknesses on evaluation in every chapter, both in terms of methodology (how to evaluate) and what controlled experimentation and industrial experience have to tell us.

From inside the book

Results 1-3 of 33

Page 46
... precision and recall Relevant Non - relevant Retrieved Not retrieved a b a + b = m C d c + d = N - m a + c = n b + d = N - n a + b + c + d = N Thus recall can be thought of as the ' hit ratio ' , the proportion of target docu- ments ...

Page 112
... recall was much more important than precision , so editors would be prepared to tolerate a certain number of false positives in order to ensure high recall . In other applications , such as scanning the news for events of interest , ...

Page 158
... Recall and precision have been adapted to text classification . Precision is the proportion of documents for which the classifier correctly assigned cate- gory c ; and is given by Pi = TP ; mi Recall is the proportion of target document ...

Where's the rest of this book?

CHAPTER 2	21

CHAPTER 3	75

CHAPTER 4	116

Copyright

2 other sections not shown

Other editions - View all

Natural Language Processing for Online Applications: Text Retrieval ...
Peter Jackson,Isabelle Moulinier
Limited preview - 2007

Natural Language Processing for Online Applications: Text retrieval ...
Peter Jackson,Isabelle Moulinier
Limited preview - 2007

Natural Language Processing for Online Applications: Text Retrieval ...
Peter Jackson,Isabelle Moulinier
Limited preview - 2002

View all »

Common terms and phrases

algorithm analysis anaphora applications approach assigned automatic Boolean Chapter classifiers cluster collection combination computed Conference contain context corefer coreference court decision tree docu document retrieval estimate evaluation example FASTUS filtering finite frequency given grammar identify information extraction information retrieval linear classifiers linguistic Machine Learning match measure Message Understanding Conference methods Microsoft Naïve Bayes named entity Natural Language Processing non-relevant NOT-A-NAME noun groups noun phrase occur parser parsing patterns performance probabilistic probability problem Proceedings pronoun proper names query expansion query terms ranked retrieval recall and precision regular expressions relevance feedback relevant documents represent rules score search engine Section semantic sentence Sidebar simple statistical structure summary syntactic Table tagged taggers task techniques template text categorization text classification text mining tf-idf tion topic TREC typically vector space vector space model weight vector words

Bibliographic information

Title	Natural Language Processing for Online Applications: Text Retrieval, Extraction, and Categorization Issue 5 of Natural language processing, ISSN 1567-8202 Volume 5 of Natural language processing, ISSN 1567-8202
Authors	Peter Jackson, Isabelle Moulinier
Edition	illustrated
Publisher	John Benjamins Pub., 2002
ISBN	9027249881, 9789027249883
Length	225 pages
Subjects	Computers › Artificial Intelligence › Natural Language Processing Computers / Artificial Intelligence / Natural Language Processing Language Arts & Disciplines / Linguistics / General

Export Citation	BiBTeX EndNote RefMan

About Google Books - Privacy Policy - Terms of Service - Information for Publishers - Report an issue - Help - Google Home

Books

Natural Language Processing for Online Applications: Text Retrieval, Extraction, and Categorization

From inside the book

Contents

Other editions - View all

Common terms and phrases

Bibliographic information