Introduction to Information Retrieval (英語) ハードカバー – 2008/7/7
Kindle 端末は必要ありません。無料 Kindle アプリのいずれかをダウンロードすると、スマートフォン、タブレットPCで Kindle 本をお読みいただけます。
Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students in computer science. Based on feedback from extensive classroom experience, the book has been carefully structured in order to make teaching more natural and effective. Slides and additional exercises (with solutions for lecturers) are also available through the book's supporting website to help course instructors prepare their lectures.
'This is the first book that gives you a complete picture of the complications that arise in building a modern web-scale search engine. You'll learn about ranking SVMs, XML, DNS, and LSI. You'll discover the seedy underworld of spam, cloaking, and doorway pages. You'll see how MapReduce and other approaches to parallelism allow us to go beyond megabytes and to efficiently manage petabytes.' Peter Norvig, Director of Research, Google Inc.
'… this book sets a high standard …' Natural Language Engineering
'Introduction to Information Retrieval is a comprehensive, authoritative, and well-written overview of the main topics in IR. The book offers a good balance of theory and practice, and is an excellent self-contained introductory text for those new to IR.' Computational Linguistics
'This book provides what Salton and Van Rijsbergen both failed to achieve … Even more important, unlike some other books in IR, the authors appear to care about making the theory as accessible as possible to the reader, on occasion including short primers to certain topics or choosing to explain difficult concepts using simplified approaches. … its coverage [is] excellent, the quality of writing high and I was surprised how much I learned from reading it. I think the online resources are impressive.' Natural Language Engineering
I knew from the free sample that this book was what I was looking for. Thinking this would be a completely a new field to me, I was surprised how much I already knew. Some of it is not relevant to corpus linguists (result ranking for example), but if you're a corpus linguist and want to build an index for your corpus, I doubt you'll find a better book than this.
And the Kindle edition is done well, which is not always the case. Websites are hyperlinked and you can jump to the next or previous section with the 5-way controller.
This book not only describes how to build a search engine (including crawling, indexing, ranking, classification, and clustering), but also has many of the insights you can only get from lengthy experience using these techniques at large scale.
Definitely my new favorite book on search. If you work in search or just have an interest in the field, it is a great read.