一些蛮好的信息检索(IR)的参考资料

来源:互联网 发布:landsat7数据预处理 编辑:程序博客网 时间:2024/05/06 23:29

InformationRetrieval Resources

Information on Information Retrieval (IR) books, courses, conferences andother resources.

Books on Information Retrieval (General)
Introduction to Information Retrieval.C.D. Manning, P. Raghavan, H.Schütze. Cambridge UP, 2008. Classical and web information retrieval systems:algorithms, mathematical foundations andpractical issues.
Modern Information Retrieval. R. Baeza-Yates, B.Ribeiro-Neto. Addison-Wesley, 1999. Widely used and cited.
Information Retrieval: Algorithms and Heuristics.D.A. Grossman, O. Frieder. Springer,2004. Excellent textbook.
Managing Gigabytes.I.H. Witten, A. Moffat,T.C. Bell. Morgan Kaufmann, 1999. The authority onindex construction and compression.
Finding Out About. R. Belew. Cambridge UP,2001. More suitable for undergraduate classes thanother books listed here.
Information Retrieval:A Health and Biomedical Perspective. W.R. Hersh. Springer, 2002. As the title says: ahealth/biomedical perspective.
TREC: Experiment and Evaluation in InformationRetrieval. E.M. Voorhees, D.K. Harman. MITPress, 2005. A survey of recent research results.
Language Modeling for Information Retrieval. W.B.Croft, J. Lafferty. Springer, 2003. Language models areof increasing importance in IR.
Readings in Information Retrieval.K. Sparck Jones, P. Willett. Morgan Kaufmann, 1997.A collection of classical IR papers.
Recommended Reading for IR Research Students.A. Moffat, J. Zobel, D. Hawking. SIGIR Forum, 39(2), 2005. Not a book, but a collection of seminal papers, more up-to-datethan Sparck-Jones et al.
Information Storage and Retrieval Systems.G. Kowalski, M.T. Maybury. Springer, 2005."... takes a system approach, discussing all aspects of an InformationRetrieval System."
The Geometry of Information Retrieval.C.J. van Risjbergen. Cambridge UP, 2004. Am ambitious attemptto develop quantum mechanics as a new foundation for IR.
Introduction to Modern Information Retrieval.G.G. Chowdhury. Neal-Schuman, 2003.Intended for students of library and information studies.
Text Information Retrieval Systems.C. Meadow, B. Boyce, D. Kraft. Academic Press, 2000.Also takes a library/information science perspective.
More Books

Books on Web Information Retrieval
Mining the Web: Analysis of Hypertext and Semi Structured Data.S. Chakrabarti. Morgan Kaufmann, 2002. The bestintroduction for web-centric IR.
Google's PageRank and beyond: The science of Search Engine Rankings.Amy N. Langville, Carl D. Meyer. Princeton University Press, 2006. More focused on the algorithms of PageRank, but also covers general web IR.
Modeling the Internet and the Web: Probabilistic Methods and Algorithms.P. Baldi, P. Frasconi, P. Smyth. Wiley, 2003.A bit terse. Recommended for those who have a good foundation inprobability theory, but are new to IR.

Good books for implementing a search engine
Managing Gigabytes (see above)
Building Search Applications: Lucene, Lingpipe, and Gate.M. Konchady.Mustru Publishing, 2008.
Lucene in Action.O. Gospodnetic, E. Hatcher.Manning Publications, 2004.
Spidering Hacks.K. Hemenway, T. Calishain.O'Reilly, 2003.

Online Books - Browsable
Introduction to Information Retrieval (see above)
Finding Out About (see above)
Information Retrieval. C. J. vanRijsbergen. Butterworths, 1979. The classic. Almost 40years old, but still worth reading.
Information Retrieval.T. van der Weide. 2004. Introduction to IR and hypertext.

Online Books - PDF
Introduction to Information Retrieval (see above)
Information Retrieval in Practice. B. Croft, D. Metzler, T. Strohman. Pearson Education, 2009. (two chapters)
Information Retrieval. C. J. vanRijsbergen. Butterworths, 1979.
Information Retrieval Interaction.P. Ingwersen. Taylor Graham, 1992. Focuses on userinteraction in IR.
Information Retrieval: A Survey.Ed Greengrass. 2000. Good survey of "classical" IR, but little orno coverage of recent work (e.g., language models, PageRank, SVMs).
Various tutorials at Mi Islita

Research Centers
CMU (LTI)
Dublin CU
Geneva (Viper)
Glasgow
Helsinki Institute for Information Technology
IBM
Illinois Institute of Technology
Information Retrieval Facility (IRF)
Microsoft Research
NIST
Peking
Pittsburgh
Queen Mary
Sheffield
UIUC
UMASS

Courses
Berkeley (SIMS)
CMU
Cornell
DePaul
IIT
Johns Hopkins I
Johns Hopkins II
Maryland
MPI
Otago
Princeton
Stanford
Stuttgart
Texas
UMASS

Problem Sets / Assignments
Bilkent
DePaul
Georgetown
Minas Gerais
North Texas
Stuttgart
Tennessee

Web Information Retrieval
webir.org
Search Engine Watch
Users' Guide to Web Searching
PageRank

Subareas, Applications, Methods
Information Retrieval & Extraction
Information Retrieval & Machine Learning
Text Mining & Web Mining
INEX: XML retrieval
Geographic Information Retrieval
Music Information Retrieval
CLIR & Multilingual Information Retrieval
Cross-LanguageInformation Retrieval (CLIR) Resources
N-Grams in Information Retrieval
Agent-based Information Retrieval
Audio Information Retrieval
Adversarial Information Retrieval

Conferences
TREC
Cross Language Evaluation Forum (CLEF)
SIGIR 2007 (last),SIGIR 2008 (next)
CIKM 2007,CIKM 2008
WWW 2008, WWW 2009
JCDL 2008, JCDL 2009
RIAO 2004, RIAO 2007
ECIR 2008, ECIR 2009
AIRS 2006,AIRS 2008
SPIRE 2007, SPIRE 2008
Norbert Fuhr's IR conference calendar

Journals
ACM Transactions on Information Systems (TOIS):dblphome
Information Processing and Management (IP&M):dblphome
Information Retrieval:dblphome
International Journal on Digital Libraries:dblphome
Journal of theAmerican Society of Information Science and Technology (JASIST):dblphome
SIGIR Forum:dblphome
Journal of Documentation
D-Lib Magazine
Data & Knowledge Engineering:dblphome
Information Processing Letters:dblphome
Information Research
Information Systems:dblphome
Journal of Intelligent Information Systems:dblphome
Knowledge and Information Systems:dblphome
Foundations and Trends in Information Retrieval:home

Popular Articles
Wikipedia: Information Retrieval
A. Singhal: Modern Information Retrieval: A Brief Overview
S.E. Robertson, K. Sparck Jones: Simple, proven approaches to text retrieval
Bruce Croft: What Do People Want From IR
Information Retrieval on the World Wide Web
Michael Lesk: The Seven Ages of Information Retrieval

Software
C. Middleton, R. Baeza-Yates: A Comparison of Open Source Search Engines (contains an up-to-date list of available search engine software)
Doug Oard's list of availabletext retrieval systems
Avi Rappoport:open source search engines
MySQL full text search
Text to Matrix Generator, a MATLAB toolbox for indexing, retrievaland other text processing tasks

Collections
U. of Glasgow list of availabletext retrieval collections
NLP/IR corpus list at NUS
NLP/IR corpus list at Edinburgh
SMART at Cornell (downloads of a number of collections, stop lists, SMART retrieval system etc.)
Internet archive(limited availability)
Linguistic Data Consortium

Professional Organizations
ACM SIGIR
BCS IRSG

Other Collections of Information Retrieval Links
ACM SIGIR
David Karger

Other Resources
Glossary (Modern Information Retrieval)
Information retrieval research links @ Search Tools
BUBL: Information Retrieval Links
LSU: Information Retrieval Systems
Open Directory: Information Retrieval Links
UBC: Indexing Resources
IR & Neural Networks, Symbolic Learning, Genetic Algorithms
Astop list(a list of stop words)
Chris Manning'sNLP resources
Weiguo Patrick Fan'stext mining links

原创粉丝点击