Similar books
Apache Solr : A Practical Approach to Enterprise Search
Book by Dikshant Shahi
Build an enterprise search engine using Apache Solr: index and search documents; ingest data from varied sources; apply various text processing techniques; utilize different search capabilities; and customize Solr to retrieve the desired results. Apache Solr: A Practical Approach to Enterprise Search explains each essential concept, backed by practical and industry examples, to help you attain expert-level knowledge.

The book, which assumes a basic knowledge of Java, starts with an introduction to Solr, followed by steps for setting it up, indexing your first set of documents, and searching them. It then introduces you to information retrieval and its implementation in Apache Solr; this will help you understand your search problem, decide on the approach to building an effective solution, and use various metrics to evaluate the results.

The book next covers schema design and techniques for building a text analysis chain for cleansing, normalizing, and enriching your documents and addressing different types of search queries. It describes various popular matching techniques that are generally applied to improve the precision and recall of searches. You will learn the end-to-end process of data ingestion from varied sources, metadata extraction, pre-processing and transformation of content, various search components, query parsers, and other advanced search capabilities. After covering out-of-the-box features, Solr expert Dikshant Shahi dives into ways you can customize Solr for your business and its specific requirements, along with ways to plug in your own components. Most important, you will learn about implementations for Solr scoring, factors affecting the document score, and tuning the score for the application at hand. The book explains why textual scoring is not sufficient for practical ranking of documents and describes ways to integrate real-world factors that contribute to document ranking.

You'll see how to influence user experience by providing suggestions and recommendations.
You'll also see integration of Solr with important related technologies such as OpenNLP and Tika. Additionally, you will learn about scaling Solr using SolrCloud. This book concludes with coverage of semantic search capabilities, which are crucial for taking the search experience to the next level. By the end of Apache Solr, you will be proficient in designing and developing your search engine.