Solr Training Program India Starting Solr (1 .5 day session)Overview

Solr Training Program India Starting Solr

 (1 .5 day session)


After taking this course you will be able to configure and deploy Solr, run a wide range of queries including queries with facets, and index documents with Solr. You will learn about inverted index, about tokens, token filters, Solr schema, analysis,highlighting, query parsing.

For whom

The course is designed for technical attendees of any knowledge level and is aimed at those who need to configure, tune and manage Solr and have only basic Solr knowledge. No prior Solr experience is required. Experience with Linux systems is a must, but basic familiarity with running shell commands (e.g., using curl command) is good.

Course Outline

Getting Started with Solr

      • What is Apache Solr

      • General principles

      • Architecture types


  1. Introduction to Solr

    • Starting Solr with schema-less configuration

    • Inverted index

    • Relevancy basics

    • Indexing documents

    • Retrieving documents by identifier

    • Searching for documents

    • Deleting documents

    • Lab

      • Using start scripts

      • Working with configuration

      • CRUD operations


  2. Indexing Data

    • Data structure

    • Index structure configuration

    • Defining custom field types

    • String vs Text based types

    • Basic field usage examples

    • Tokenizers

    • Char filters

    • Filters

    • Language oriented data

    • Dynamic fields

    • Copy fields

    • Running Solr with our own configuration

    • XML data format explained

    • JSON data format explained

    • CSV data format explained

    • Batch indexing

    • Doc values

    • Additional field properties

    • Nested documents support

    • Lab

      • Creating fields and types structure

      • Using copy fields

      • Using Solr language analysis capabilities

      • Indexing data in various format


  3. Searching

    • Simple URI search

    • Paging

    • Sorting

    • Filters

    • Choosing display fields

    • Pseudo fields

    • Debug query

    • Lucene query language

    • Standard query parser

    • Dismax query parser

    • Extended dismax query parser

    • XML query parser

    • Examples of other parsers

    • Timing out searches

    • Using cursor for deep paging

    • Nested documents support

    • Dealing with relevancy

    • Lab

      • Paging

      • Sorting

      • Term searching

      • Using various query parsers

      • Using cursor


  4. Data Analysis

    • Introduction to faceting

    • Basic use cases

    • Field faceting

    • Field prefix faceting

    • Sorting faceting results

    • Limiting faceting

    • Faceting execution control

    • Range faceting

    • Query faceting

    • Hierarchical faceting

    • Interval faceting

    • Lab

      • Building tag cloud using field faceting

      • Using prefixes to build simple autocomplete feature

      • Sorting faceting results

      • Working with numerical data and faceting

      • Using hierarchical faceting to get more insight into the data

      • Interval faceting


  5. JSON Facets

    • Introduction to JSON request API

    • Facet functions

    • Nested JSON facets

    • Execution type

    • Lab

      • Searching using JSON request API

      • Finding top tags

      • Retrieving statistics using range faceting

      • Using terms JSON facets to retrieve term counts

      • Using functions with JSON facets

      • Nesting JSON facets


  6. Highlighting and More Like This

    • Introduction to highlighting

    • Highlighting query hits

    • Specifying fields to highlight

    • Choosing highlighting tags

    • Using FastVectorHighlighter

    • Using PostingsHighlighter

    • Finding similar documents

    • Prerequisites for More Like This functionality

    • Configuring More Like This functionality

    • Lab

      • Highlighting field matches

      • Using own tags for matching highlighted fragments

      • Using various parsers with highlighting

      • Using different query for highlighting and matching

      • Finding documents similar to a given one

      • Using term frequency and length to find similar documents