Solr advanced Training India
After taking this course you will be able to configure and deploy Solr, run a wide range of queries including queries with facets, and index documents with Solr. In this course you will learn about query routing, results re-ranking, term vectors, schema API, custom similarity, merge policy, codecs, language identification, data import handler, advanced Solr and SolrCloud tuning and scaling, shard splitting, data migrations, handling a large number of collections, authentication, Solr and HDFS, and so on .
For whom
The course is designed for technical attendees experienced with Solr and looking to extending their Solr knowledge. A person should be able to index data to Solr, run queries, work with Solr analysis, use faceting, grouping, know basic Solr configuration and tuning principles. Experience with Linux systems is must
Course Outline
If you do not have prior Solr experience and you would like to take advantage of Solr advance training, please consider attendingSolr Startup and Solr and Cloudclasses.
Search Under Control
Routing
Index time routing
Query time routing
Basic syntax for local params
Parameter dereferencing
Using parameter dereferencing in handlers configuration
Using filters tagging
Using faceting exclusions
Re-ranking queries results
Lab
Indexing documents with routing
Running queries with routing
Using parameter dereferencing to create your own parameters in query
Tagging and excluding filters
Term Vectors
What are term vectors
Retrieving additional information from Solr
Understanding term vector component
Lab
Configure fields to use term vectors
Creating handlers that use term vectors
Retrieving term vectors when searching
Retrieving term positions and offsets
Configuring Solr Index
Custom similarity
Schema API
Managed resources
Lab
Using Schema API to retrieve information about collection structure
Adding new field type using Schema API
Adding new field using Schema API
Adding copy field using Schema API
Configuring Solr Internals
General solrconfig.xml sections
Lucene directory configuration
Schema factory settings
Codecs
Merge policy
Merge scheduler
Transaction log configuration
Replication
Update request processors
Language detection
Configuring logging
Slow query log
Lab
Configuring Solr to use managed schema
Preparing master – slave replication
Configuring slow logging
Data Import Handler
Configuring data import handler
Using data import handler
Entity processors
Transformers
Lab
Importing data from SQL database
Partial data import from SQL database
Importing data from XML files using
Streaming Aggregations
Streaming expressions basics
Stream sources
Stream decorators
Scheduling streams
SQL over MapReduce in SolrCloud
Export request handler
Lab
Searching using streaming aggregations
Merging two results streams
Retrieve unique documents based on a given field
Using scheduling streams
Expert Tuning Solr
Memory considerations
Indexing threads
Auto commit tuning
Caches
Replication throttling
Lab
Configuring indexing threads
Configuring auto commits
Throttle replication
Scaling Solr
Proper Solr master configuration
Proper Solr slaves configuration
Multiple masters architecture
Setting up Solr slaves for multiple masters
Indexing data in multimaster environment
Querying in multimaster environment
Expert SolrCloud
ZooKeeper role explained
Sharding and replicating
Cluster state explained
Shard splitting
Migrating data between collections
Working with large number of collections
Lab
Creating collection matching environment needs
Adding and removing replicas
Moving shards around the cluster
Adding shards to collection
Migrating data between collections
Other Solr Features
Enabling security in SolrCloud
Basic authorization
Other authorization options