Distributed search engine using dynamic passive replication for load balancing
A distributed search engine that creates dynamic replicas based on frequencies of search terms and categories from a particular location.
A single central server is set up initially(say in USA), and receives search queries from users
across the world. Now if many users in India query for a certain topic say soccer, the central
server sets up a replica in India dynamically containing data pertaining to only soccer and
related terms. All requests containing soccer or similar queries from India now go to the
replica in India. Now in case the replica in India doesn’t receive relevant queries for a long
time/has to include more indices, the master server deletes the idle indices from the data in the dynamically created replica in India. Furthermore, the master server should have a backup server running to take over as master in case of failure(fault-tolerant) and hence any metadata pertaining to the dynamic replicas should be sequentially consistent.