Issue Details (XML | Word | Printable)

Key: KATTA-49
Type: Improvement Improvement
Status: Open Open
Priority: Major Major
Assignee: Unassigned
Reporter: Erich Nachbar
Votes: 3
Watchers: 4
Operations

If you were logged in you would be able to see more operations.
Katta

Introduce a refreshIndex method

Created: 22/Apr/09 10:17 PM   Updated: 08/Jan/10 07:55 AM
Component/s: None
Affects Version/s: 0.5
Fix Version/s: 0.7

Environment: N/A


 Description  « Hide
The current redeploy method is a combination of remove and deploy making the index unavailable until the deployment is finished.

It would be a great addition if an already existing index could be refreshed:

  • Katta would examine the HDFS index directory to see, which shards are there
  • Depending on the shard directory name it could deduct if the shard is already present on the nodes and doesn't need to be copied or
  • If there are new shards that are not already distributed. It would automatically start copy them over to the shards.
  • After the deployment is done it switches all search traffic to the refreshed index automatically.

Assumptions:

  • It is okay for refreshed indexes to have deleted events not automatically removed (or the client needs to filter those out through an add'l deleted field)
  • Shard names are unique and incremental updates into the shard directory will have a unique name to avoid collisions.

This would be a great enhancement considering that it would cut down deployment time for large indexes dramatically and simplify the overall client code (the current API makes it necessary to have multiple rolling indexes to incorporate updates).



 All   Comments   Change History   git Commits      Sort Order: Ascending order - Click to sort in descending order
There are no comments yet on this issue.