cloudant - What steps are performed by a 'Rescan'? -
to automatically warehouse documents cloudant dashdb, there schema discovery process (sdp) automates data migration you. when using sdp warehouse documents cloudant dashdb, there option 'rescan'.
i have used 'rescan' number of times, unclear on steps performs. steps performed 'rescan'? e.g.
- drop tables in dashdb target schema? tables?
- scan cloudant source database?
- recreate target schema?
- ...
- ...
the steps pretty suggested. rescan will
inspect discovered json schema , remove tables dashdb instance created load (leaving user defined tables untouched)
re-discover json schema again using current settings (including sample size, type of discovery algorithm etc.)
create new tables same dashdb target
ingest newly created tables data cloudant
subscribe
_changes
feed cloudant continuously synchronize document changes dashdb
all steps (except first) identical initial load rescan function.
the main motivation rescan support schema evolution. whenever document structure in cloudant source database changes, user can make conscious decision drop , re-create dashdb tables using rescan function. sdp won't automate process avoid potential conflicts applications depending on existing dashdb tables.
Comments
Post a Comment