
Differences

This shows you the differences between two versions of the page.

resources:cerl_thesaurus:editing:mergingunmarked [2018/06/18 12:12] – created jahnke
resources:cerl_thesaurus:editing:mergingunmarked [2020/01/23 09:05] (current) jahnke
Line 1:
 ====== How to merge records which have not been marked for deduplication ======
  
-Each time a new file is added to the CERL Thesaurus, there is an algorithm run over the data that checks if there are records in the new data that would produce a duplicate entry for an entity already existing in the database. However, since there is sometimes very little information associated with either the new or the existing record, this algorithm might miss a hit here and there. If you come across a duplicate entry in the search that has not yet been marked for deduplication, you can add those marks yourself and merge the records manually. For example:
+Each time a new file is added to the CERL Thesaurus, there is an algorithm running over the data and checking if there are records in the new data that would produce a duplicate entry for an entity already existing in the database. However, since there is sometimes very little information associated with either the new or the existing record, this algorithm might miss a hit here and there. If you come across a duplicate entry in the search that has not yet been marked for deduplication, you can add those marks yourself and merge the records manually. For example:
  
 {{ :resources:cerl_thesaurus:editing:dedupman1.png?700 |Identical records in search result}}
 resources/cerl_thesaurus/editing/mergingunmarked.txt · Last modified: 2020/01/23 09:05 by jahnke

 

 
