1- Under Tools/Translation Memories, select the Translation Memory that you want to work on.
2- In the Translation Memory page, in the Search section, select the check box labeled Duplicates. Also make sure that the Search Type is set to Standard, that the search is in Source, and that the Minimum Score is set to 0%.
3- By default, WorldServer searches for duplicates only in the Source Entry. To ensure that only real duplicates are deleted, you need to modify this setting. Click the Duplicates link and select the columns to compare, so that true duplicates are found according to your criteria.
4- To include both the source and target entries, add Target Entry alongside the Source Entry attribute. If you want to take context information into consideration in your Duplicates search, add Previous Source Entry and Next Source Entry. Once you have added the criteria you need, click OK to confirm the changes.
5- Now select the desired language pair and enter * to run a search for all TM entries that are duplicated. This search returns all the duplicates, filtered according to the selected attributes.
Note: You can also click the 'Choose Columns...' link to make all desired columns visible in the search results, if not already present. For example, you can include columns like Created On, Created By, Modified On, Modified By, or Entry Origin.
6- Review your results and, if you are sure, click Delete All Duplicates. Only the latest TM entry in each duplicate set will be preserved.
Important: Deletions cannot be undone. As a safety measure, especially if you are unsure about deleting all duplicates, export them first using the Export Result Set... button before clicking Delete All Duplicates. The duplicates are exported as a TMX file and can be re-imported if needed.
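Before deleting, it can help to confirm that the exported TMX file really contains the entries you expect. The following is a minimal Python sketch that reads translation units from TMX text using the standard library. The function name read_tmx_units and the sample content are illustrative assumptions, not part of WorldServer, and a real export may carry additional attributes and inline tags.

```python
import xml.etree.ElementTree as ET

# xml:lang lives in the reserved XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def read_tmx_units(tmx_text):
    """Return one {language: segment text} dict per <tu> element.

    Hypothetical helper for inspecting an export; not a WorldServer API.
    """
    root = ET.fromstring(tmx_text)
    units = []
    for tu in root.iter("tu"):
        segments = {}
        for tuv in tu.iter("tuv"):
            # TMX 1.4 uses xml:lang; fall back to a plain lang attribute.
            lang = tuv.get(XML_LANG) or tuv.get("lang")
            seg = tuv.find("seg")
            # itertext() also collects text that surrounds inline tags.
            segments[lang] = "".join(seg.itertext()) if seg is not None else ""
        units.append(segments)
    return units


# Made-up sample standing in for an exported result set.
SAMPLE_TMX = """<tmx version="1.4">
  <header creationtool="example" creationtoolversion="1.0" segtype="sentence"
          o-tmf="example" adminlang="en-US" srclang="en-US" datatype="plaintext"/>
  <body>
    <tu>
      <tuv xml:lang="en-US"><seg>Save the file.</seg></tuv>
      <tuv xml:lang="de-DE"><seg>Speichern Sie die Datei.</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="en-US"><seg>Save the file.</seg></tuv>
      <tuv xml:lang="de-DE"><seg>Datei speichern.</seg></tuv>
    </tu>
  </body>
</tmx>
"""

units = read_tmx_units(SAMPLE_TMX)
print(len(units), "translation units found")
```

For a file on disk, ET.parse(path).getroot() can replace ET.fromstring. Comparing the unit count with the number of hits shown in the search results is a quick way to check that the backup is complete.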
Hint: The Minimum Score and Maximum # of Hits page settings are not considered. All matching TM entries will be deleted regardless of these settings.
All TM entries that share the same values for all the selected attributes (steps 3 and 4) are returned as duplicates. If you are searching for duplicates based on specific source or target entry text, note that the Source Entry and Target Entry attributes require an exact match for entries to be considered duplicates. TM entries do not qualify as duplicates if they differ in whitespace, punctuation, capitalization, placeholders, or numbers.
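The effect of the selected attributes can be illustrated with a small Python sketch. It is not WorldServer's implementation. The field names (source, target, modified_on) and the assumption that "latest" means the most recent modification date are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def find_duplicate_sets(entries, attributes):
    """Group entries whose selected attributes are exactly equal."""
    groups = defaultdict(list)
    for entry in entries:
        # Exact comparison: no trimming, case folding, or other normalization.
        key = tuple(entry[name] for name in attributes)
        groups[key].append(entry)
    return [group for group in groups.values() if len(group) > 1]


def split_keep_latest(duplicate_set, date_field="modified_on"):
    """Return (kept, to_delete); keeps the entry with the greatest date.

    Assumes ISO-formatted date strings, which sort chronologically.
    """
    ordered = sorted(duplicate_set, key=lambda e: e[date_field])
    return ordered[-1], ordered[:-1]


# Made-up TM entries; entry 4 differs from the others only in capitalization.
entries = [
    {"id": 1, "source": "Click OK.", "target": "Klicken Sie auf OK.", "modified_on": "2021-03-01"},
    {"id": 2, "source": "Click OK.", "target": "Klicken Sie auf OK.", "modified_on": "2022-07-15"},
    {"id": 3, "source": "Click OK.", "target": "Auf OK klicken.", "modified_on": "2020-01-10"},
    {"id": 4, "source": "click OK.", "target": "Klicken Sie auf OK.", "modified_on": "2023-02-02"},
]

source_only = find_duplicate_sets(entries, ["source"])
source_and_target = find_duplicate_sets(entries, ["source", "target"])

for label, sets in (("source only", source_only), ("source + target", source_and_target)):
    for duplicate_set in sets:
        kept, to_delete = split_keep_latest(duplicate_set)
        print(label, "keeps", kept["id"], "and deletes", [e["id"] for e in to_delete])
```

In this sample, comparing on the source alone puts entries 1, 2, and 3 in one duplicate set, so the distinct translation in entry 3 would be removed. Comparing on source and target together limits the set to entries 1 and 2. Entry 4 is never grouped with the others because its source differs in capitalization.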