How does the Concordance Search work in SDL Trados Studio?
This article explains some of the options and settings for Concordance Search that are available in SDL Trados Studio:
A common question users have when using concordance is why the Concordance Search does not find an expected match in the translation memory. This question can be paraphrased or simplified by asking "Why is the concordance search not searching through the complete translation memory?"
The most important observation around concordance searching is that the Concordance Search is NOT a full-text search through your complete translation memory, rather it is a fuzzy search that has been designed to return meaningful results in large data sets as quickly as possible. If theConcordance Search was a traditional full-text search, performance would be affected negatively as the entire translation memory would need to be searched sequentially and completely before the list of matches would be displayed.
To avoid this performance decrease, the Concordance Search is based on a search algorithm that works in the same way as a fuzzy search and is split into three phases as follows:
Although you can use Concordance Search for words and characters, the algorithm is optimized for searching phrases. This means that phrase searching will typically return the best results.
If you want to run a full-text search for specific words/phrases through the entire translation memory, we recommend that you use the translation memory view in SDL Trados Studio that supports full-text searches.
This is the degree of match that must exist between the search text you have selected and the matching text fragment in a translation memory segment in order for the translation to be offered as a match. The default is 70% but you can make the limit higher or lower.
If you increase the number of hits in the Maximum number of Hits options for the Concordance Search, this will include more matching candidates in the scoring set. The candidate search also has scores but these are independent from the match scores. This means that a “good” candidate score can lead to a “not as good” match score.
For example, if you search a word in the Concordance Search and set the Maximum number of hits to 10, and you receive 10 candidates for the match scoring that have all the same (good) candidate score, it could be that the last good hit in the results list for example receives a match score of 90%. But if you run a search and set the Maximum number of hits to 11 hits and candidate #11 has a score of 95% then this hit will be listed better/higher in the results list than #10 that had a score of 90%.
Concordance search works in two steps:
In a situation where you have a translation memory containing 1000 segments including the word
SDL Trados Studio 2009 (re-)introduced the character-based concordance search feature in addition to the existing word-based concordance search. To use the character-based concordance search, you need to select the Enable character-based concordance search option during the creation of a translation memory. This can only be done during the creation stage of a translation memory (and is not reversible). There are two main aspects to be aware of when enabling this feature:
Enabling character-based concordance search for a translation memory allows SDL Trados Studio to index groupings of characters within a word. This results in more partial (fuzzy) matches and bigger indexes. For example, if you enable Character-based Concordance Search and you search a translation memory for the word