How do I process my Excel (XLSX) file containing embedded content such as HTML content - and convert HTML control elements to tags? For instance, my XLSX file contains these strings: <li><b>TextText: </b>TextTexTextTexTextTexTextText.</li> <p><b>TextText:</b> TextTexTextTexTextTexTextText.</b> and <b>Textagain</b> TextTexTextTexTextTexTextText.</p> |
To configure your Microsoft Excel 2007-2019 File Type in Trados Studio, make sure to enable the processing of embedded content by going to the Embedded content section of the File Type and selecting Enable embedded content processing as displayed below: Once this option is selected, a pre-set Tag definition rule is applied. This will address most of the HTML syntax. If you work in an earlier version of WorldServer or SDL Trados Studio, you will apply the Microsoft Excel 2007-2013 Studio File Type. To process your content with this filter, you need to enable the embedded content here as well the same way as described above. Note: for this file type, you can do this directly in WorldServer: In the Microsoft Excel 2007-2013 Studio File Type Default Filter Configuration there is no pre-set Tag Definition Rule, so you need to add this manually. 1- You can use the same Regular Expression as already pre-set in the Microsoft 2007-2019 File Type in Trados Studio 2021. The Tag Pair is Placeholder and the Regular Expression is: </?[\p{Ll}\p{Lu}]\w*[^<>]*> Note that you should first add "sdl" as Document structure information (it is already there per default). Your Embedded Content configuration should look like this: 2- Alternatively, you can use this Regular Expression as Tag Pair: Start Tag: <[a-z][a-z0-9]*[^<>]*> End Tag: <V[a-z][a-z0-9]*[^<>]*> 3- Another possible configuration of the embedded content could be of Tag Type Placeholder with Start Tag: (\<(/?[^\>]+)\>) This is how it should look like: 4- Once you have adapted your Filter/File Type and your Filter Configuration, use the Preview function in Trados Studio to view the results by browsing to the source Excel file and then clicking on Preview. 5- If you are happy with the results, close the Preview and click OK to save your changes. 6- Export the file type configuration to an sdlftssettings file and import it to WorldServer. You might need to adapt your embedded content filter settings for perfect results. The suggested configurations might not address all of your embedded content. Note: if you work in versions 11.1.1 to 11.3.x of WorldServer, due to a defect with defect ID CRQ-6594 (which will be fixed in WorldServer 11.4.), you will be able to configure the embedded content section only by working on it in Trados Studio 2017 or 2019, exporting the *.sdlftsettings file and then importing it to WorldServer following the steps described in this article: How do I import a file type configuration from Trados Studio to WorldServer? Starting from WorldServer 11.7., all file type configurations will need to be done in Trados Studio and imported into WorldServer. |