A user asked us how to process html code contained in an Excel file?
For example a cell containing: "<a href="#">Click here</a>.
A user asked us how to process html code contained in an Excel file?
For example a cell containing: "<a href="#">Click here</a>.
This is perfectly possible:
- Select your Excel configuration from "Settings" > "Microsoft Excel" > (name of your config)
- Go to the last section named "Untranslatable words or terms"
- Fill in this code, it is a regular expression:
</?\w+.*?>
Then test the result with the "Test configuration" link in the top of the page.
Enjoy!
Hi Stephan,
Thanks for this tip. I'm sure this will be useful, as we've got more and more requests that are similar.
One question though : does this regex include any text between "<" and ">"?
Hi Brahim,
It excludes the html markup only. Any text between markup is of course preserved.
Hope this helps,
Stephan
Hi Stephan,
It seems that the "Untranslatable words or terms" section is available in the Default Excel rule set and any customised "Monolingual" rule sets, but not in "Multilingual" rule sets.Could you please confirm this?
TIA.
Gareth
Hi Gareth,
Yes, indeed. I confirm you that it does apply to monolingual rule sets only, but we hope to extend it to multilingual rule sets soon.
Kind regards,
Thanks for the confirmation, Brahim.
When you say "soon", are we talking days, weeks, months... ? :-)
Best regards,
Gareth
Hi Gareth,
We added this to our suggestions list. This might not be released within the next weeks, as we are currently working on many developments that get a higher priority.
But this is something we would like to add to Wordbee in October.
Stephan Hello
We've been asked to translate an xls containing html code (website).
I created a new xls profile and marked HTML content but it still displays the code. <br>... <p style> etc...
sample herein:
"<p style=""direction: rtl;"">סקירה מהירה:<br />הנסי (Hennesy) הוא אחד ממשקאות הקוניאק הפופולריים והנמכרים ביותר בעולם. הנסי VS מתאפיין בניחוחות וטעמים של פירות, שקדים קלויים, ענבים וארומה עצית. אף על פי שמשמעות האותיות (VS (Very Special הינה לרוב יישון של בין 3 ל-5 שנים,<br />ה-Hennesy VS מתיישן במשך לא פחות מ-8 שנים.</p>
<p style=""direction: rtl;""> </p>
Any suggestion how to extract just the plain text?
Hi Isaac,
I know your message is addressed to Stephan, but I've had a similar problem before.
In addition to checking the "Content is html" check-box, you need to go to the "Do not translate" tab and enter the following regular expression in the "Words or terms" section: "<.+?>" (without the quotation marks; note that there is a dot/period after the "<").
I've created and tested an Excel profile with these settings and got the following results.
Hope this helps.
Gareth