Apache OpenOffice (AOO) Bugzilla – Issue 80888
Titles of some html files become to garbage.
Last modified: 2013-08-07 14:38:26 UTC
input "http://www.baidu.com" in the URL field of IE in the menu: File->Save as Save it to local disk. open writer, file ->open Open the html file just saved. The title is garbage. see attachment for details.
Created attachment 47668 [details] the Chinese character title comes to garbage
Reassigned to ES.
@HDU: can you help analyzing this please? I don't know if it's a question of encoding (gb2312), fonts or locale... iE and Firefox also display trash in the title but Firefox on the SunRay displays the title correctly...
Created attachment 47743 [details] HTML file which shows the problem
The HTML file starts directly with <html><head><title>GB2312_ENCODED_TITLE_BYTES</title> <meta http-equiv=Content-Type content="text/html;charset=gb2312"> Since the html header's title doesn't mention its encoding directly, OOo's html import code has to guess the encoding. The import code seems to simply use the thread specific encoding, which is obviously not sufficient in this case. Tweaking the html import heuristic to work around this particular problem doesn't sound too difficult.
@AMA: please have a look.
.
Created attachment 49680 [details] aspect in Simplified_Chinese version (XP)
Hello redflagzhulihua, *, during my TCM test I stumbled (again ... ;) ) upon one of your issues .. ;) I tested it with the Germanophone version of OOO330m4 under Debian SID/Experimental AMD64, and here it looks like your attached Snap1.jpg :) I have to add, that I went to www.baidu.com with Firefox 4.0b4, saved the whole page on my harddisk and then opened it in OOo. Would you be so kind to test it on your system(s?) as well and report back, if this issue is fixed there, too? And it would be nice, if you could close this issue, if it is the case ... ;) HTH Thomas.
Hi Thomas, Again, Thank you for concern. I tested again, and can not see the problem any more. I'll close this issue.
closing...