Stories
Slash Boxes
Comments
NOTE: use Perl; is on undef hiatus. You can read content, but you can't post it. More info will be forthcoming forthcomingly.

All the Perl that's Practical to Extract and Report

use Perl Log In

Log In

[ Create a new account ]

blazar (7356)

blazar
  (email not shown publicly)
http://blazar.perlmonk.org/
Yahoo! ID: bik.mido (Add User, Send Message)
Jabber: blazar@jabber.org

Journal of blazar (7356)

Friday April 06, 2007
10:00 AM

What's this? (Chinese? Japanese? And what does it mean?)

[ #32931 ]

The following text is extracted from a popup that escaped FF's blocker. Actually, it was broken and showed me the source code of the script running under the intended page, instead of the page itself. In it, I could recognize the stringified version of a Perl hashref, which sounds very suspect and makes me think of a program that is generated by something else itself. Whatever, I am so ignorant of web development matters...

Another thing I could see in that "page" was a long string of sequences like "%u42fc", thus I guessed they were unicode specifications, in hexadacimal. Then I wrote a tiny script to translate that string into HTML entities, so that I could paste it here; the result is as follows:

䋼幪��⑴寴玁㤓碰菇ﳫꃒ财礊縞넅⤳㶚뗒Ⲑ俆랇⥵廡䷺湀⦠헡◠⹪蒼Ṫꊨ픹ᗓ⩪冾귢圫巁ꉛ䪯쒓ষꋱ州ꉛ㐓炮⇔Ƹᯥ쁂Ƹꋾ픬෦鷢족鿢햇ꊨ䊧뼠 픠⪠픢ꉚ�❦蹪ⷊ뚸⧰廡೬훶ꊶꋺ㐹炡按⦠�㩦ᢷᜠ⭡꥚�꫾縍纋훳늶ⵧˢ߁馄⫤⛥⧅淡祠න秶ञꉜ฽훳꺶훰ꪶ�屠༘翲ꆲ獲 읂黒셣ꇇ홟㮦秔㆓棃㪅䳒ⶒ溠⪄僳⪒䓅㞥䳒⪂寏ᾘ纠よ凥㶄沠㞙緔ⲉ䣅庅䛬㪀䃬ⲃ寁ᾘ岠㊓䛍序篵᪭廏㊏䣏અ濏㊈棅꿡쑍ﴈ鼖鸎鸍i鸌鼎ꤑ쑊 葊ꉖ핁쟸

The fact that the resulting charachters are somewhat homogeneous makes me strongly think my guess was right. Otherwise I suspect I would have more easily got text from many different alphabets. However I wonder what language it is, and what it means, if it means anything at all... in the hope that's nothing offensive or outrageous. But perhaps it's only a piece of an advertisement for Mr. Sparkle!

The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More | Login | Reply
Loading... please wait.
  • I tried looking at it in a number of East Asian encodings, but couldn't get anything which looked like it was text. Perhaps it is random digits.