Identify the language of an string.
-
Hi All, I am developing an SMS application using VC++ 2005 on Vista with Unicode character set. Is it possible to find the language (i.e, either Chinese, English, french etc etc) of the incomming string? That is, if I recieve an message in french language, I should be able to recognize the language as French just from the string itself. Are there any API's which will give me that info? Thanks in advance.
-
Hi All, I am developing an SMS application using VC++ 2005 on Vista with Unicode character set. Is it possible to find the language (i.e, either Chinese, English, french etc etc) of the incomming string? That is, if I recieve an message in french language, I should be able to recognize the language as French just from the string itself. Are there any API's which will give me that info? Thanks in advance.
Nope, nada, non. (*) For that you would need an infinite corpus for all/most languages, you'd also have to be able to analyze said language based on no "hint" of a language so you'd need to have an analyser for all languages, and imagine also if a sentence used multiple words for multiple languages ... I was thinking maybe CString could encode "something" to identify the string, but that you not make sense since there is nothing preventing you to use "latin" characters in a japanenese string and make a french sentence. (*) probably someone will prove to the contrary, but I don't think so, that's why all online translation ask for in (and out) languages.
Watched code never compiles.