
This website currently analyzes up to 75 different languages, depending on the encoding. To try it out, pick a sample text from the language list.


Start - choose your options

You can either paste text into the text area [1] or upload a text file [2]. For testing, you can also choose from the text files that have already been uploaded [3]. Finally, you can generate your own finger-print [4] of a language.


1. Enter some text to analyze
2. Upload a text file to analyze
3. Choose from a selection of text files
4. Generate your own finger-print
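
This page does not say how a finger-print is computed. As a rough illustration only, the sketch below assumes one common technique: a ranked profile of the most frequent character n-grams, compared with an "out-of-place" distance. The function names `fingerprint` and `out_of_place`, and the parameters `n` and `top`, are hypothetical and are not taken from this site's code.

```python
from collections import Counter


def fingerprint(text: str, n: int = 3, top: int = 300) -> list[str]:
    """Return the `top` most frequent character n-grams, most frequent first.

    Assumption: this is a generic n-gram profile, not this site's method.
    """
    # Lowercase and collapse all whitespace runs to single spaces.
    cleaned = " ".join(text.lower().split())
    # Pad with spaces so word boundaries at both ends are counted too.
    padded = f" {cleaned} "
    counts = Counter(padded[i:i + n] for i in range(len(padded) - n + 1))
    # Sort by descending count; break ties alphabetically for determinism.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:top]]


def out_of_place(doc_fp: list[str], lang_fp: list[str]) -> int:
    """Sum of rank differences; n-grams missing from lang_fp get a fixed penalty."""
    position = {gram: rank for rank, gram in enumerate(lang_fp)}
    penalty = len(lang_fp)
    return sum(
        abs(rank - position[gram]) if gram in position else penalty
        for rank, gram in enumerate(doc_fp)
    )
```

Under these assumptions, a text would be assigned to the language whose stored finger-print yields the smallest `out_of_place` value.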



If the analysis fails, try another encoding option. Otherwise, keep the default setting.
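
If you process text yourself before submitting it, the same trial-and-error idea can be sketched as follows. This is a minimal example, not this site's implementation: the helper name `decode_with_fallback` and the candidate list of encodings are assumptions chosen for illustration.

```python
def decode_with_fallback(data: bytes, encodings=("utf-8", "cp1252", "latin-1")):
    """Return (text, encoding) for the first candidate that decodes cleanly.

    The candidate list is an assumption; adjust it to the languages you expect.
    """
    for encoding in encodings:
        try:
            return data.decode(encoding), encoding
        except UnicodeDecodeError:
            # This candidate cannot represent the bytes; try the next one.
            continue
    raise ValueError("none of the candidate encodings could decode the input")
```

Note that `latin-1` accepts every byte sequence, so with the default list it acts as a last resort and may produce wrong characters rather than an error.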

Test your installed Unicode fonts on this demo page and download the Arial Unicode MS font if necessary.

Besides regular text files, DivX subtitles (.sub or .srt) can also be uploaded and are analyzed properly. Several subtitle files are available for testing.
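
How this site prepares subtitle files is not documented here. As a hedged sketch, the following shows one way to reduce an .srt file to its dialogue text before analysis, by dropping cue numbers, timing lines, and simple markup tags. The function name `srt_to_text` is hypothetical, and the .sub format is not handled.

```python
import re

# Matches SRT timing lines such as "00:00:01,000 --> 00:00:03,500".
TIMING = re.compile(
    r"^\d{2}:\d{2}:\d{2}[,.]\d{3}\s*-->\s*\d{2}:\d{2}:\d{2}[,.]\d{3}"
)
# Matches simple inline markup such as <i> or </font>.
TAG = re.compile(r"<[^>]+>")


def srt_to_text(srt: str) -> str:
    """Keep only the dialogue lines of an SRT subtitle file."""
    kept = []
    for line in srt.splitlines():
        line = line.strip()
        # Skip blank separators, numeric cue counters, and timing lines.
        if not line or line.isdigit() or TIMING.match(line):
            continue
        kept.append(TAG.sub("", line))
    return "\n".join(kept)
```

A dialogue line that consists only of digits would also be dropped by this sketch, which should rarely matter for language detection.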

Read about how this detection works and find further information, including a demo and code to implement your own language detection.






© 2007 Reto Fassbind


  Afrikaans
  Albanian
  Alemannic
  Amharic
  Arabic
  Armenian
  Basque
  Belarusian
  Bosnian
  Breton
  Bulgarian
  Catalan
  Chinese
  Croatian
  Czech
  Danish
  Dutch
  English
  Esperanto
  Estonian
  Finnish
  French
  Frisian
  Georgian
  German
  Greek
  Hawaiian
  Hebrew
  Hindi
  Hungarian
  Icelandic
  Indonesian
  Irish_gaelic
  Italian
  Japanese
  Korean
  Latin
  Latvian
  Lithuanian
  Malay
  Manx
  Marathi
  Middle_frisian
  Mingo_iroquois
  Nepali
  Norwegian
  Persian_farsi
  Polish
  Portuguese_brazil
  Portuguese_europe
  Quechua
  Romanian
  Rumantsch
  Russian
  Sanskrit
  Scots
  Scots_gaelic
  Serbian
  Serbian_cyrillic
  Slovak
  Slovenian
  Spanish
  Swahili
  Swedish
  Tagalog
  Tamil
  Thai
  Turkish
  Ukrainian
  Vietnamese
  Welsh
  Yiddish