Computerized comparative linguistics

This blog presents a completely computerized model to identify language relationships, quantify them and classify the languages into families in a completely automated way.

You can query comparisons between languages in the calculator and get values for the relatedness (genetic proximity) between languages. A rooted evolutionary tree summarizes all results of the distances between over 160 languages.

So this blog is not just about comparative linguistics: it takes you to a short digital trip in the history of languages... You will see how 18 words (when carefully chosen) can deliver values which are enough to calculate a distance between two and more languages and represent it on a tree. The distances are expressed as values between 0 (the nearest distance - so the same language) to 100 (biggest possible distance). Play with these values in the calculator! You will recognize proximities you can feel by yourself if you know some of the languages used in this study...

A few examples to illustrate the idea behind the project: the system's assessment for the distance from 0 to 100 between following languages is:
Tower of Babel

Make comparisons between 160 languages in the calculator!

This gives you a first idea what this site is about. With the few examples above, you can conclude that the degree of proximity between Russian and German (both Indo-European languages) is quite the same as the degree of proximity between Finnish and Hungarian (both Finno-Ugric).

Once we can get such values, we can generate a matrix. like this one, summing up distances between some languages (values from the few examples above have a green background in the matrix):

Language matrix
...and finally, out of this distance matrix, we generate a rooted evolutionary tree - using the same system - and in fact the same software - like in genetics and biology (details under Resources):
Language evolitionary tree

Blog author: