Monday, January 9, 2017

Going ahead with the new data, clustering

My new data makes possible to cluster better samples according to ethnicities. It is now possible to see at least

Middle-Eastern
Abkhasian-Armenian-Georgian-Assyrian
Caucasian
South European
West European
East European
Finnish dwelling zone
Baltic dwelling zone

Unfortunately none of those new sample sources give reasonable South European view, which makes impossible to see inside the Mediterranean area.  With better sampling I probably could create at least Balkan, South-Italian, Iberian and Basque clusters.   It is probably now possible to classify also project individuals by PCA.

Europe, clustered by Saami, Mongolian, South-Asian and Middle-Eastern samples


Zoomed in



Europe, plotted exclusively.  You can see clearly western and eastern clusters, as well as Balts and the Baltic-Finnic group splitting into Scandinavian and East-Slavic relations.   We could see also a clearly distinct Scandinavian group with more proper samples.  Unfortunately the South European picture is fuzzy due to too few samples.  Due to the shortage of samples I narrowed each group down to four samples, except Tuscany to strengthen the southern cluster.  It is very possible that with a larger South European sampling the European west and east would diverge even more than we see now on this plot below.