Ethio Helix ኢትዮ:ሒሊክስ: TreeMix analysis on the African Dataset

Monday, March 12, 2012

TreeMix analysis on the African Dataset

Thanks to a commenter going by the moniker 'Eze', who notified me the other day of a new program called Treemix, in which it infers “patterns of population splitting and mixing from genome-wide allele frequency data”, I had a chance to give it a try on the Intra-African Dataset that I have described previously.

After converting the input file into the desired format, I decided to play with several of its functionalities to become familiar with it,

1) Default Maximum Likelihood (ML) Tree,

2) Default ML graph with 4 assumed migrations,

3) ML graph rooted with the San-nb,

4) ML graph with 4 migrations and rooted with the San-nb.

A remaining option of the software that I have not as yet tried is that which groups SNPs together to account for linkage disequilibrium.

Other than that, the results are quite as expected, the North Africans are shown in both the default and rooted trees, but especially with the San-n rooted tree, as a branch of East Africans, and where East Africans in turn are seen as a branch of other Africans, consistent with evidence from uni-parental markers, as well as published papers, for an East African genesis of Eurasians, of which North-Africans can be used as a proxy for this particular Dataset.

The 4 inferred migrations in order of decreasing edges were;

-(Biaka Pygmy, Ancestral Sotho/tswana) → Sandawe, Migration edge:0.457032; likely an old hunter gatherers link. This was noted by Tishkoff (2009) : “These results suggest the possibility that the SAK, Hadza, Sandawe, and Pygmy populations are remnants of an historically more widespread proto-Khoesan- Pygmy population of hunter-gatherers.”

-(!kung,Ancestral to Biaka and Mbuti Pygmies) → Hadza, Migration edge:0.44087; potentially another early hunter gatherers link.

-Ethiopian Jews → San, Migration edge:0.188914; this could be a relic of early hunter-gatherer connections with Ethiopia (See: Ethiopians and Khoisan share the deepest clades of the human Y-chromosome phylogeny.) Another possible connection for this could be the migration of YDNA E1b1b1b2b (E-M293) carriers from Eastern Africa to Southern Africa within the past few millennia.

-Mbuti Pygmy → Alur, Migration edge:0.140627; this was also picked up by the ADMIXTURE analysis, where the Alur had significant amounts of Mbuti and Biaka pygmy components.

Further reading on the details behind the software featured in this post, TreeMix, can be found here: http://hdl.handle.net/10101/npre.2012.6956.1.

UPDATE: Run another one again rooted with the SAN from Namibia and 10 migrations assumed and got the following results, left column is Migration edge weight

0.586693 luhya →hema,hadza

0.508001 egyptans → EtA

0.504407 egyptans → EtT

0.442291 egyptans → Ethiopian-jews

0.432858 moroccans → fulani

0.27746 mbutipygmy,pygmy → sandawe

0.203223 mbutipygmy,pygmy → hadza

0.156929 egyptans → maasai

0.154406 moroccans → san

0.129901 pygmy → alur

Some of the results from the previous 4 assumed migrations run disappeared, it is not clear if migrations inferred from a lower m assumption are more statistically significant than those inferred from higher m assumptions. In general, this newer run resembles more of the K10 ADMIXTURE run, however there are some obscure differences, for instance, while it picked up a North to East African migration in the EtA, EtT and EtJ samples, it skipped the EtO samples and then picked up the same migration pattern in the maasai samples, whom had a lower 'North-African' component in the K10 ADMIXTURE run than the EtO samples. My take on this is that the program is not yet sophisticated enough to accommodate for bidirectional migrations that have happened for thousands of years, like the ones that have taken place between East and North Africa for instance. Indeed the authors of the software do list the following pertinent point as one of their assumptions:

"We also have modeled migration between populations as occurring at single, instantaneous time points."

and

"This model will work best when gene flow between populations is restricted to a relatively short time period. The relevance of this assumption will depend on the species and the populations considered."

UPDATE2: Residual plot for 10 migrations rooted with the San-nb.

54 comments:

MajuMarch 12, 2012 at 7:25 PM
That's very interesting, Etyopis. Thanks.

I'm still learning to appreciate this "gadget" so I do not have much to say, other than a West Eurasian control would have been interesting to compare with, specially as the African tree is probably the basal tree of Humankind as well.

It's clear in any case that most West Africans and all North Africans form clear compact clusters, while the rest seem more "diffuse".

...

Related because of Treemix but not directly with the content of this post:

Also, I asked this to Dienekes but did not get any answer: when a West Eurasian migration to North African appears to weight 73% (with zombies, from SW Asia) or 70% (with real populations, from Sardinia to Mozabites), does this mean that c.70% of North African ancestry is West Eurasian? I understand that from the explanation in the paper (fig. 1, where the root has a weight 1-w, in this case 1-0.7=0.3) but being a total 'noob' with this program, I'm in serious doubts anyhow. If so, I'd expect the algorithm to hang the NA population/zombie from the West Eurasian node and show the migration the other way around (as the weight is smaller) but it does seem further down the paper that such alterations are possible (errors or limitations of the algorithm it seems) and the illustrate with some examples.

Do you have an opinion on this?
ReplyDelete
Replies
jes-rMarch 13, 2012 at 1:39 AM
Interesting! Thanks for taking the time to do this.
ReplyDelete
Replies
MajuMarch 13, 2012 at 5:48 AM
"... what is the point of using TreeMix to analyze ADMIXTURE components?"

You know how Dienekes is... I generally prefer real populations over zombies, unless maybe when comparing individuals or some other special exercise for which a fixed conceptual frame can be useful.

I'm pretty sure that I saw another TreeMix showing Sardinian flow to Mozabites at the same c. 70% level but can't find it anymore even with the help of browser history (has D. changed the content of his posts or am I going senile early and fast?)

Guess I'll have to learn and design my own exercises.
ReplyDelete
Replies
EtyopisMarch 13, 2012 at 8:29 PM
Just updated for 10 assumed migrations.
ReplyDelete
Replies
DienekesMarch 14, 2012 at 7:27 AM
The point of using ADMIXTURE components over "real populations" is that real populations include admixed individuals/outliers, etc. and result in the inference of phantom migration edges.

It's quite different if there are 2-3 admixed in individuals in a dataset of N=20 vs. low level admixture at similar levels in all 20. TreeMix, since it works with allele frequencies, rather than individuals, would fit a migration edge in both cases. By using ADMIXTURE components, this layer of recent admixture events is removed.
ReplyDelete
Replies
LankMarch 14, 2012 at 1:12 PM
My take on this is that the program is not yet sophisticated enough to accommodate for bidirectional migrations that have happened for thousands of years, like the ones that have taken place between East and North Africa for instance.

You would almost certainly see sub-Saharan migration into Egypt if you had included Eurasians. In your analysis, Egyptians are the closest thing to a Eurasian sample, so there's not much of a need to weigh in the influence of migrations from SSA.
ReplyDelete
Replies
dalouhMarch 21, 2012 at 4:27 AM
0.154406 moroccans → san
the outer epicanthic fold connection that endogenous North Africans share with the San ?
the migration must be very ancient...I believe...unless it was an error .
ReplyDelete
Replies
dalouhMarch 21, 2012 at 2:09 PM
probably so , but why Moroccans the most endogenous and "secluded" of the Northern group and not the Egyptians the most Eurasian and within the vital Nile migratory route to East Africa and beyond ?
could the old basal alleles of the hybridized Northwest component be responsible for the strange results ?
ReplyDelete
Replies
EtyopisMarch 21, 2012 at 11:32 PM
good question, but the Eurasian type of ancestry between egyptians and Moroccans may also be different in type. I'm not fully discounting an old connection between Morrocans and the SAN, Cruciani found some of the most basal YDNA in North and Central Western Africa, so the connection could be there, but the fact that the there was a difference in the Autosomal profiles between the Namibian San and the SAfrican San, makes me suspicious.
ReplyDelete
Replies
dalouhMarch 22, 2012 at 7:16 PM
it seems that there are plenty of E Y-chromosomes in SAN groups...

http://2.bp.blogspot.com/_Ish7688voT0/TO-pMuAmfNI/AAAAAAAAC6w/U3hRZNYQtnY/s1600/southafrica.png

it seems that Moroccans picked up the Eurasian input for the reason I stated early...
I think if you introduce a West Eurasian sample and repeat the experiment, the migration edge scores will change somehow...
ReplyDelete
Replies
EtyopisMarch 22, 2012 at 8:21 PM
The E1b1b1 input in South Africa is most, if not all, E-M293 (E1b1b1b2b) with a likely genesis in East Africa, the paper :

"Development of a single base extension method to resolve Y chromosome haplogroups in sub-Saharan African populations"

from which you cited those YDNA frequencies for, did not test for E-M293 :

"Those Y chromosomes assigned to haplogroup E1b1b1 were screened further using the multiplex assay, Hg-E1b1b1, which consisted of the markers M78, M148, M81, M107, M165, M123, M34, M136 and M281",

of those; M81,M107 & M165 are the only ones that are really associated with N.W. Africa. So if you are speculating that the Morroco-->san connection that treemix found is via a node higher than E-M81, which most of those nodes are to be found in the Eastern Part of Africa, but then why did the program not find at 10 migrations an Eastern Africa --> San connection, although it did so at 4 migration assumptions (see post of this thread), i.e. Ethiopia --> San.
ReplyDelete
Replies
jes-rMarch 23, 2012 at 1:03 PM
Some South African San outliers have recent European ancestry from Afrikaners and Cape Coloureds who live in the Cape region. Therefore it is not surprising that TreeMix picked up a migration from Morocco (who have European alleles via Iberian input) to the San.
ReplyDelete
Replies
dalouhMarch 24, 2012 at 3:20 AM
@ Eze

I am not convinced it is the case,are you sure that the recent ADMIXTURE is having an effect on the results ?
if so , the whole exercise is worthless ..

@ Etyopis

"which most of those nodes are to be found in the Eastern Part of Africa, but then why did the program not find at 10 migrations an Eastern Africa --> San connection, although it did so at 4 migration assumptions (see post of this thread), i.e. Ethiopia --> San."
you asking me ?! it is your run ,you are the one who should come up with a convincing interpretation ...
I don't know what that Eastern Africa --> San connection was about ..."Lucy" ? Omo ? or what the whole African trunk represent, the AMH or a much deeper evolutionary ancestry ? you tell me ..
ReplyDelete
Replies
MajuMarch 24, 2012 at 5:06 AM
AMH, Dalouh, AMH.

There are several elements in an Eastern-Southern Africa connection surely:

1. It appears to my eyes that the L0 clan (to which Bushmen essentially belong) originated in East Africa (Lake Victoria area?) and left its legacy in East Africa and surely also Arabia Peninsula.

2. There was a paper some time ago arguing for a secondary pastoralist migration from East Africa into the South (as you may know the Khoikhoi or Hottentots used to be pastoralists at the time of arrival of Europeans and Bantus). The amount of genetic flow would have been small (but detectable), in a clear case of Neolithic diffusion without replacement.

As for the Treemix debate in general, I strongly suspect that the algorithm is bugged. Otherwise Dienekes' results (c. 70% West Eurasian admixture into Yoruba!?) should not exist. So I'd take all with a good pinch of salt until the method gets well tested (and maybe fine-tuned).

But I can't figure what's up with Morocco: all the arrows I see heading to the San spawn from the root of the North African cluster, which, if not identical, should at least be close to that of West Eurasians in general, of which North Africans are largely a subset (with aboriginal African blood but not as dominant). So I'm essentially agreeing with Eze in that this may well be nothing but minor European admixture in one of the San samples - just that I think it reflects the West Eurasian dominant ancestry of North Africans and not just the Iberian immigration, which would be only part of it - that's why the arrow stems from the North African root (i.e. it's some other related branch in fact or generic admixture from the whole cluster).
ReplyDelete
Replies
dalouhMarch 24, 2012 at 2:36 PM
@ Maju
"AMH, Dalouh, AMH. "
but at what evolutionary stage(s) ?

"As for the Treemix debate in general, I strongly suspect that the algorithm is bugged. Otherwise Dienekes' results (c. 70% West Eurasian admixture into Yoruba!?) should not exist."
may need some refinement, but bugged ?
I am actually pleased with Dienekes's results regarding the composition of the NorthWest component ..
and this what I said a while ago :
"anyway, I have the same reservations on the reconstruction of the Mechta Afalou man by Elisabeth Daynes....Because it was Assumed that Northwest Africa was not occupied by its natives....the Afalou man was most likely a mix and not a West Asian looking one...

http://s1.zetaboards.com/anthroscape/topic/2264152/1/ "

http://forwhattheywereweare.blogspot.com/2011/06/homo-sapiens-childs-remains-found-in.html
thats right a MIX !

it was 64% for the Yoruba and 73% with the Mozabites.. and both populations are dominated by the E..where is the mystery part ?
it will be interesting to know if that 73% will hold or change with the South Moroccans /Atlasians ...as your North African ADMIXTURE experiment showed at K=11 a 14,4% of a distant pre-Dabban component...

http://4.bp.blogspot.com/-qpX29zXLVfc/TvyVB5WETPI/AAAAAAAAAxY/ZGJBT8BXevk/s1600/FstTable.png

http://forwhattheywereweare.blogspot.com/2011/12/north-african-genetics-through-prism-of.html

why are the Ethiopians the least distant ?
ReplyDelete
Replies
MajuMarch 24, 2012 at 4:12 PM
"but at what evolutionary stage(s) ?"

Not sure what you mean: fully evolved Homo sapiens (but before the OoA for phase 1).

"may need some refinement, but bugged ?"

When you have a new software and extremely strange results, there is strong reason to distrust.

Surely bugged, yes.

"it was 64% for the Yoruba and 73% with the Mozabites.. and both populations are dominated by the E"...

I don't even imagine how "the E" (Y-DNA E, I presume) can justify anything like that, much less when it's a minor lineage of clear African origin among Europeans and West Asians.

What is clear is that Yoruba are NOT 68% Vasco-Sardinian or otherwise West Eurasian (nor vice-versa), so something very bad is happening to that software, because it is producing results that are not consistent with anything we know. Sadly Dienekes' wishful thinking has led him to believe and defend this bugged result but it's so obviously wrong that, even knowing how Dienekes tends to be biased and wishful thinking in so many things, I am astonished: he should be full of doubts and instead he does not even blink.

"why are the Ethiopians the least distant ?" [in my Fst results with ADMIXTURE for North Africans-plus]

IMO, and I said in text, both the Ethiopian and Fulani components show ancient WEA-African admixture, old enough to have been settled in a single component, apparent only at certain K levels.

Lower K levels show the admixture rather obviously: at K=9 Ethiopians still showed up as roughly 2/3 "Arab" 1/3 "Mandenka", while at K=7 Fulani show up as a mixture of roughly 2/3 "Mandenka" and 1/3 "Sahrawi". At those levels, the Arab and Sahrawi component appear as West Eurasian (close per Fst to other WEA components, distant from Tropical African ones), while the Mandenka component appears as clearly distant from WEA components, hence "aboriginal African" (previous to the "Aurignacoid" back-migration).

Both components appear as mixed but while the Fulani's closest component is the Mandenka one, the Ethiopian closest component is the Moroccan and then the Iberian and Arab ones, reflecting the different levels of apparent admixture or whatever it is (Etyopis said it might be proto-Eurasian/East African affinity rather than only Eurasian back-flow, neither have tested this yet).

But in no moment there is meaningful WEA ancestry apparent in the Mandenka nor Mandenka ancestry in WEA populations out of Africa. And this result is consistent in many other analysis, for example Henn 2012 (who worked with different samples, including Yorubas and Basques, getting very similar results to mine).

The problem is that extraordinary claims require extraordinary evidence and so far Dienekes' "evidence" only casts doubts on the methods themselves, including the TreeMix software (what may be useful to know but it's bad for proving anything).
ReplyDelete
Replies
EtyopisMarch 24, 2012 at 7:00 PM
"Etyopis said it might be proto-Eurasian/East African affinity rather than only Eurasian back-flow, neither have tested this yet)"

Well, there is nothing to test, and it is not me who have said it but geneticists, anyway, you already know where West Eurasians originated. Nobody has the tools to discern yet which autosomal genetic signature in East Africans is west Eurasian or aboriginal African, as of now, only uniparental markers have the ability to discern that, Ethiopians have on average ~ 80% aboriginal African Y and mtDNA markers, mtDNA haplogroup M, that is prevalent in East Africa is not really 'West Eurasian' by any stretch of the imagination, it is likely an OOA marker, that happened shortly after the initial migrations. The reason West Eurasians are closer to Ethiopians, than the remaining further distant OOA populations, or further distant African populations, is partly because Africans never stopped breeding with the OOA populations once they left Africa, Li and Durbin concluded that Africans were breeding with the OOA population for 40,000 years after the initial migration before West and East Eurasians even split, we have further YDNA marker evidence that originated in Ethiopia (E1b1b) 20000 years ago and spread out of Africa to the 'West Eurasian' areas. So I think it is a mistake to think that Africans stopped breeding with the OOA populations after they left, and some how these populations became 'pure' West Eurasians just because they left Africa, no, the breeding exchange happened and it was likely continuous and hardly discrete.The other part is off-course, that genetics is largely a function of geography, East and North Africa are closer to West Eurasia than other parts of Africa just by virtue of Geography.
ReplyDelete
Replies
jes-rMarch 24, 2012 at 7:51 PM
''mtDNA haplogroup M, that is prevalent in East Africa is not really 'West Eurasian' by any stretch of the imagination, it is likely an OOA marker, that happened shortly after the initial migrations.''

M1 is dated to about 27 kya, while the OOA event occurred 70-60 kya. That's a considerable time gap. So I doubt it is simply an aboriginal OOA remnant, because of the considerable time gap and complete lack of other basal M lineages. Also, researchers have linked the initial spread of M1 to have occurred alongside U6 from the southern Levant.
ReplyDelete
Replies
MajuMarch 24, 2012 at 8:49 PM
Etyopis: the coalescence of the West Eurasian (macro-)population must have been through the following process (emphasis in the possible interactions with Africa):

1. OoA: the proto-Eurasian population parts ways with the East African one and migrates towards (tropical and subtropical) Asia
2. Eurasian diversification: This early Eurasian population diverges in several groups, notably South-West Eurasians and East Asians (also the various Negrito, Melanesian and Australian pops.)
3. Aurignacian era: The West Eurasian core diverges from the South Asian one and migrates Westward, eventually driving Neanderthals to extinction and probably also making some inroads into Arabia and Africa (mtDNA M1, U6, etc.), although the exact extent of these is not clear.
4. Probably a second WEA flow affecting North Africa arrives from SW Europe during the LGM. Maybe also some back-flow into Iberia (Y-DNA E-M81, mtDNA U6).
5. The Afroasiatic expansion from East/NE Africa reaches also parts of Eurasia (E-V13 and such). This took place probably in the late UP (Capsian culture, African influences in Harifian, etc.)
6. Possibly further flows from West Asia into North and East Africa (and certainly Arabia peninsula) with the Neolithic. Possible flow from North Africa to Iberia (option B for the arrival of NW African lineages to West Iberia, and even as far as some localities of Wales).

That's how I see it. It implies some important episodes of back-migration to Africa, not all well documented archaeologically however but rather apparent in the genetics. The details are surely arguable but in any case there must have been some back-flow since deep in the Paleolithic co-influencing African genetics.

"mtDNA haplogroup M, that is prevalent in East Africa is not really 'West Eurasian' by any stretch of the imagination, it is likely an OOA marker"

That's not possible. M must have a single origin and the diversity in Africa is extremely low. Not just that, M1 can be easily demonstrated to be original of Asia because its "sisters" M20 and M51 (together making M1'20'51) are from Asia (M51 from Indonesia in fact, Min Peng 2007; M20 I think is from India but may also be from SE Asia).

Considering M1 to be a remnant of the OoA is simply wrong. And as it's been mentioned there's certain parallel between the scatter of mtDNA M1 and that of Y-DNA T (important in The Horn) along the Indian Ocean's coasts.

There is some backflow for sure. We don't know exactly the timing (because I do not think any relevant archaeology is known yet) but there was a small back-migration of Eurasians into the Horn and other areas of East Africa long ago.

(And, sincerely, I hate to discuss this with Africans because I end up looking like the white guy trying to push some sort of neocolonial notion, what is totally out from my intent: I'm just stating the facts as I see them, please understand that).

[continues]
ReplyDelete
Replies
MajuMarch 24, 2012 at 8:50 PM
[cont.]

"Ethiopians have on average ~ 80% aboriginal African Y and mtDNA markers"

For Y-DNA that's more than correct but not so for mtDNA, showing like 30-40% of Eurasian lineages (M1, HV and others).

Still the affinity that Ethiopians appear to display towards West Eurasians is greater than that, so it's probably in part due to the fact of not being more akin to West Africans than to West Eurasians from the beginning.

"Li and Durbin concluded that Africans were breeding with the OOA population for 40,000 years after the initial migration before West and East Eurasians even split"...

Haven't read it, sorry.

But with whom exactly and how? Because once the migrant population left Arabia and entered into Asia proper (Pakistan, India and beyond) there's no way there could be any more contact until the back-flow of West Eurasians into West Eurasia first of all but also outflowing into parts of North Africa and whatever was left of the early OoA population in Arabia, etc.

The OoA population soon branched out and was mostly at a long distance from Africa in any case, so this claim looks difficult to conciliate with the expanding and diversifying dynamics of the early Eurasian population. There would be contacts with specific sub-branches of the OoA population but not all. These specific sub-branches can be two: (1) the remnant OoA population of Arabia (or Fertile Crescent if you prefer that model), which was surely very small and rather closer to Africans than to the bulk of Eurasians, who were being redefined by founder effects (= bottlenecks) and (2) the West Eurasian population which surely migrated westwards from South and SE Asia c. 50 Ka ago and whose most distinctive trait appears to be Aurignacoid industries (early UP).

"we have further YDNA marker evidence that originated in Ethiopia (E1b1b)"

I do think that E, E1, E1b, E1b1b... originated all in Africa. The expansion of these lineages may have displaced some Eurasian Y-DNA (I think this is correct in North Africa and may be also the case in parts of East Africa, where there is more Eurasian mtDNA than Y-DNA).

"So I think it is a mistake to think that Africans stopped breeding with the OOA populations after they left, and some how these populations became 'pure' West Eurasians just because they left Africa, no, the breeding exchange happened and it was likely continuous and hardly discrete".

I instead think it was more discrete because there was a migration to Southern Asia first, well documented archaeologically and genetically, and only later the Aurignacoid back-flow happened. In fact:

- c. 125 Ka ago: First indications of OoA in Arabia and Palestine (but possibly not beyond)
- c. 90 Ka ago: much more general OoA presence in Arabia
- c. 80 Ka ago: African-derived MSA-like industries in South Asia, soon also stone blades (defining West Eurasian UP later on)
- c. 55 Ka ago: Homo sapiens with Aurignacoid (UP) industries in Palestine, c. 48 Ka Aurignacoid industries in Central Europe, some time around those dates (c. 40 Ka or so) in Altai and the Pyrenees and Libya (Dabban industries)

If you need references feel free to ask, but it's all documented in my blog: Petraglia, Armitage, Rose, etc.

There was a separation and then a "reunion" (sounds nice but maybe they killed each other in part, I do not know).
ReplyDelete
Replies
dalouhMarch 25, 2012 at 4:00 PM
"Sadly Dienekes' wishful thinking has led him to believe and defend this bugged result but it's so obviously wrong that, even knowing how Dienekes tends to be biased and wishful thinking in so many things, I am astonished: he should be full of doubts and instead he does not even blink."

this new tool just confirmed his theory..I don't think it is some wishful thinking as you are trying to describe it..
I find his argument very convincing, as this one at Razib's blog :

http://blogs.discovermagazine.com/gnxp/2012/03/we-are-all-sardinians/

"Exceptional claims require exceptional evidence, and this is not it at all. "

Henn et al paper about North Africans IS the evidence..the Afalou (27 ky) and the Ibero-Maurusian remains (16 ky) from Ifri n Ammar seen here in this video :
http://www.youtube.com/watch?v=VSKaa1Uh-h8

are the direct descendants of the Eurasian DE.. and that is why North Africans cluster with other Eurasian groups...what other extraordinary evidence do you need Maju ?

@ Etyopis

don't make big assumptions on who were the proto-Eurasians...

argiedude said

"he Batini study? I wish I could've been able to give them a few personal opinions before they started the tests. The most interesting thing I found of y-dna B is the existence of a cluster with a very distinctive haplotype, namely the presence of 392=13, almost unheard of outside of y-dna P (Q, R1a, R1b, etc.). And this cluster is located geographically almost on the northern half of the Sahara, including Morocco and Egypt. But incredibly, despite finding 2 dozen samples, not one of them had tested any downstream SNPs, just B. when the Batini study came out, I thought for sure the mystery would be settled, but they found just a single sample from this cluster... and they didn't test it (for downstream SNPs)!"

http://forwhattheywereweare.blogspot.com/2011/05/major-upheaval-of-human-y-dna-phylogeny.html
ReplyDelete
Replies
dalouhMarch 25, 2012 at 5:56 PM
the genetic profile of the North Africans supports the Eurasian origin of the DE ..and that is good enough evidence for me..
"The Treemix algorithm is only now being widely tested and is producing some results that clearly look spurious on light of all other known genetic data. Why?"
genetic data need to be supported by the Archaeological data...it is a simple rule.
while NorthWest Africa has both , South Africa is lacking the Archaeological evidence ( no AMH remains)..and no excuse here, both are wine producing regions today because of their mild climate (unlike tropical Africa )..
we may see more surprises in coming years , be ready...
ReplyDelete
Replies
MajuMarch 25, 2012 at 8:53 PM
"the genetic profile of the North Africans supports the Eurasian origin of the DE ."

Y-DNA DE? No way! The origin of DE is only determined by itself and, in this equation North Africans are very much secondary: Tibetans and Japanese are in fact more intriguing. North African (and West Eurasian) DE (E1b1b1 variants in fact, nothing else) comes from the area of the Upper Nile or other parts of Tropical Africa, where most of the diversity is (DE* for example is found in West Africa - I always assume that the two DE* individuals reported once in Tibet are pre-D rather than truly hanging from the common origin of both D and E, but the matter is not well researched).

You are wishful thinking like Dienekes: daydreaming because of unspoken racial prejudices (I understand). You two cannot embrace naturally our (minor) "recent" African ancestry so you'd like to make it all "Eurasian" somehow. That's not scientific but ideological and of a quite bad ideology in fact.

"genetic data need to be supported by the Archaeological data...it is a simple rule".

True in the sense that genetic data should not contradict overwhelming archaeological data but be conflated one with the other in order to provide the truest possible solution to the puzzle of human origins. Not true in the sense you claim:

"South Africa is lacking the Archaeological evidence ( no AMH remains)"

There's strong archaeological evidence of presence of H. sapiens in Southern Africa since >100 Ka ago. No skulls? May be but lack of evidence is not evidence of lack. The archaeological data does not contradict the genetic data in any case it just fails to produce even stronger support.

Mind you that the genetic data in North Africa does not support apparent continuity from Djebel Irhoud. I have found what seems to be a small remnant or three of Aterian age (at the most) but not from earlier times. Maybe there is something but so thin that it's almost impossible to detect.

In any case these remnants would link to Tropical Africa as direct ancestor: NW Africa is not some other planet: its patterns are related to the rest of the World and very much so in fact. It is rather a place where many migrations have ended and where probably not a single major migration began. Prove me wrong in this if you can: minor flows have sprang from NW Africa in Northwards and Eastwards and even Southwards direction - but did not reach too far nor was overwhelmingly dominant ever. That's what the genetics say: Jebel Irhoud is not more our direct ancestor than some random Neanderthal.

"we may see more surprises in coming years , be ready"...

We'll see what is to see in due time.
ReplyDelete
Replies
EtyopisMarch 26, 2012 at 9:55 AM
“Considering M1 to be a remnant of the OoA is simply wrong.”
Considering ANY haplogroup outside of Africa to be a remnant of OOA is not wrong at all, it is simply a fact. Even if you consider the Ethiopian specific M1 to have coalesced 30 KYA outside of Africa as, we have evidence, like I pointed out to you from Li and Durbin from studying complete diploid genome sequences that the West and East Eurasian populations did not even genetically diverge at that time. They may very well have diverged physically by that time but signatures of what we identify today as West Eurasian and East Eurasian Autosomal genetics may not even have appeared at that time, therefore trying to use M1's presence in Ethiopia as evidence to back up the seemingly West Eurasian genetic affinity found in Ethiopians in ADMIXTURE runs is dubious at best.

“In summary, the existence of long segments of low divergence between YRI1 and KOR supports the inference from PSMC that there was substantial genetic exchange between West African and non-African populations up until 20–40 kyr ago, and is not consistent with a simple separation approximately 60 kyr ago.”

“Notably, a recent study using an orthogonal type of data (analysis of allele frequencies) also inferred that gene flow between Africans and non- Africans continued well after the initial out-of-Africa migration: in the case of that study, until 17–26 kyr ago25."

^ Li & Durbin (2011)

“but there was a small back-migration of Eurasians into the Horn and other areas of East Africa long ago.”
We have uniparental evidence for back migrations, but we do not have any evidence of what the Autosomal affinity of these OOA populations that back migrated could have been, for instance they could, possibly, have been more African-like than Eurasian like autosomally back then.

“For Y-DNA that's more than correct but not so for mtDNA, showing like 30-40% of Eurasian lineages (M1, HV and others).”
Even with your estimates that would make for Ethiopians possessing ~70-75% Aboriginal African haplogroups on average, which is only 5-10% off my ~80% average estimate.

“c. 125 Ka ago: First indications of OoA in Arabia and Palestine (but possibly not beyond)
- c. 90 Ka ago: much more general OoA presence in Arabia
- c. 80 Ka ago: African-derived MSA-like industries in South Asia, soon also stone blades (defining West Eurasian UP later on)
- c. 55 Ka ago: Homo sapiens with Aurignacoid (UP) industries in Palestine, c. 48 Ka Aurignacoid industries in Central Europe, some time around those dates (c. 40 Ka or so) in Altai and the Pyrenees and Libya (Dabban industries)”

Most of those pre-70K populations could have very well died out, the most recent and up-to-date molecular clock based on mtDNA mutations has ruled out the possibility that the ancestors of all humans alive today exited Africa before the toba eruption. I know you have a problem with these time estimates, which your final or root reasoning boils down to pushing back the human chimp divergence time by millions of years, but this position of yours is contrary to the Academic orthodoxy, and I neither have the desire nor the knowledge to challenge it, but suffice it to say that I accept the Academic orthodoxy on this matter.

“@ Etyopis
don't make big assumptions on who were the proto-Eurasians... “

By the way, did you see the supervised Global K10 run? North Africans had ~40% Ethiopian like, ~50% Basque like and ~10% Dogon like affinities. Peninsula Arabs and even delta Egyptians had less Ethiopian like affinities than far NorthWest Africans, why is that? Although the Arabian peninsula is closer to Ethiopia than Northwest Africa, and the Nile is a highway that connects Ethiopia to Egypt.
ReplyDelete
Replies
MajuMarch 26, 2012 at 1:09 PM
Also, I was realizing that we do have specific direct evidence which shows that the descendants of N were inhabiting Europe some 30,000 years ago, what is consistent with the no-massive-replacement model, which I spouse, and contradictory with the no-differentiation model until 20 or, in the case of Europeans, less than 20 Ka ago.

We know and nobody questions it that an individual in Kostenki, Russia, was mtDNA U2 some 30 Ka ago, what implies that U2 itself but also its ancestors U2'3'4'7'8'9, U, R and N had coalesced previously to that date. In about the same same period (before 17 Ka ago) there are also U5, R0 (reported as HV(xH)), and some other R-derived lineages (of which one I think is an H subclade almost for sure) directly spotted via HVS-I in Europe.

The less-than-20-Ka hypothesis of Li and Durbin is not sustainable on light of the actual empirical data. That much is the kind of confusion that trusting MCH age estimates can lead to.
ReplyDelete
Replies
joshua.gateraMarch 26, 2012 at 6:29 PM
"Etyopis said it might be proto-Eurasian/East African affinity..."

It's no longer possible to deny or rather disregard the importance of indigenous East African genetic variation in reference to the relationship(s) between NE Africans and other important ancestral populations, especially Eurasians (specifically Western Eurasians).

East Africans (Afrasan speakers, Southern Sudanese Nilotic speakers, and indigenous SE Hunter-Gather groups, i.e. the Hadze and Sandawe) are closer to Eurasians than other Africans (Niger-Kordofanian speakers, Central African pygmies, and the Khoisan) are to the later.

This "Eurasian" cline in Africa, in reference to Eurasian affinity or vise-versa, is likely primarily related to the time depth of divergence between the respective ancestral African populations (I'll use linguistic terms in a rough association with these aforementioned AAPs) and the ancestors of non-Africans.

In the order of oldest to most recent divergence date with the ancestors of non-Africans...

Khoisan > Mbuti > Biaka > Niger-Kordofanian > Nilo-Saharan > Hadze-Sandawe > Afrasan in relation to non-Africans

Take a look at the results of this Dinka sample from 23andme...

http://i204.photobucket.com/albums/bb178/beyoku/Y.png
http://i204.photobucket.com/albums/bb178/beyoku/Mtdna.png
http://i204.photobucket.com/albums/bb178/beyoku/Paint.png
http://i204.photobucket.com/albums/bb178/beyoku/Globalsim.png

Independent PCA...

http://i204.photobucket.com/albums/bb178/beyoku/Full_20120201123442BGA2.png

^ As you can see, this Dinka sample clusters in between Somalis and West Africans; his results are supported by Tishkoff et al. 2009 whom also sampled groups from South Sudan. It's clearly obvious that what ever's causing the Eurasian affinity in this particular Dinka sample, and the Southern Sudanese in general, isn't due to any admixture (or commonality) from/with any other particular source, either be it Eurasian admixture and/or other indigenous East African gene-flow by way of the African Horn or groups like the Sandawe. What ever the case, a closer similarity to Eurasia relative to other parts of Africa seems to be the norm among all groups in East Africa, either be it Nilo-Saharan, Afrasan, or Hadze-Sandawe.

If we were to use the Dinka as a proxy for the African component of the Somalis for example, the African ratio would increase from ~50% African (with a West African proxy) to about ~67% with the Dinka proxy, it would then increase to about ~75% African if we were to use indigenous Hunter-Gather groups from SE Africa. It's therefore logical to assume that the pre-Western Eurasian admixed NE Africans would have been more similar to Eurasians than their SE African and S.Sudanese counterpartsa.

Western Eurasian admixture is clearly playing a notable role in the genetic affinities of at least some NE Africans; most importantly NE Africans groups in the northern Horn of Africa, i.e. Eastern Sudan, Eritrea, and Northern Ethiopia, whom cluster away (in the direction of Arabia) from groups like the Somali and some Oromos who lack "excess" Eurasian and/or other African admixture.

If I had to make an estimated guess Arabian admixture would peak in the northern Horn of Africa at about ~20-25%, where it would then decrease significantly to about ~10% among Somalis and other lowlander NE Africans.
ReplyDelete
Replies
MajuMarch 26, 2012 at 8:45 PM
"... until enough non-African samples are added at which point it would become a global PCA and the differentiation of North/East Africans from other subsaharan Africans vanishes, and the main differentiation becomes Africans from non-Africans"

That's not correct and you should be familiar with global datasets in ADMIXTURE/STRUCTURE analysis. For example Behar 2010 (sup. figures, scroll down to fig. 4a) at K=3 has North Africans (Moroccans, Mozabites, Egyptians) looking 80% West Eurasian and some 20% (West-Central-Southern) African. Ethiopians also look that way although the apportions are different: like 60% and 40%.

And that is a sample full of non-Africans of all kinds. And it is just an example.

(And curiously enough, maybe because Africans are not sampled in sufficient numbers, the main distinction at K=2 in that case is East Asians versus the rest - just for the record because IMO this is a matter of sample size and less important admixture between West Eurasia and Tropical Africa).

There are other examples and you can experiment with the 1000 genomes yourself and see if what you just said is correct or rather does not hold. And I think it does not hold in fact (based on all my genetic experience).
ReplyDelete
Replies
EtyopisMarch 27, 2012 at 9:07 AM
I never said all PCA's come in an 'L' shape, I said the first two dimensions of a Global PCA approximate an 'L' shape, that still remains the fact, it also remains the fact that on one side of the 'L', where the highest amount of the PCA's variation (eigenvalue) is explained, Africans are found, on the other side non-Africans are found, the corner of the 'L' houses those that are intermediate between Africans and Non-Africans, including some North Africans, Southern Euro and Near Eastern Populations, this is consistent with global genetic diversity decreasing as a function of distance from Africa.

Since PCA's are essentially vector spaces with both magnitude and direction, their dimensions specify the number of independent directions in space, hence PC1,PC2,.... are roughly independent and also orthogonal. Eigenvectors are only a special case of vectors, where an orthogonal basis for such vectors, is required to explain the variance of the data for a square matrix, like for instance an IBS matrix with an N X N size will yield N orthagonal eigenvectors after decomposition.
ReplyDelete
Replies

Add comment

Ethio Helix ኢትዮ:ሒሊክስ

Pages

Monday, March 12, 2012

TreeMix analysis on the African Dataset

54 comments:

Blog Archive

Search This Blog

Contact Form

Ethio Helix ኢትዮ:ሒሊክስ

Pages

Monday, March 12, 2012

TreeMix analysis on the African Dataset

54 comments:

Blog Archive

Search This Blog

Subscribe To

Contact Form