Gene testing; neat things and some stats jackassery
I got myself gene tested for the lulz and the raw data. I was primarily interested in ancestry results. For my ancestry results in the broad strokes, check out the article I recently wrote for Takimag.
Over the years, quite a few physical anthropologists have learned interesting things about historical human migrations from examining genetic data. I was hoping to learn what tribes my ancestors hail from. I was pretty sure they’d be Celts, Germans and/or paleolithic non-Indo Europeans like the Basques (who appear to make up a large fraction of the European population). The results indicate that, in fact, that’s what my genes and chromosomes are made of. I can’t tell how much of each, as the science on this subject isn’t really done yet. Where it is done, the databases are not yet publicly available for comparisons. There are interesting tools for getting more detail than 23andme.com and FTDNA give on your genetic ancestry (both services allow the individual to download their results). The one I liked the best was the interpretome. The interpretome has a tool for doing PCA on reference populations. The results put me closest to the Swiss in my genetic makeup among European populations. This makes a lot of sense, as I’m part French, part British (aka Germano Celtic) and part Bavarian German.

The results of doing PCA against non-European populations are interesting, though they mostly confirm what is already known about human diversity. My African results? Well, Africa is a very genetically diverse continent; it shows me as closest to a Berber tribe called the Mozabite.

My Asian PCA results: again, people think of Asia as China, India and Japan. There are also other tribes. The tribe I ended up closest to? The Kalash people, who are related to the pagan Nuristanis of “Man Who Would Be King” fame. They’re not descendants of Alexander the Great’s Army after all (they seem culturally more related to old Persians), but they look like they could be.

For “medical” results: my test results indicated I got the luck of the draw in the brains and self-discipline department. They also indicated that I also have no genetic potential for body building or sprinting, something which is painfully obvious to me whenever I go to the gym. My genes peg me for asthma and hay fever, which is true. They also nailed me as brown eyed, lactose tolerant and sensitive to caffeine. On the other hand, the fancy pants genetic testing got my blood type wrong, which seems rather odd.
Gene testing is feared for its ability to classify people by their genetic potential. Looking at the results: I don’t understand people’s fears. A quick look at my CV, a quick look at me, and feeding me a fancy coffee drink in the presence of a cat would tell you all of these things in a few minutes: no affymetrix chips required.
There are some weak genetic correlates with common diseases. The gene tests tell me I have a higher probability of glaucoma than average, and a lower probability of diabetes than average. This is what I would expect from looking at my immediate family’s medical history. I’m also supposedly immune to norovirus infection, which could help explain my cast iron stomach. On the other hand, my gene tests indicate I have both a higher and lower chance of celiac disease and Parkinsons disease. The science behind these results is obviously inconclusive at best. Possibly, it is complete nonsense.
Many of the studies the genetic tests are based on were conducted on a mere handful of people who I may have nothing in common with. Quite a few of these genetic studies of health outcomes do not pass the statistical sniff test and are likely to be measurements of random noise; caveat emptor. 23andme generally provides links to the studies of interest. I found this tool, by Enlis Genomics to be even more useful for sating my curiosity.
Is any of this useful, or is this just a high-tech version of astrology? The genealogical data is vague and often questionable, but useful to the curious. While I have no immediate plans for breeding, knowing that I carry no genes for purely genetic diseases such as Cystic Fibrosis is reassuring, though I suspected as much from family history. If you don’t know your family history, or have a very small family, such tests could be very helpful. I don’t think the “abilities” tests are useful at all; most of it can be told by looking in the mirror. Finally, virtually all of the disease risk and other tests should presently be taken with an enormous grain of salt.

I’m looking for a DNA test that will tell me what modern nationality’s Y chromosome is most similar to my own. Any brand suggestions?
I can’t recommend any of them; it’s really luck of the draw. If you have one which is well studied, you’ll know. If you have one which isn’t well studied (like me), you’ll be shit out of luck. Anything will tell you generalities like “Western European” or “out of Africa” at least. 23andme is easiest to use and comes with the most over all features, IMO. FTDNA for deep clade studies on your Y and mit-DNA. Those were the two I have direct experience with. Filtered out the rest before ordering by process of elimination.
The statistics package R has some tree-construction programs you can run. I used data from a service called Y-search and a British Isles project. The programs were developed for serious genetics uses, so it’s kind of a secret vice to use them for trivial ego curiosity. But we can all use a few secret vices.
So far, I just know I’m R1a. The boys with that marker started off in a backwater south of the Urals, developed some technology having to do with horses and went riding off in all directions. The marker is found in Uigars, people of the subcontinent, Saudi Arabia and northern Europe. It’s about 5% of the British pop, entering with Angles, Saxons, Danes and Vikings. The big concentration of the surname is in NW England, so it was probably adopted there by guys with Angle markers. When I run the R program, there’s distant matches to Norfolk and northern Europe but the common ancestor is 2-3 K years ago. When some of my lazy relatives in England get around to testing, the situation will be a little clearer.
I figured there was a bunch of stuff on bioconductor and CRAN. I haven’t tried to uncork this yet.
Can you recommend some packages for me to noodle around with? I’d mostly wonder what data sets you’d be using to compare the genome against.
I think if you’re y-search compatible, you can use the interpretome, which is a pretty cool gizmo for autosomal DNA analysis.
Flip me your email address and I’ll send you my stuff. For R, there’s several packages. I chose a simple one that outputs a tree in PDF. The file is big but you can magnify to look at interesting parts. My understanding of the algorithm is that it picks the two closest, replaces them with an ancestor and recurses. The numbers are repeats of a single nucleotide which changes when the copying mechanism stutters and repeats or leaves one out. Assuming equal probabilities at each locus and for adding or leaving one out, the distance metric is straightforward.
The only thing you’ll need to change is to download R1b data from Ysearch and/or British Isles DNA. My R1a is only 5% of British pop, which is still large. I think I may have some R1b already downloaded and cleaned up for R. I correspond with a guy with the same surname who has a Baron in the family and I was hoping to see the Norman names from the Middle Ages group in the same part of the tree. No such luck.
I have a big file of cleaned-up R1b markers from British Isles DNA. IIRC it’s a subset, but you may find it useful.
“While I have no immediate plans for breeding”
You should. The next generation(s) need intelligence and self discipline.
The problem with intelligence and self discipline is it makes you realize what a pain in the ass having kids would be, and how easy it is to have a nice life without it.
KIds bring their own rewards. And torments.
And yes, you can have a nice life without kids. Maybe even nicer in a lot of ways.
But that’s ok, as it means my daughter will have less competition and be able to charge higher prices.