Labels

Search This Blog

Showing posts with label Ketchum DNA Study. Show all posts
Showing posts with label Ketchum DNA Study. Show all posts

Saturday, September 7, 2019

Sample 26 mtDNA Electropherograms Show Degradation/Contamination




Figure 1.  Ketchum S26 Electropherograms from SGP website.  Green bars above each peak proportional to quality of read.  Peak color indicates base:  A T  C.  For research and education only, "Fair Use." 

Melba Ketchum has consistently claimed in her paper [1], on radio, on TV, on the Sasquatch Genome Project (SGP) website [2], on YouTube [3],  and at every other opportunity,  that her samples are not contaminated or degraded, in spite of very anomalous results, many of which are easily explained by such.  She posted Sample 26 (the Smeja find) raw data mtDNA electropherogram files on the 
SGP website.  I downloaded all 272 such files, which require special software to open, examine and assemble into a complete mtDNA sequence.  I purchased DNABaser software from Heracle Biosoft to do just that.  It cost $159 and runs on my PC.  The above Fig. 1 is a screen grab of  six randomly selected, representative electropherograms of the 272.

This data was obtained by the Sanger Dideoxy Sequencing Method, outlined in Fig. 2.


Figure 2.  Sanger Dideoxy Method.  After Wikipedia.  For research and education only, "Fair Use."   

In this method, segments are elongated (as complementary strands) and terminated with "fluorochrom" (i.e. fluoroprobe, or fluorescent tag) nucleotides with different fluorescent wavelengths for each of the four nucleotides (A, G, T, C).  The successive elongated segments are separated by electrophoresis according to their length, then detected and identified by their characteristic fluorescent wavelength (stimulated by a laser).  Software makes the plots as represented schematically in Fig. 2 lower right or actually in Fig. 1 and generates a sequence (Fig. 1 below each electropherogram, Fig. 2 lower right).

All but the shortest (few hundred bp) sequences must be sequenced in segments, and then assembled by computer, based on overlaps, as shown in Fig. 3.

Figure 3.  Assembly.  Overlaps underlined.  After WikipediaFor research and education only, "Fair Use."


Mitochondrial DNA assembly is handily done on a PC.  Only very large data sets (e.g. whole nuclear genomes) from next generation sequencing, require a "super computer" or a network of linked servers to assemble.

Sanger sequencing and subsequent assembly does not work for severely degraded DNA, whether from a single or multiple sources.  This is because of the presence of sequence fragments which are self-primed and which are never the less elongated and tagged with a fluorescent probe.  These fragments coelute with the fragments of interest as shown in Fig. 1 ("Poor").  The "Excellent" electropherogram has no such coelution:  each peak stands alone and is separated from its neighbors so that the fluorescent detector can make an unambiguous base call (A, G, T, or C). The "Good" electropherogram can also be used because coeluting peaks are much smaller in most cases and can be ignored.  The few ambiguous, double peaks will hopefully be clarified by another overlapping sequence fragment.  Otherwise these few remain unknown bases (or possibly polymorphisms).  The "Poor" electropherograms are totally uninterpretable.  They have multiple overlapping peaks of comparable intensity.  Those "?" electropherograms might have some salvageable sequence runs, but should be considered dubious at best.   So, two usable e-grams out of these six.  A well collected and preserved single species DNA sample would produce only usable e-grams.

Unfortunately, the 272 electropherograms contained only 47 that could be assembled into a sequence of 5260 bp out of an expected 16,568 bp human mitochondrial genome.  The remaining 225 electropherograms contained only 11 which could be assembled into a 1923 bp sequence.  Other assemblies were shorter still.

The first assembly (47 e-grams) aligned with the published S26 mtDNA sequence [4], base positions 11401-16551, only 96% with 74 gaps.   The second best assembly (11 e-grams) aligned with published [4] S26 base positions 13718 - 15435 only 97% with 27 gaps.  This is rather poor agreement between these raw data and the published S26 sequence [4].   The 47 e-gram sequence aligned with the 11 e-gram sequence only 93% with 1643 identities and 69 gaps.

Thus, only 32% of the mtDNA genome could be sequenced with these data, and the sequence agreed very poorly with the published S26 sequence produced by Family Tree DNA [4].  FTDNA assumes a human sequence and uses human primers, so DNA of any other species present would not be sequenced. 

These data call into question the mtDNA sequencing and haplogroup designation of Sample 26, and hence also the Ketchum claim that this sample represents a human-unknown primate hybrid.  Most likely, the sample is a black bear contaminated by one or more humans and severely degraded, as shown in my previous blogs. 

References

[1]  Ketchum M. S. et al. (2013) Novel North American hominins: next generation sequencing of three whole genomes and associated studies. DeNovo 1:1.  Online only: http://sasquatchgenomeproject.org/ 


Friday, March 24, 2017

Die Tiefe: A Great Blog for Bigfoot Issues

Christoph Kummer is a journalist who writes for various newspapers in Switzerland.  His blog, "Die Tiefe: Der Wahrheit auf der Spur..." (The Deep:  Truth on Track) is an online newspaper that deals with mysteries and controversies of all kinds ( http://www.dietiefe.com/ ).  So far, the main themes have been 9/11 and bigfoot.  Articles are in English and German.  He has covered the bigfoot phenomenon historically and has interviewed a number of involved researchers, including me.  So, of course, I encourage you to check out my recent interview with Chris on "Die Tiefe" to get an overview of what went wrong with the Ketchum bigfoot DNA study.  A number of readers have said that it was understandable to the layman.  Chris' poignant and to-the-point questions have made this a very clear expose.   

Tuesday, January 17, 2017

Two New Peer-reviewed Papers are at Odds with Ketchum et al. Conclusions

“The above commonly reported traits, as well as other scientific evidence lending credence to the existence of Sasquatch, have been thoroughly researched and documented in both books and in peer reviewed manuscripts. refs.4-13”  (Ketchum et al., 2013)

Peer review is the process by which journal articles are reviewed by experts prior to publication.  Reviewers are selected by the editor of the journal for their proven expertise in the relevant area, and their identity is kept anonymous. Their criticisms and suggestions are forwarded to the author for consideration and possible revision of the manuscript.  It’s not a perfect system, but it greatly reduces the number of scurrilous publications, honest errors, and unclear narratives.  Recall that Ketchum et al. (2013) failed peer review in two journals before being self-published.

Recently I had published two, peer-reviewed papers (Hart, 2016a, 2016b).  “Not Finding Bigfoot in DNA” made it to Volume 4 (pp. 39-51) of The Journal of Cryptozoology, a publication of the Center for Fortean Zoology, edited by Dr. Karl Shuker.  Zoologist Shuker is well known for his many books and blog articles on cryptozoology – the study of animals not (yet) proven to exist by science.  My paper addresses the three nuclear DNA sequences published by Ketchum et al. (2013), said by them to be “a novel mosaic pattern of nuclear DNA comprising novel sequences that are related to primates interspersed with sequences that are closely homologous to humans.“  As proven in my previous blogs, this claim is false:  the sequences are from a bear (S26), a human (S31), and a dog (S140), with no significant traces of other primates.  I review the five different approaches to producing realistic phylotrees* of S26 and S140 that clearly show the samples are related to bears and dogs, respectively.   In contrast, Ketchum phylotrees for S31 (their Supp. Fig. 5) and S140 (their Supp. Fig. 6) showed homology to mice, chicken, and fish, not at all in support of their conclusion above.   Volume 4 of the Journal of Crypozoology is available from Amazon.com.
 
My second paper, “DNA as Evidence for the Existence of Relict Hominoids,” is a review of known publications related to DNA of purported cryptid hominoids, and can be downloaded free from:  https://www.isu.edu/media/libraries/rhi/research-papers/HART-DNA-Evidence.pdf
, the Relict Hominoid Inquiry website of Prof. Jeff Meldrum of Idaho State.  Meldrum’s specialty is anatomy, especially primate bipedal motion, and he has studied hundreds of purported sasquatch footprint casts in great detail, the great majority of which he sees as evidence for the existence of a large North American primate.

Finally, one of the criteria for publication in peer-reviewed journals is that the references are authentic and support the statements in the text to which they are appended.  Some journals require that the author certify this.  Padding of references to give the appearance of command of the literature is explicitly discouraged.  References 5 (Malinkovitch et al., 2004) and 6 (Coltman and Davis, 2006) of Ketchum et al. (2013), both with titles in apparent support of the Ketchum et al. (2013) claim quoted at the beginning of this blog, are anything but supportive, if one bothers to read them.  Both references are reviewed in my second paper.

The Milinkovitch et al. (2004) paper has an admitted April Fool’s joke title.  The Himalayan hair sample was found to be from a horse, nothing close to a primate.  Their phylotree shows this clearly.

Coltman and Davis (2006) matched the DNA of a Yukon hair to the American bison exactly, in spite of their tongue-in-cheek title.   Their phylotree of related ungulates supports their conclusion.

Both of these papers are good examples of how an unknown DNA sample should be analyzed without bias.  Read your references, Melba.  You might learn something. 


 *  A phylotree is a DNA-based evolutionary tree of life with a topology determined by degree of match (distance – or % of matching base pairs in homologous DNA sequences) between the various species in the branches.  A phylotree is produced from the results of a DNA search, for example using BLAST® as both Ketchum et al. and I did.


REFERENCES  (Unpadded)

Coltman, D. and Davis, C. (2006) “Molecular cryptozoology meets the Sasquatch.“ TRENDS in Ecology and Evolution 21(2): 60–61.

Hart, H. V. (2016a)  “Not Finding Bigfoot in DNA.”  Journal of Cryptozoology 4: 39-51.

Hart, H. V. (2016b)  “DNA as Evidence for the Existence of Relict Hominoids” Relict Hominoid Inquiry 5: 8-31.  
https://www.isu.edu/media/libraries/rhi/research-papers/HART-DNA-Evidence.pdf

Ketchum, M. S. et al. (2013). Novel North American Hominins: Next Generation Sequencing of Three Whole Genomes and Associated Studies.“ DeNovo,  1 (1):  Online only:  http://sasquatchgenomeproject.org/sasquatch_genome_project_002.htm


Milinkovitch, M. C. et al. (2004) “Molecular phylogenetic analyses indicate extensive morphological convergence between the ‘yeti’  and primates.” Molecular Phylogenetics and Evolution 31: 1–3.

Friday, June 17, 2016

More Inconsistencies and Evidence for Contamination in Ketchum et al. Supplementary Figures


ABSTRACT

Scientific results should be consistent, otherwise more experiments are needed to clarify discrepancies. The Ketchum et al.(2013) Supplementary Figures 1, 2, and 3, which are mitochondrial DNA phylotrees for samples 26, 31, and 140, respectively, are inconsistent with Supplementary Figures 7, 8, and 9, respectively. Haplogroups of the closest relatives do not agree: Supp. Fig. 1 with 7 (S26), 2 with 8 (S31), or 3 with 9 (S140). The Most likely explanation is contamination by at least one human in each case.

INTRODUCTION


Ketchum et al. (2013) mitochondrial DNA results have been previously reviewed (Paper 2 at right), and the Ketchum claim that they are all 100% human was found to be an overstatement. In fact, it was found that eight of 18 samples with complete mitochondrial sequences had too many mutations from the closest haplogroup to be statistically probable (less than 1% probability). Also, eight of 11 samples with HVR-1 (hypervariable region 1) only mutations listed were phylogenetically ambiguous, i.e., alternate haplogroups were equally likely. From these new results, it was concluded that either the samples were contaminated and/or degraded, or that any possible hybridization events would have to have been followed by subsequent mutations along nonhuman evolutionary lines and on a different time scale. The results in the current paper prove that S31 has human contamination by an individual with a different haplogroup than previously reported for that sample. Samples 26 and 140 have previously been shown to be from a black bear and a dog, respectively, from nuclear DNA matches (See Paper 1 at right). They are now shown here to be contaminated by two humans of different haplogroups.

METHODS

This study involves extracting data from six circular phylotrees, Supp. Figs. 1, 2, 3, 7, 8, and 9 from Ketchum et al. (2013) for comparisons. Phylotrees of this kind are generated from the query results (hits) in BLAST (TM) through the "Distance tree of results" option. The goal was to determine the haplogroup [1] of the nearest match to each query: S26, S31, or S140 in Supp. Figs. 1 and 7, 2 and 8, and 3 and 9, respectively. Phylotrees in Supp. Figs. 1, 2, and 3 were generated from complete mitochondrial sequences produced by Family Tree DNA. Phylotrees in Supp. Figs. 7, 8, and 9 [3] were generated from supercontigs, but the details of which mitochondrial genes were employed were not stated.

The title of the nearest phylotree branch tip was searched in GenBank (R)(the NCBI databases), and the accession number retrieved. A BLAST(TM)[3] alignment of this accession with rCRS (Revised Cambridge Reference Sequence, Genbank accession NC_012920.1) produced a set of rCRS-based mutations, as seen in the "Graphics" option of the results page. From these mutations a haplogroup was determined using the programs FASTmtDNA and mtDNAable as previously described (Paper 2 at right).

RESULTS

Sample 26

Table 1 presents the results for S26. The nearest haplogroup from Supp. Fig. 1 (H1)closely matches that determined by Family Tree DNA (H1a). However, The Supp. Fig. 7 result, T2b, is far removed. Interestingly, this is the haplogroup of the human contamination determined by two independent studies (Cassidy, 2013; Khan and White, 2012-the Tyler Huggins Report at right). Also to be noted is that S26 is one of the samples previously found to have too many extra mutations (16) to be called "modern human," according to the accepted mtDNA phylotree (van Oven, 2010) and a Poisson Distribution of mutations (Paper 2 at right). Given that the nuDNA of this sample matched a black bear (Paper 1 at right; Cassidy, 2013; Khan and White, 2012; Sykes et al., 2014 - See Tyler Huggins Report and Sykes Paper at right), it can be concluded that there are two sources of human contamination in this sample, with haplogroups H1a/H5e and T2b.



Table 1.  S26  
Nearest Matches to S26 mtDNA in Ketchum phylotrees
Supp.
Accession
Mis.vs.
Hap.
Fig.
S26
Homo sapiens clone 3760 mitochondrion,
1
JQ703795.1
16
H1
complete genome
Homo sapiens isolate NEC20 mitochondrion
7
JQ664540.1
22
T2b
complete genome
                  From Ketchum Supp. Data 2:
H1a
S26 from mtDNAable:
H5e



Columns left to right: Accession title, Ketchum et al. Supplementary Figure number,
Accession number (GenBank), Mismatches vs.S26, Haplogroup.
 







Sample 31

Table 2 presents the results for S31. The nearest haplogroup from Supp. Fig. 2 (L1a1) is close to that determined by Family Tree DNA (L0d2a). However, The Supp. Fig. 8 results, T2b and T2b8, are far removed. The nuDNA of this sample matches modern human (Paper 1 at right). Sample 31 is contaminated by another human of T2b haplogroup.



Table 2.  S31





Nearest Matches to S31 mtDNA in Ketchum phylotrees

Supp..

Accession

Mis. vs.

Hap.

Fig.

S31






Homo sapiens haplotype A10L1A2 mitochondrion

2

AY195777.1

2

L0d2a1

complete genome

(A10L1A2)*

Homo sapiens isolate 157 T2i Tor354 mitochondrion

8

JQ798131.1

100

T2 or T2b-16362C

complete genome

(T2i)*

Homo sapiens isolate 13T mitochondrion

8



complete genome

JX081995.1

104

T2b8








From Ketchum Supp. Data 2:

L0d2a

S31 from mtDNAable:

L0d2a1

*(Haplogroup) taken from Accession




Sample 140

Table 3 presents the results for S140. The nearest haplogroup from Supp. Fig. 3 (D4b2b1) matches that determined by Family Tree DNA for HVR-1 only(D). However, The Supp. Fig. 9 results, both R2'JT, are far removed. The nuDNA of this sample matches a dog (First paper at right). This sample is contaminated by two humans with haplogroups D and R2.

Table 3. S140


Nearest Matches to S140 mtDNA in phylotrees

Supp.

Accession

Mis vs

Hap.

Fig.

S140

Homo sapiens mitochondrial DNA complete genome

3

AP008361.1

No complete sequence available**

D4b2b1

isolate PDsq0023

Homo sapiens isolate R1 mitochondrion,

9

JX155264.1

R2'JT(R2a1)*

complete genome

Homo sapiens isolate R2 mitochondrion

9

JX155265.1

R2'JT(R2a1)*

complete genome

           From Ketchum Supp. Data 2:

D (HVR-1)

      From Behar, et al.(2012):

D (HVR-1)






*  (Haplogroup) taken from Accession

**  Oddly Supp. Figs, 3 and 9 require a full sequence, but Supp. Data 2 contains only HVR-1 mutations



CONCLUSION

Over all three samples, using supercontigs resulted in phylotrees with haplogroups which were inconsistent with full sequence derived haplogroups.

Samples 26 and 140 are contaminated by two modern humans. Sample 31 is contaminated by one additional human.

Insistence by Dr. Melba Ketchum, DVM, that her samples were not contaminated when analyzed is not warranted. Very likely some additional Ketchum et al. anomalous mtDNA samples are so because of contamination (Paper 2 at right).


NOTES


[1] Haplogroups are unique human mtDNA sequences, represented by their mutations from a standard, either rCRS (revised Cambridge Reference Sequence) or RSRS (Reconstructed Sapiens Reference Sequence). All known haplogroups of modern humans are represented in the phylotree of van Oven (2010) at www.phylotree.org. This tree stems from the root called "Mitochondrial Eve", the most recent common maternal ancestor (MRCA) of all humans. A haplotype is a particular allele (combination of SNPs-mutations) within a haplogroup and is designated by a preceding letter and number.

[2] Supp. Figs. 7, 8, and 9 are erroneously referred to in the Ketchum et al. (2013) text as Supp. Figs. 4, 5, and 6 in the last paragraph of the "Next Generation Whole Genome Sequencing" section. Supp. Figs. 4, 5, 6 are actually nuDNA-based phylotrees. See my blog "Melba Ketchum's Experts and Their Mistakes: What's in a Phylotree."

[3] BLAST (TM) is a search/match program which utilizes the National Center for Biotechnology Information (NCBI) GenBank databases. (Altschul et al.,1990; Madden, 2003). Its application has been described extensively on this blogsite. See under BLAST Search and Ketchum DNA Study Tabs above.


REFERENCES


Altschul, S. F.; Gish, W.; Webb, M.; Meyers, E. W.; Lipman, D. J. (1990). Basic local alignment search tool. Journal of Molecular Biology, 215 (no.3): 403-410.

Behar D.M.; van Oven, M.; Rosset, S.; Metspalu, M.; Loogväli, E.-L.; Silva, N. M.; Kivisild, T.; Torroni, A.; Villems, R. (2012) A “Copernican" reassessment of the human mitochondrial dna tree from its root. American Journal of Human Genetics, 90 (no.4): 675-684. http://dx.doi.org/10.1016/j.ajhg.2012.03.002

Cassidy, B. G. (2013). Technical Examination Report DNAS Case Number: 2012-006524. DNA Solutions, Inc. (Oklahoma City).  (See this blog at right)


Ketchum, M. S. et al. (2013). Novel north american hominins: next generation sequencing of three whole genomes and associated studies, DeNovo, 1:1. Online only: 
http://sasquatchgenomeproject.org/sasquatch_genome_project_002.htm

Khan, T. and White, B. (2012) Final report on the analysis of samples submitted by Tyler Huggins. Wildlife Forensic DNA Laboratory Case File 12-019, Trent University Oshawa (Peterborough, Ontario, Canada).  (See this blog at right.)


Madden, T. (2003). The BLAST sequence analysis tool. The NCBI Handbook; McEntyre, J; Ostell, J., Eds.; National Center for Biotechnology Information (Beth
esda, MD). http://www.ncbi.nlm.nih.gov/books/NBK21097/.

Sykes, B. C.; Rhettman A.; Mullis, R. A.; Hagenmuller, C.; Melton, T. W.; Sartori, M. (2014) Genetic analysis of hair samples attributed to yeti, bigfoot and other anomalous primates. Proceedings of the Royal Society B, 281: 20140161.

https://royalsocietypublishing.org/doi/full/10.1098/rspb.2014.0161

van Oven, M. (2010). Revision of the mtDNA tree and corresponding haplogroup nomenclature. Proceedings of the National Academy of Sciences USA, 107 (no. 11): E38-E39. http://dx.doi.org/10.1073/pnas.0915120107