• 3 Gbps
  • 20k genes
  • 37.2 trillion cells[ref]
gunzip --keep GRCh38_latest_genomic.fna.gz
The key cladograms:
Video 1. When We Took Over the World by PBS Eons (2019) Source.
Video 1. When We Met Other Human Species by PBS Eons (2019) Source.
Video 1. How Humans Lost Their Fur by PBS Eons (2020) Source. Says it is linked to bipedalism to help hunting in hot weather. But could only happen fully after the invention of fire, otherwise you'd be too cold at night.
The point of these is that they are good for transfection apparently.
20k genes, 3 billion base pairs. We can handle this!!!
This is really cool. Ciro Santilli would be tempted to participate, but his wife is not a fan, in part due to the loss of privacy of children. Maybe she is right...
Someone should implement a version of that where you can upload your privately sequenced genome and get analytics for free.
This was the first large part of the genome that was sequenced, in 1981: Cambridge Reference Sequence. Presumably they picked it because it is short and does not undergo crossover.
About 16.6 kbp:
TODO: many places say "exactly" 16,569, it seems that variable number tandem repeat are either rare or don't occur!
By Fred Sanger's group.
As mentioned by Craig Venter in 100 Greatest Discoveries by the Discovery Channel (2004-2005), the main outcomes of the project were:
  • it established the ballpark number of human genes
  • showed that human genomes are very similar across individuals.
Important predecessors:
This was one of the first notable country-led large scale sequencing efforts of the world.
UniProt human: It is interesting to see in the Mutagenesis how many known mutations can increase or decrease SARS-CoV-2 S protein binding affinity.