Using DNA hashing as a way to identify individuals?
This is something tangential I've developed for my science-fantasy world with intelligent animals. For context: In this world, different taxonomic governments represent groups of related species. You have the Felines, Vulpines, Rodents, Avians, etc. Each of them technically belong to a different State but frequently intermingle and live in the same area, and taxonomic governments tend to also have territory/land associated with them where they primarily control the area, but other animals can and very much do still live there. Taxonomic governments have jurisdiction of the species within their scope no matter where they live, and are the ones responsible for having an ID system that works both within their own taxon and with other taxonomic governments and other official organizations.
Instead of making everyone carry ID cards or passports, which would be cumbersome for four-legged or winged animals to use, I envisioned a DNA-based ID system. The tech for this is definitely in the Star Trek levels of sci-fi, but it's basically a flat surface that you press your paw, wing, or other body part firmly onto, and a mechanism below produces a mild energy beam through your fur and skin which interacts with DNA in your cells and gives returns based on the specific sequence, and it's a safe, non-invasive DNA sequencer that can get a full read of your genetic code in seconds. The DNA scanner also checks for things like active metabolism and DNA synthesis and are generally configured to not even attempt to scan non-living cells, so you can't do something like use someone else's severed paw to make the system think you're them.
But since your full DNA sequences can be, for one, several gigabytes long and not conducive to things like printing onto certificates and migration papers or even just sending over the network to other agencies, and also contain actual information about things like your species, sex, family history and a bunch of sensitive stuff that you wouldn't want just anyone having access to, they typically take a cryptographic hash of the DNA and use that as an identifier for an individual animal. Kind of like how humans might have something like a social security number, animals in this world have a DNA Hash that governments use to identify them. Whenever a government agency in our world asks you to show some kind of ID like a driver's license, passport, health card, etc, they just have to scan their DNA and their information is automatically pulled from the right agency, using the hash to look it up. Even things like crossing international borders (of friendly nations) can be done with just a single biometric scan with no passport or ID card required. Basically, if you're animal in this world, the various government agencies around you refer to you as something like "8ed254569e8ddccea1784f569609aa32ced2691e2d22e99583ebd426cac76bd8" which is derived from your DNA sequence, and since you can't change your DNA, the same hash algorithm will always produce the same identifier, but better for privacy since it's impossible to reverse the algorithm and derive the original DNA sequence from the hash, and in theory only your own taxonomic government would have your full DNA sequence stored away on a server somewhere. Also extremely hard to falsify since it's literally identifying your body and not a card or anything that can be replaced.
What do you think? Does a system like this make sense? Are there glaring logistical or security issues that I'm not seeing? (Beyond just having a non-invasive and rapid DNA sequencing system in the first place, but that's what sci-fi handwaving is for.) Do you think a system like this is actually superior compared to physical ID media?
The issue that I see is that DNA isn't necessarily unique. For example, identical twins and species that were born asexually can have identical DNA. I don't really know any biology, so maybe there are other considerations too?
Also, hash collisions would happen randomly sometimes, so even if every individual had unique DNA, you couldn't use them as a unique identifier since there would sometimes be collisions.
Hope that helps.
Edit: I was wrong about hash collisions. Also, removed last paragraph because I was repeating myself.
Strong hashing algorithms have extremely rare collisions. There are many, many orders of magnitude more hash values than individuals. With a 512 bit hash space and 1.6×10^68 individuals, there is only a 1 in 10^18 chance of there being one or more collision according to this table: https://en.m.wikipedia.org/wiki/Birthday_problem#Probability_table
I would worry more about a random mutation in the sample yielding a completely different hash value for a person. Even one base pair mutating would yield an entirely different hash value. I suppose you could take multiple samples and pick the most common one.
Plus you have to worry about telomere shortening, but they could likely be ignore by the algorithm.
I think I was getting confused by how frequent collisions are in a hash-table. But those aren't actually hash collisions, they're hash-table entry collisions. So it makes sense that the likelihood of collisions is orders of magnitude smaller than I was thinking - the number of entries in any given hash-table is unfathomably tiny compared to the number of possible permutations that a reasonably-sized hash can have.
Good point. They would most likely need other factors to completely positively identify an individual animal, but DNA would probably get them 90% of the way there. For example they would also have a password set by each individual animal with the DNA acting as the "username," potentially combined with things like facial and fur-pattern recognition as well, and animals who are known to share the same DNA or have a hash collision would also be flagged in the database and return all possible results when looked up, which the authority looking up their information has to deal with by crosschecking with other information.