Improved Future-Plans.md.

Simon Sarasova 2024-05-02 07:48:30 +00:00
parent 89b6a2d3d4
commit 6fd4a15125
GPG key ID: EEDA4103C9C36944
5 changed files with 9 additions and 8 deletions


@@ -6,6 +6,7 @@ Small and insignificant changes may not be included in this log.
 ## Unversioned Changes
+* Improved Future-Plans.md. - *Simon Sarasova*
 * Improved Documentation.md. - *Simon Sarasova*
 * Upgraded all uses of math/rand to math/rand/v2. - *Simon Sarasova*
 * Improved Whitepaper.md. Race and ancestry are now seperate concepts. - *Simon Sarasova*


@@ -9,4 +9,4 @@ Many other people have written code for modules which are imported by Seekia. Th
 Name | Date Of First Commit | Number Of Commits
 --- | --- | ---
-Simon Sarasova | June 13, 2023 | 239
+Simon Sarasova | June 13, 2023 | 240


@@ -101,7 +101,7 @@ Seekia is not fully operational.
 Hosts and clients will not connect to the internet, and you will not be able to download profiles or chat with users.
-There are many TODOs throughout the code, and the `/documentation/Future Plans.md` document describes many features that need to be built.
+There are many TODOs throughout the code, and the `/documentation/Future-Plans.md` document describes many features that need to be built.
 ## Simulating Use


@@ -315,11 +315,11 @@ The current method for predicting polygenic disease risks and traits is not as i
 Our current model adds and subtracts the likelihood values of various SNPs that are reported to have an effect on polygenic diseases and traits.
-A much better is to train a neural net to predict traits and polygenic diseases on a large number of genes. There are methods that exist to find the list of genes that have an effect on each trait/disease. For example, height is said to be effected by ~10,000 SNPs. These are the genes to feed into the neural net for each trait/disease. These are also the genes that users will share in their profiles. See `createGeneticAnalysis.go` for information on how offspring predictions would work.
+A much better method is to train a neural net to predict traits and polygenic diseases on a large number of genes. There are methods that exist to find the set of genes that have an effect on each trait/disease. For example, height is said to be effected by ~10,000 SNPs. Many GWAS studies exist which report which genes are responsible for certain traits and diseases. These are the genes to feed into the neural net for each trait/disease. These are also the genes that users will share in their profiles. I have already started to try to build this system. See `geneticPrediction.go` for an implementation of trait prediction using neural networks, and `createGeneticAnalysis.go` for information on how offspring predictions would work.
 This method requires training data, which is largely unavailable for public use. We need fully open training data, not data that requires registration or permission to download.
-OpenSNP.org is a free genomic data repository. OpenSNP relies on user submitted data, which can be falsified. OpenSNP should add a verification system so data provided by trustworthy people can be prioritized.
+[OpenSNP.org](https://opensnp.org) is a free genomic data repository. OpenSNP relies on user submitted data, which can be falsified. OpenSNP should add a verification system so data provided by trustworthy people can be prioritized.
 More people should create public domain genome banks. If they had multiple locations surveying and sequencing people every day, they could sequence tens of thousands of people a year. This data would quickly be sufficient to train the neural nets to predict attributes with some accuracy. Each participant would have to sign an agreement to release their response and genome into the public domain.
@@ -439,13 +439,13 @@ These servers should be run by trusted entities, and their domain names and sign
 Users should be able to send files through calls. This way, users can send their genomes through a secure channel without having to meet in person.
-### Offspring Appearance Prediction
+### Offspring Race Prediction
-Users of Seekia should be able view a predicted appearance image of their offspring with each user. This is useful when deciding who to mate with.
+Users of Seekia should be able to understand what their offspring with each potential mate is likely to look like. This is useful when deciding who to mate with.
-Seekia should be able to download images of people who are racially similar to each user pair's calculated offspring using the same racial similarity calculation method already present in Seekia. This calculation includes comparing genes which influence traits such as skin color, eye color, hair color, and facial structure. The calculation also incorporates ancestral similarity.
+Seekia should generate many prospective offspring images by using both user's photos, ancestry, and trait information. A service called BabyAC exists which creates a prospective baby image from images of both parents (see [Baby-AC.com](https://baby-ac.com/en)). Seekia should predict an adult appearance.
-Seekia could also generate a prospective offspring image by using both user's photos, ancestry, and trait information. A service called BabyAC exists which creates a prospective baby image from images of both parents (see [Baby-AC.com](https://baby-ac.com/en)).
+Seekia should also be able to download images of people who are racially similar to each user pair's calculated offspring using the same racial similarity calculation method already present in Seekia. This calculation includes comparing genes which influence traits such as skin color, eye color, hair color, and facial structure. The calculation also incorporates ancestral similarity.
 To create this feature, we need many people to upload their genomes, ancestry composition results, and images of themselves to a database. Each participant must digitally consent to releasing this information into the public domain. The ancestral analyses can be provided by various companies and eventually Seekia.