User:TiagoLubiana/Modeling of RNAs
(Redirected from User:TiagoLubiana/Modelling of RNAs)
(as of October 2023)
The modelling of RNA species on Wikidata has been relatively neglected in the past initiatives.
- For Homo sapiens alone, over 24,000 entries on Wikidata include, at the same time, an non-coding RNA and the gene that encodes for it: https://w.wiki/7pNi
- Another 1,200 harbour mixes of gene and small nucleolar RNA: https://w.wiki/7pN$
- With all subclasses of RNA, the mix reaches 27,300 entries: https://w.wiki/7pP2
- Additionally there are over 4,400 entries for Homo sapiens that are catalogued as RNAs, but not genes: https://w.wiki/7pPB
- Most of them are microRNAs: https://w.wiki/7pPC, most with a miRBase pre-miRNA ID (P2870) value.
- There are >1,000 English Wikipedia pages about RNA species that are not genes: https://w.wiki/7pPK
- Some of them are duplicated on Wikidata, e.g. https://en.wikipedia.org/wiki/Mir-223 points to mir-223 (Q6871718) and MIR223 (Q18058135) points to https://en.wikipedia.org/wiki/MIR223. This is a clear case of bot-operated duplication.
- The non-coding RNA MALAT1 (Q18056223) is mapped to 2 UMLS entries, one for the RNA species (C3180393) and another for the gene (C1537647).
- Properties for RNAs (like Rfam ID (P3523) , RefSeq RNA ID (P639) and Ensembl transcript ID (P704)) are connected to genes.
- In contrast, UniProt protein ID (P352), UniProt protein ID (P352) and Ensembl protein ID (P705)) are connected to the proteins.