Computational analysis of full-length mouse cDNAs compared with human genome sequences

Domon, Shūhei; Shinagawa, Akira; Saito, Tetsuya; Kiyosawa, Hidenori; Yamanaka, Itaru; Aizawa, Katsunori; Fukuda, Shiro; Hara, Ayako; Itoh, Masayoshi; Kawai, Jun; Shibata, Kazuhiro; Hayashizaki, Yoshihide

Computational analysis of full-length mouse cDNAs compared with human genome sequences

2001

Although the sequencing of the human genome is complete, identification of encoded genes and determination of their structures remain a major challenge. In this report, we introduce a method that effectively uses full-length mouse cDNAs to complement efforts in carrying out these difficult tasks. A total of 61,227 RIKEN mouse cDNAs (21,076 full-length and 40,151 EST sequences containing certain redundancies) were aligned with the draft human sequences. We found 35,141 non-redundant genomic regions that showed a significant alignment with the mouse cDNAs. We analyzed the structures and compositional properties of the regions detected by the full-length cDNAs, including cross-species comparisons, and noted a systematic bias of GENSCAN against exons of small size and/or low GC-content. Of the cDNAs locating the 35,141 genomic regions, 3,217 did not match any sequences of the known human genes or ESTs. Among those 3,217 cDNAs, 1,141 did not show any significant similarity to any protein sequence in the GenBank non-redundant protein database and thus are candidates for novel genes.

Показать больше [+]

Ключевые слова АГРОВОК

algorithms animals complementary dna computational biology databases databases dna dna exons exons genes genetics humans humans introns methods mice mice sequence analysis

Библиографическая информация

Опубликовано в

Mammalian genome

Том 12 Выпуск 9 Нумерация страниц 673 - 677 ISSN 0938-8990

Издатель

John Wiley & Sons, Ltd

Другие темы

Genome; Human; Factual; Nucleic acid; Sequence homology; Expressed sequence tags; Complementary; Molecular sequence data

Язык

Английский

Примечание

2019-12-05

Тип

Journal Article; Text

В АГРИСе с: 2024-02-28

Формат: MODS

Поставщик данных

Эту запись предоставил National Agricultural Library

Откройте коллекцию этого поставщика данных в AGRIS

Ссылки

DOI DOI http://dx.doi.org/10.1007/s00335-001-2048-4

Посмотрите в Google Scholar

If you notice any incorrect information relating to this record, please contact us at [email protected] [email protected]

ФАО АГРИС — международная информационная система по сельскохозяйственным наукам и технологиям

Share

Computational analysis of full-length mouse cDNAs compared with human genome sequences

2001

Ключевые слова АГРОВОК

Библиографическая информация