FREP

A database of functional repeats in mouse cDNAs

Takeshi Nagashima, Hideo Matsuda, Diego G. Silva, Nikolai Petrovsky, Takeya Kasukawa, Takahiro Arakawa, Piero Carninci, Jun Kawai, Yoshihide Hayashizaki, Akihiko Konagaya, Christian Schönbach

Research output: Contribution to journalArticle

3 Citations (Scopus)

Abstract

The FREP database (http://facts.gse.riken.go.jp/FREP/) contains 31 396 RepeatMasker-identified non-redundant variant repeat sequences derived from 16 527 mouse cDNAs with protein-coding potential. The repeats were computationally associated with potential effects on transcriptional variation, translation, protein function or involvement in disease to identify Functional REPeats (FREPs). FREPs are defined by the (i) occurrence of exon-exon boundaries in repeats, (ii) presence of poly-adenylation sites in 3′UTR-located repeats, (iii) effect on translation, (iv) position in the protein-coding region or protein domains or (v) conditional association with disease MeSH terms. Currently the database contains 9261 (29.5%) inferred FREPs derived from 6861 (41.5%) mouse cDNAs. Integrated evidence of the functional assignments and dynamically generated sequence similarity search results support the exploration and annotation of functional, ancestral or taxon-specific repeats. Keyword and pre-selected feature searches (e.g. coding sequence-repeat or splice site-repeat relations) support intuitive database querying as well as the retrieval of repeat sequences. Integrated sequence search and alignment tools allow the analysis of known or identification of new functional repeat candidates. FREP is a unique resource for illuminating the role of transposons and repetitive sequences in shaping the coding part of the mouse transcriptome and for selecting the appropriate experimental model to study diseases with suspected repeat etiology contributions.

Original languageEnglish
JournalNucleic Acids Research
Volume32
Issue numberDATABASE ISS.
Publication statusPublished - Jan 1 2004
Externally publishedYes

Fingerprint

Complementary DNA
Databases
Exons
Sequence Alignment
Nucleic Acid Repetitive Sequences
Protein Biosynthesis
Transcriptome
Open Reading Frames
Theoretical Models
Proteins
Protein Domains

ASJC Scopus subject areas

  • Genetics

Cite this

Nagashima, T., Matsuda, H., Silva, D. G., Petrovsky, N., Kasukawa, T., Arakawa, T., ... Schönbach, C. (2004). FREP: A database of functional repeats in mouse cDNAs. Nucleic Acids Research, 32(DATABASE ISS.).

FREP : A database of functional repeats in mouse cDNAs. / Nagashima, Takeshi; Matsuda, Hideo; Silva, Diego G.; Petrovsky, Nikolai; Kasukawa, Takeya; Arakawa, Takahiro; Carninci, Piero; Kawai, Jun; Hayashizaki, Yoshihide; Konagaya, Akihiko; Schönbach, Christian.

In: Nucleic Acids Research, Vol. 32, No. DATABASE ISS., 01.01.2004.

Research output: Contribution to journalArticle

Nagashima, T, Matsuda, H, Silva, DG, Petrovsky, N, Kasukawa, T, Arakawa, T, Carninci, P, Kawai, J, Hayashizaki, Y, Konagaya, A & Schönbach, C 2004, 'FREP: A database of functional repeats in mouse cDNAs', Nucleic Acids Research, vol. 32, no. DATABASE ISS..
Nagashima T, Matsuda H, Silva DG, Petrovsky N, Kasukawa T, Arakawa T et al. FREP: A database of functional repeats in mouse cDNAs. Nucleic Acids Research. 2004 Jan 1;32(DATABASE ISS.).
Nagashima, Takeshi ; Matsuda, Hideo ; Silva, Diego G. ; Petrovsky, Nikolai ; Kasukawa, Takeya ; Arakawa, Takahiro ; Carninci, Piero ; Kawai, Jun ; Hayashizaki, Yoshihide ; Konagaya, Akihiko ; Schönbach, Christian. / FREP : A database of functional repeats in mouse cDNAs. In: Nucleic Acids Research. 2004 ; Vol. 32, No. DATABASE ISS.
@article{5596678072554d2882920cc63c41ebaf,
title = "FREP: A database of functional repeats in mouse cDNAs",
abstract = "The FREP database (http://facts.gse.riken.go.jp/FREP/) contains 31 396 RepeatMasker-identified non-redundant variant repeat sequences derived from 16 527 mouse cDNAs with protein-coding potential. The repeats were computationally associated with potential effects on transcriptional variation, translation, protein function or involvement in disease to identify Functional REPeats (FREPs). FREPs are defined by the (i) occurrence of exon-exon boundaries in repeats, (ii) presence of poly-adenylation sites in 3′UTR-located repeats, (iii) effect on translation, (iv) position in the protein-coding region or protein domains or (v) conditional association with disease MeSH terms. Currently the database contains 9261 (29.5{\%}) inferred FREPs derived from 6861 (41.5{\%}) mouse cDNAs. Integrated evidence of the functional assignments and dynamically generated sequence similarity search results support the exploration and annotation of functional, ancestral or taxon-specific repeats. Keyword and pre-selected feature searches (e.g. coding sequence-repeat or splice site-repeat relations) support intuitive database querying as well as the retrieval of repeat sequences. Integrated sequence search and alignment tools allow the analysis of known or identification of new functional repeat candidates. FREP is a unique resource for illuminating the role of transposons and repetitive sequences in shaping the coding part of the mouse transcriptome and for selecting the appropriate experimental model to study diseases with suspected repeat etiology contributions.",
author = "Takeshi Nagashima and Hideo Matsuda and Silva, {Diego G.} and Nikolai Petrovsky and Takeya Kasukawa and Takahiro Arakawa and Piero Carninci and Jun Kawai and Yoshihide Hayashizaki and Akihiko Konagaya and Christian Sch{\"o}nbach",
year = "2004",
month = "1",
day = "1",
language = "English",
volume = "32",
journal = "Nucleic Acids Research",
issn = "0305-1048",
publisher = "Oxford University Press",
number = "DATABASE ISS.",

}

TY - JOUR

T1 - FREP

T2 - A database of functional repeats in mouse cDNAs

AU - Nagashima, Takeshi

AU - Matsuda, Hideo

AU - Silva, Diego G.

AU - Petrovsky, Nikolai

AU - Kasukawa, Takeya

AU - Arakawa, Takahiro

AU - Carninci, Piero

AU - Kawai, Jun

AU - Hayashizaki, Yoshihide

AU - Konagaya, Akihiko

AU - Schönbach, Christian

PY - 2004/1/1

Y1 - 2004/1/1

N2 - The FREP database (http://facts.gse.riken.go.jp/FREP/) contains 31 396 RepeatMasker-identified non-redundant variant repeat sequences derived from 16 527 mouse cDNAs with protein-coding potential. The repeats were computationally associated with potential effects on transcriptional variation, translation, protein function or involvement in disease to identify Functional REPeats (FREPs). FREPs are defined by the (i) occurrence of exon-exon boundaries in repeats, (ii) presence of poly-adenylation sites in 3′UTR-located repeats, (iii) effect on translation, (iv) position in the protein-coding region or protein domains or (v) conditional association with disease MeSH terms. Currently the database contains 9261 (29.5%) inferred FREPs derived from 6861 (41.5%) mouse cDNAs. Integrated evidence of the functional assignments and dynamically generated sequence similarity search results support the exploration and annotation of functional, ancestral or taxon-specific repeats. Keyword and pre-selected feature searches (e.g. coding sequence-repeat or splice site-repeat relations) support intuitive database querying as well as the retrieval of repeat sequences. Integrated sequence search and alignment tools allow the analysis of known or identification of new functional repeat candidates. FREP is a unique resource for illuminating the role of transposons and repetitive sequences in shaping the coding part of the mouse transcriptome and for selecting the appropriate experimental model to study diseases with suspected repeat etiology contributions.

AB - The FREP database (http://facts.gse.riken.go.jp/FREP/) contains 31 396 RepeatMasker-identified non-redundant variant repeat sequences derived from 16 527 mouse cDNAs with protein-coding potential. The repeats were computationally associated with potential effects on transcriptional variation, translation, protein function or involvement in disease to identify Functional REPeats (FREPs). FREPs are defined by the (i) occurrence of exon-exon boundaries in repeats, (ii) presence of poly-adenylation sites in 3′UTR-located repeats, (iii) effect on translation, (iv) position in the protein-coding region or protein domains or (v) conditional association with disease MeSH terms. Currently the database contains 9261 (29.5%) inferred FREPs derived from 6861 (41.5%) mouse cDNAs. Integrated evidence of the functional assignments and dynamically generated sequence similarity search results support the exploration and annotation of functional, ancestral or taxon-specific repeats. Keyword and pre-selected feature searches (e.g. coding sequence-repeat or splice site-repeat relations) support intuitive database querying as well as the retrieval of repeat sequences. Integrated sequence search and alignment tools allow the analysis of known or identification of new functional repeat candidates. FREP is a unique resource for illuminating the role of transposons and repetitive sequences in shaping the coding part of the mouse transcriptome and for selecting the appropriate experimental model to study diseases with suspected repeat etiology contributions.

UR - http://www.scopus.com/inward/record.url?scp=9144274362&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=9144274362&partnerID=8YFLogxK

M3 - Article

VL - 32

JO - Nucleic Acids Research

JF - Nucleic Acids Research

SN - 0305-1048

IS - DATABASE ISS.

ER -