Database Commons

a catalog of worldwide biological databases

Database Profile

General information

URL: https://www.ebi.ac.uk/patentdata/nr
Full name: Non-redundant Patent Sequences
Description: Non-redundant patent sequence databases providing access to full-text patent documents.
Year founded: 2010
Last update:
Version:
Accessibility (manual check): Accessible
Country/Region: United Kingdom

Classification & Tag

Data type:
Data object: NA
Database category:
Major species: NA
Keywords:

Contact information

University/Institution: European Bioinformatics Institute
Address:
City: Cambridge
Province/State:
Country/Region: United Kingdom
Contact name (PI/Team): Rodrigo Lopez
Contact email (PI/Helpdesk): ls@ebi.ac.uk

Publications

The annotation-enriched non-redundant patent sequence databases. [PMID: 23396323]
Li W, Kondratowicz B, McWilliam H, Nauche S, Lopez R.

The EMBL-European Bioinformatics Institute (EMBL-EBI) offers public access to patent sequence data, providing a valuable service to the intellectual property and scientific communities. The non-redundant (NR) patent sequence databases comprise two-level nucleotide and protein sequence clusters (NRNL1, NRNL2, NRPL1 and NRPL2) based on sequence identity (level-1) and patent family (level-2). Annotation from the source entries in these databases is merged and enhanced with additional information from the patent literature and biological context. Corrections in patent publication numbers, kind-codes and patent equivalents significantly improve the data quality. Data are available through various user interfaces including web browser, downloads via FTP, SRS, Dbfetch and EBI-Search. Sequence similarity/homology searches against the databases are available using BLAST, FASTA and PSI-Search. In this article, we describe the data collection and annotation and also outline major changes and improvements introduced since 2009. Apart from data growth, these changes include additional annotation for singleton clusters, the identifier versioning for tracking entry change and the entry mappings between the two-level databases. Database URL: http://www.ebi.ac.uk/patentdata/nr/

Database (Oxford). 2013:2013() | 3 Citations (from Europe PMC, 2024-05-04)

Non-redundant patent sequence databases with value-added annotations at two levels. [PMID: 19884134]
Li W, McWilliam H, de la Torre AR, Grodowski A, Benediktovich I, Goujon M, Nauche S, Lopez R.

The European Bioinformatics Institute (EMBL-EBI) provides public access to patent data, including abstracts, chemical compounds and sequences. Sequences can appear multiple times due to the filing of the same invention with multiple patent offices, or the use of the same sequence by different inventors in different contexts. Information relating to the source invention may be incomplete, and biological information available in patent documents elsewhere may not be reflected in the annotation of the sequence. Search and analysis of these data have become increasingly challenging for both the scientific and intellectual-property communities. Here, we report a collection of non-redundant patent sequence databases, which cover the EMBL-Bank nucleotides patent class and the patent protein databases and contain value-added annotations from patent documents. The databases were created at two levels by the use of sequence MD5 checksums. Sequences within a level-1 cluster are 100% identical over their whole length. Level-2 clusters were defined by sub-grouping level-1 clusters based on patent family information. Value-added annotations, such as publication number corrections, earliest publication dates and feature collations, significantly enhance the quality of the data, allowing for better tracking and cross-referencing. The databases are available at: http://www.ebi.ac.uk/patentdata/nr/.

Nucleic Acids Res. 2010:38(Database issue) | 7 Citations (from Europe PMC, 2024-05-04)
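
The two-level clustering described in the 2010 abstract above can be illustrated with a minimal Python sketch. This is not the EMBL-EBI pipeline: the function names, the entry tuples, the family identifiers and the normalisation step (upper-casing and stripping whitespace before hashing) are all assumptions made for illustration. It groups entries by the MD5 checksum of the sequence (level-1), then sub-groups each level-1 cluster by patent family (level-2).

```python
import hashlib
from collections import defaultdict


def sequence_md5(seq: str) -> str:
    """Return the MD5 hex digest of a sequence.

    Assumption: sequences are compared case-insensitively and without
    whitespace, so they are normalised before hashing.
    """
    normalised = "".join(seq.split()).upper()
    return hashlib.md5(normalised.encode("ascii")).hexdigest()


def cluster_two_levels(entries):
    """Group (entry_id, sequence, family_id) tuples into two levels.

    Level-1 key: sequence MD5 (identical sequences share a checksum).
    Level-2 key: patent family identifier within each level-1 cluster.
    Returns {md5: {family_id: [entry_id, ...]}}.
    """
    clusters = defaultdict(lambda: defaultdict(list))
    for entry_id, seq, family_id in entries:
        clusters[sequence_md5(seq)][family_id].append(entry_id)
    return {md5: dict(families) for md5, families in clusters.items()}


# Hypothetical entries: identifiers and families are invented for the example.
entries = [
    ("E1", "ACGTACGT", "FAM_A"),
    ("E2", "acgt acgt", "FAM_A"),  # same sequence, same family as E1
    ("E3", "ACGTACGT", "FAM_B"),   # same sequence, different family
    ("E4", "TTTTGGGG", "FAM_A"),   # different sequence
]
clusters = cluster_two_levels(entries)
```

Here E1, E2 and E3 fall into one level-1 cluster because their normalised sequences are identical, and that cluster splits into two level-2 clusters, one per patent family.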

Ranking

All databases: 4749/6000 (20.867%)
Metadata: 459/619 (26.01%)
Total rank: 4749
Citations: 10
z-index: 0.714

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation:
System accessibility & reliability:

Record metadata

Created on: 2019-04-17
Curated by: Lina Ma [2019-04-17]