We want to hear from you about changes to NIH’s Sequence Read Archive data format and storage

NIH’s Sequence Read Archive (SRA) is the largest, most diverse collection of next-generation sequencing data from human, non-human and microbial sources. Hosted by the National Center for Biotechnology Information (NCBI) at the National Library of Medicine (NLM), SRA data … Continue reading

Improved access to SARS-CoV-2 data

NCBI Datasets has a simple, new way to get Coronaviridae data, including from SARS-CoV-2 (Figure 1). The data package includes genomic, protein and CDS sequences, annotation and a comprehensive data report for all complete genomes. You can also target your search … Continue reading

New GenBank submission options for SARS-CoV-2 submitters

NCBI is pleased to announce ongoing enhancements to submission of SARS-CoV-2 assembled genomes to GenBank, including a streamlined workflow on the web and a new API option. Both new options mean that you can receive accessions for SARS-CoV-2 data submissions … Continue reading

New viral protein domain models for annotation of coronaviruses

NLM’s Conserved Domain Database (CDD) has expanded its scope to include 153 new viral protein domain family models for the annotation of coronaviruses, including models for the S1 subunit of coronavirus Spike proteins (cd21527), the nucleocapsid (N) protein of coronavirus (cd21595), and the coronavirus … Continue reading

The New and Improved PubMed® — We Are Listening

Today marks 5 weeks since the new PubMed was made the default version. Throughout this process, we promised to listen, and we heard from you! This was a huge change. We know change isn’t always easy, especially with major changes to a … Continue reading

New BLAST default parameters and search limits coming in September

To provide a more efficient BLAST experience for everyone, we’re changing some parameters and limits on the web BLAST service on September 8, 2020. The new settings, listed below, will improve overall performance and make search times more consistent. The … Continue reading

dbSNP human build 154 release + ALFA data

dbSNP human build 154, now available, includes new ALFA (Allele Frequency Aggregator) variants and allele frequencies. This build contains over two billion Submitted SNP (ss) records and 730 million Reference SNP (rs) records. New features include: dbSNP search now supports HGVS … Continue reading

New annotations in RefSeq: budgerigar, bony fish, fly and more

In May, the NCBI Eukaryotic Genome Annotation Pipeline released new annotations in RefSeq for the following organisms: Acipenser ruthenus (sterlet); Arvicanthis niloticus (African grass rat); Cannabis sativa (eudicot); Crassostrea gigas (Pacific oyster); Cyclopterus lumpus (lumpfish); Drosophila albomicans (fly); Drosophila guanche (fly); Drosophila innubila (fly) … Continue reading

Enhanced prokaryote type strain report now with details on needed type strain data

The Prokaryote type strain report provides information on type strains for over 18,000 species. We revised and expanded the report to make it easier to identify cases where sequencing or establishing type material would have the biggest impact on improving prokaryote taxonomy … Continue reading