Significant Updates Coming to the NCBI Datasets APIs and Command-Line Tools

As part of our ongoing effort to enhance your experience, we are updating the NCBI Datasets application programming interfaces (APIs). Beginning in June 2024, the v2alpha APIs will be promoted to the stable v2 version. At this time, the v1 API, the command-line interface (CLI) version 13 and older versions, and the Python library v1 … Continue reading Significant Updates Coming to the NCBI Datasets APIs and Command-Line Tools

NCBI Celebrates International Love Data Week

Providing free and open access to scientific literature and data  Do you love biological data? Join NCBI in celebrating International Love Data Week, February 12-16, 2024! Love Data Week is an international celebration of data, which takes place every year during the week of Valentine’s Day. This year’s theme is “My Kind of Data,” focusing on … Continue reading NCBI Celebrates International Love Data Week

New RefSeq Annotations Now Available!

During October to January, the NCBI Eukaryotic Genome Annotation Pipeline released seventy new annotations in RefSeq! New Annotations Alnus glutinosa (eudicot) Amyelois transitella (moth) Anolis sagrei ordinatus (Brown anole) Apis cerana (Asiatic honeybee) Balaenoptera ricei (Rice’s whale) Bombus pascuorum (bee) Bos javanicus (banteng) Bos taurus (cattle)  Cannabis sativa (eudicot) Colius striatus (speckled mousebird) Cololabis saira (Pacific saury) … Continue reading New RefSeq Annotations Now Available!

Enhancements to ClinVar Website Now Live

As previously announced, we updated the ClinVar website as part of our effort to better support the display of submitted somatic variation data.  What changed?  Updated ClinVar search results Germline / Somatic origin filter Somatic Classification and Review Status summary Updated Variant (VCV)  record page Includes both Germline and Somatic data for the same variant … Continue reading Enhancements to ClinVar Website Now Live

HomoloGene Now Redirects to NCBI Datasets Gene

A new way to view and download related genes  As previously announced, HomoloGene now automatically redirects to the NCBI Datasets Gene page giving you easy access to up-to-date sequence and homology data. The NCBI Datasets Gene Table provides a link to NCBI Orthologs with expanded gene and protein information and links to tools. NCBI Orthologs … Continue reading HomoloGene Now Redirects to NCBI Datasets Gene

BLAST FASTA Files Will No Longer Be Available on the FTP Site Effective April 2024

Easily generate BLAST FASTA files yourself!  In April 2024, the FASTA (sequence text) files of the sequences in the Basic Alignment Search Tool (BLAST) databases will no longer be available on the FTP site. However, you can easily generate FASTA files yourself from the formatted BLAST databases by using the BLAST utility blastdbcmd that comes … Continue reading BLAST FASTA Files Will No Longer Be Available on the FTP Site Effective April 2024

Updated Bacterial and Archaeal Reference Genome Collection is Available!

Download the updated bacterial and archaeal reference genome collection! This collection of 18,943 genomes was built by selecting the “best” genome assembly for each species among the 330,000+ prokaryotic genomes in RefSeq (except for E. coli for which two assemblies were selected as reference). You can speed up your sequence searches by running them against … Continue reading Updated Bacterial and Archaeal Reference Genome Collection is Available!

RefSeq Release 222 Now Available!

Check out RefSeq release 222, now available online and from the FTP site. You can access RefSeq data through NCBI Datasets. What’s included in this release? As of January 8, 2024, this full release incorporates genomic, transcript, and protein data containing: 411,137,832 records 304,562,770 proteins 59,343,570 RNAs sequences from 145,371 organisms  The release is provided in several directories as … Continue reading RefSeq Release 222 Now Available!

Now Available: NCBI Hidden Markov Models (HMM) Release 14.0!

Download release 14.0 of the NCBI protein profile Hidden Markov models (HMMs) used by the Prokaryotic Genome Annotation Pipeline (PGAP)! Search this collection against your favorite prokaryotic proteins to identify their function using the HMMER sequence analysis package. What’s new? Release 14.0 contains: 16,383 HMMs maintained by NCBI 252 new HMMs since release 13.0 Several … Continue reading Now Available: NCBI Hidden Markov Models (HMM) Release 14.0!

Best of 2023: A Look at the NCBI Insights Blog

As we begin a new year, let’s look back at the best NCBI Insights Blog posts of 2023.   In case you missed any of these, check them out!  Learn more  The NCBI Insights Blog is intended to help you better understand NCBI and our resources, explore issues of scientific interest that drive our resource development, … Continue reading Best of 2023: A Look at the NCBI Insights Blog