BLAST FASTA Files Will No Longer Be Available on the FTP Site Effective April 2024

Easily generate BLAST FASTA files yourself!  In April 2024, the FASTA (sequence text) files of the sequences in the Basic Alignment Search Tool (BLAST) databases will no longer be available on the FTP site. However, you can easily generate FASTA files yourself from the formatted BLAST databases by using the BLAST utility blastdbcmd that comes … Continue reading BLAST FASTA Files Will No Longer Be Available on the FTP Site Effective April 2024

Updated Bacterial and Archaeal Reference Genome Collection is Available!

Download the updated bacterial and archaeal reference genome collection! This collection of 18,943 genomes was built by selecting the “best” genome assembly for each species among the 330,000+ prokaryotic genomes in RefSeq (except for E. coli for which two assemblies were selected as reference). You can speed up your sequence searches by running them against … Continue reading Updated Bacterial and Archaeal Reference Genome Collection is Available!

Using NCBI Data and Tools for Your Research Project

Are you a biology student working on a research project? NCBI offers free access to a wide variety of resources and tools to help you find and download data for your project.   How and why do you use our resources? Check out the example below: Your professor has assigned you a research project looking at … Continue reading Using NCBI Data and Tools for Your Research Project

Now Available! Updated Bacterial and Archaeal Reference Genomes Collection

An updated bacterial and archaeal reference genome collection is available! This collection of 18,343 genomes was built by selecting exactly one genome assembly for each species among the 312,000+ prokaryotic genomes in RefSeq, except for E. coli for which two assemblies were selected as reference. The criteria for selecting the reference assembly for a given species include assembly contiguity … Continue reading Now Available! Updated Bacterial and Archaeal Reference Genomes Collection

Important Update! Changes to ASSEMBLY_REPORTS and GENOME_REPORTS on FTP

Do you currently access genome assembly data through the FTP site? We are consolidating information provided in the ASSEMBLY_REPORTS and GENOME_REPORTS directories on the genomes FTP site to simplify access and ensure that you have the most accurate, up to date, and consistently reported data.   The assembly_summary files in the ASSEMBLY_REPORTS directory are gaining information … Continue reading Important Update! Changes to ASSEMBLY_REPORTS and GENOME_REPORTS on FTP

Join NCBI at ASM Microbe 2023

Houston, TX, June 15-19, 2023 NCBI is looking forward to seeing you in person at the American Society for Microbiology Annual Meeting (ASM Microbe 2023). NCBI staff will participate in a variety of activities and events and will also be available at our booth (#2410) to address your questions. We’re especially excited to share our … Continue reading Join NCBI at ASM Microbe 2023

RefSeq Release 218

RefSeq release 218 is now available online and from the FTP site. You can access RefSeq data through NCBI Datasets. What’s included in this release? As of May 1, 2023, this full release incorporates genomic, transcript, and protein data containing: 356,619,635 records 260,776,371 proteins 52,503,423 RNAs sequences from 133,740 organisms The release is provided in several directories as a … Continue reading RefSeq Release 218

New Release! Updated Bacterial and Archaeal Reference Genomes Collection Now Available

As previously announced, we are continuously curating a better Prokaryotic Reference Genomes Collection. An updated bacterial and archaeal reference genome collection is now available! This collection of 17,623 genomes was built by selecting exactly one genome assembly for each species among the 283,000+ prokaryotic genomes in RefSeq, except for E. coli for which two assemblies were selected as reference.  What’s … Continue reading New Release! Updated Bacterial and Archaeal Reference Genomes Collection Now Available

RefSeq Release 217

RefSeq release 217 is now available online and from the FTP site. You can access RefSeq data through NCBI Datasets. What’s included in this release? As of March 8, 2023, this full release incorporates genomic, transcript, and protein data, containing: 348,351,219 records 254,500,694 proteins 50,975,429 RNAs sequences from 130,837 organisms The release is provided in … Continue reading RefSeq Release 217

New & Improved NCBI Datasets Genome and Assembly Pages

Legacy pages will be redirected effective June 2023 In June 2023, NCBI’s Assembly and Genome record pages will be redirected to new Datasets pages as part of our ongoing effort to modernize and improve your user experience. NCBI Datasets is a new resource that makes it easier to find and download genome data.   We will … Continue reading New & Improved NCBI Datasets Genome and Assembly Pages