RefSeq Release 209 is available

RefSeq release 209 is now available online, from the FTP site and through NCBI’s Entrez programming utilities, E-utilities. This full release incorporates genomic, transcript, and protein data available as of November 1, 2021, and contains 296,293,486 records, including 215,655,378 proteins, 41,751,205 RNAs, and sequences from 114,396 organisms. The release is provided in several directories as a complete … Continue reading RefSeq Release 209 is available

NCBI will assign 64-bit numeric GIs by November 15th. Update affected software!

As announced  last month, NCBI will begin assigning larger (64-bit) numeric ‘GIs’ to the remaining sequence types that still receive these identifiers. This change is expected as soon as Nov. 15th, 2021 but could occur earlier if data submission volumes are unexpectedly high. This is a reminder that all organizations and developers using our products should review software for any remaining … Continue reading NCBI will assign 64-bit numeric GIs by November 15th. Update affected software!

NCBI’s GI sequence identifiers will soon exceed 32-bit numbers. Are you and your software ready?

In 2016, NCBI announced that it was curtailing its display of its numeric ‘GI’ in popular sequence data formats such as FASTA and GenBank flatfiles. Due to the continued growth of GenBank, NCBI will soon begin assigning GIs exceeding the signed 32-bit threshold of 2,147,483,647 for those remaining sequence types that still receive these identifiers. The exact date … Continue reading NCBI’s GI sequence identifiers will soon exceed 32-bit numbers. Are you and your software ready?