Automated Downloads of PDB Data
From PDBWiki
The following text is given in the RCSB PDB Newsletter Number 36. It is included here for convenience only.
See;
[edit] Downloads of PDB Data from ftp://ftp.wwpdb.org
As previously announced, the PDB archive has been moved to ftp://ftp.wwpdb.org.
Updated weekly, this location maintains the files from the wwPDB Remediation Project and all newly released files. The archive currently contains approximately 350,000 files, including coordinate data in PDB, mmCIF, and PDBML/XML formats, and experimental data. Since the entire archive requires more than 70 GBbytes of storage, fresh downloads require a substantial amount of time. In December 2007, more than 27 million files were downloaded from ftp://ftp.wwpdb.org.
During the same period, approximately 2.4 million files were downloaded from the snapshot of unremediated data at ftp.rcsb.org. Users should be aware that this site is no longer updated, and are strongly encouraged to update any automatic scripts or bookmarks to ftp://ftp.wwpdb.org.
Data files from the archive can be accessed online in a variety of ways, including:
- The RCSB PDB website offers a tool to download multiple data files at www.rcsb.org/pdb/download/download.do
- URLs for automatic downloads are described at www.rcsb.org/pdb/static.do?p=home/faq.html
- Data files are available for download from each entry's Structure Summary page.
At ftp://ftp.wwpdb.org/pub/pdb/README, users will find download information for downloading:
- A single file via ftp
- The entire archive via ftp
- The entire archive via rsync
- All files in a given format (PDB, CIF, XML) via rsync
- All files in a given format (PDB, CIF, XML) via ftp using tar balls
