----------------------------------------------------------------------
Protein Data Bank Quarterly Newsletter Number 67 January 1994
----------------------------------------------------------------------

----------------------------------------------------------------------
January 1994 PDB Release
----------------------------------------------------------------------

2327 full-release atomic coordinate entries (605 new additions)

   2143 proteins, enzymes and viruses
    156 DNA's
      9 RNA's
      9 tRNA's
     10 carbohydrates

 353 structure factor entries
  31 NMR experimental entries

The total size of the atomic coordinate entry database is 678 Mbytes
uncompressed.

----------------------------------------------------------------------
New Head for PDB
----------------------------------------------------------------------

In the past two years, the number of entries in the Protein Data Bank
has tripled. The electronic network has become the major mode of
access, allowing entirely new possibilities.

To head PDB in this time of major new challenges and exciting
opportunities, Brookhaven National Laboratory (BNL) has appointed
Joel L. Sussman, a protein crystallographer and professor at the
Weizmann Institute of Science in Israel.

Dr. Sussman's background is in structural and computational molecular
biology, and his research focuses on structure and function of
proteins related to the nervous system, such as acetylcholinesterase.
He holds a joint appointment in Brookhaven's Chemistry and Biology
Departments. This connection will aid him both in his continuing
research and in leading the development of PDB, an organization which
is becoming increasingly important not only for structural biology
but also for much of biomedical science. During the coming year,
Dr. Sussman will divide his time between BNL and the Weizmann
Institute.

Joel's involvement with PDB began long before his recent appointment.
Joel, Dr. Helen Berman, Dr.
Walter Hamilton and others assisted in PDB's birth at a 1971
symposium at Cold Spring Harbor, where the scientific community first
formally discussed the need and mechanism for ensuring accessibility
of the known protein structures. This discussion led to the
establishment of PDB at BNL under Walter's leadership and, following
Walter's death, under Tom Koetzle. Tom recalls that over the years
Joel has been both a depositor to, and a user of, the Data Bank,
having contributed entries of nucleic acids including those of tRNA
and several DNA's as well as protein structures for concanavalin A,
avidin, acetylcholinesterase and their complexes. In addition, Joel
was one of the early developers of methodology and programs for
biological macromolecule refinement. For the past year, he has served
on the PDB Advisory Board, representing the European Science
Foundation Network for the Crystallography of Biological
Macromolecules.

As head, Joel would like to elevate PDB beyond its initial goal of an
information repository. As well as making the data even more
accessible and speeding the deposition process for both the authors
and processors, he would like PDB to evolve into an actual database,
where users can ask questions across the full collection of
structures instead of just about individual ones.

He envisions PDB as a true international collaboration, receiving not
only data but also a free flow of ideas from its growing community of
users. The formation of a PDB User Group headed by Dr. Jane
Richardson will help to further this goal (see article on page 3
pertaining to User Group).

By facilitating the study of how nucleic acid sequences relate to the
3-D structures of their final protein products, Joel visualizes PDB
locating itself at the crossroads between structural biology and the
gene-sequencing work of the Human Genome Project.

----------------------------------------------------------------------
Other Management Changes
----------------------------------------------------------------------

David Stampf has been appointed Senior Project Manager and will
assume responsibility for daily operations as well as continuing to
head the software development group. Dr. Enrique Abola will continue
to manage the scientific activities related to database production
and will act as Science Coordinator for PDB.

----------------------------------------------------------------------
January PDB Release Includes 605 New Full-Release Entries
----------------------------------------------------------------------

An important milestone was achieved in January 1994 when PDB
completed the task of converting to full-release form all entries
previously distributed as prerelease. In July 1993 PDB stopped
preparing entries in prerelease form and began a concerted effort to
convert all prerelease entries to full release. Since July 1993 PDB
staff has processed about 1500 entries and has strived to remain
current by releasing newly deposited entries in full-release form as
soon as possible after deposition. The January 1994 release contains
605 additional coordinate entries, bringing the total being
distributed to 2327.

This milestone marks the completion of the effort by PDB to address
issues that arose in the late 1980's when a substantial fraction of
the data deposited was awaiting processing and validation and was
therefore unavailable to the user community.

The figure below and to the left provides a historical view (from
1987) of the growth of the Data Bank. It also illustrates the
strategy followed to bring PDB up-to-date. During 1992, the number of
entries available to the community was significantly raised by the
prerelease. There was also a marked decrease in the number of entries
being processed or awaiting approval.
Then in 1993 the number of full-release entries available rose very
rapidly as prerelease was phased out.

The turnaround time for entry release was improved from an average of
nine months to the current three- to four-month time period. Although
PDB returns entries to depositors for approval an average of thirty
days after complete deposition, it may take another ninety days
before PDB receives release approval and data is loaded onto the FTP
(File Transfer Protocol) server. One of PDB's primary goals in 1994
is the reduction of this turnaround time.

Of equal importance is the installation of new procedures providing
enhanced automation in data processing and significantly improving
the quality of the data distributed by PDB. These procedures were
needed to check and validate the large number of new data entries
that were included in the October 1993 and January 1994 releases of
the PDB.

The number of entries deposited with PDB continues to grow at an
exponential rate. In 1993 a total of 940 entries were deposited. The
number of depositions is projected to rise to 1200 in calendar year
1994.

----------------------------------------------------------------------
Newsletter Availability
----------------------------------------------------------------------

The PDB Newsletter is available from FTP, Gopher and Listserver
archives as soon as it is prepared. Allowing for printing and mailing
time, we expect this to be about four weeks sooner than a printed
copy would reach you by mail. Availability of each new issue is
announced on the Listserver (see article on page 6 pertaining to
Listserver).

If you are satisfied receiving the PostScript or ASCII version of the
Newsletter and no longer wish to receive a printed copy, or if you
know of copies that are being discarded because of obsolete
addresses, please let us know electronically by sending a message to
pdb@bnl.gov or by mail to our address on page 27.
Printing and mailing fewer copies saves both time and trees.

----------------------------------------------------------------------
Revised Newsletter Format Considered
----------------------------------------------------------------------

PDB is considering eliminating the tables of newly released and newly
deposited entries in future Newsletters. All available and pending
PDB entries, with newly released and newly deposited entries flagged,
are listed in the Full Tables document. This document accompanies
each order sent and is available on FTP in the `pub' subdirectory in
both PostScript and ASCII formats. A printed copy may be obtained
upon request.

We would appreciate feedback from you about this possible change.
Please send your response via e-mail to pdb@bnl.gov or by mail to our
address on page 27.

----------------------------------------------------------------------
PDB User Group
----------------------------------------------------------------------

A User Group for the Protein Data Bank is currently being organized
under the leadership of Jane S. Richardson of Duke University. A
Coordinating Committee to represent the diverse spectrum of users
will soon be named.

The User Group aims to facilitate communication in both directions
between PDB and all categories of its users, improving knowledge of
what is already available or in progress, diagnosing problems
quickly, and arriving collectively at the best ideas and innovations
for the future. The User Group also intends to collect and make
available various subsets and annotations of PDB entries for such
purposes as teaching and structural analysis.

Your input is solicited. Please let us know what type of user you are
and what you consider priorities for the future. Please respond by
e-mail to PDBusrgp@suna.biochem.duke.edu or by regular mail to Jane
Richardson, PDB User Group, Box 3711 DUMC, Durham, NC 27710 USA.

----------------------------------------------------------------------
New Distribution Directory Structure
----------------------------------------------------------------------

As discussed on the PDB Listserver, the directory layout for entries
has been changed. Instead of using tape01, tape02, ..., tapeNN, we
are now grouping entries by the middle two characters of the ident
code. So, entry file pdb1abc.ent will be found in
*/compressed_files/ab and */uncompressed_files/ab.

This change was made because users had difficulty finding entries
under the old tapeNN layout. Now, once an ident code is known it is
simple to find an entry. If an ident code is not known, the index
files provide a quick means of finding it.

----------------------------------------------------------------------
Full Entry Datestamps
----------------------------------------------------------------------

All entry files will now be datestamped to show when they were
released. This datestamp will not change in subsequent releases
unless data in a file changes. Entries will continue to have a
REVDAT 0 record as well. All entries released through the Jan94 full
release will be given a January 31, 1994 datestamp.

----------------------------------------------------------------------
Access to PDB using FTP
----------------------------------------------------------------------

PDB has an anonymous FTP account on the computer system
pdb.pdb.bnl.gov (Internet address 130.199.144.1). Files may be
transferred to and from this system using anonymous as the FTP user
name and your e-mail address as the password. Besides downloading
entries, data files and documentation, it is possible to upload any
files that you may wish to send to PDB.

Please note that those using VMS may need to place quotes around file
names.
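
The directory layout and anonymous login described above can be
sketched in a few lines of Python. This is only an illustration under
stated assumptions, not a PDB-supplied tool: the function names
entry_location and fetch_entry are hypothetical, and the choice of
current_release as the top-level directory holding compressed_files
and uncompressed_files is an assumption (the Newsletter writes that
part of the path simply as `*'). The host name is the one given in
this Newsletter and may not be reachable from every site.

```python
from ftplib import FTP

PDB_HOST = "pdb.pdb.bnl.gov"  # host name as given in this Newsletter


def entry_location(ident_code, release_dir="current_release"):
    """Return the file name and directories for one PDB entry.

    Assumption: release_dir is a top-level FTP directory that holds
    the compressed_files and uncompressed_files subdirectories.
    """
    code = ident_code.strip().lower()
    if len(code) != 4 or not code.isalnum():
        raise ValueError("ident code must be 4 alphanumeric characters")
    middle = code[1:3]  # middle two characters select the subdirectory
    return {
        "file_name": "pdb%s.ent" % code,
        "compressed_dir": "%s/compressed_files/%s" % (release_dir, middle),
        "uncompressed_dir": "%s/uncompressed_files/%s" % (release_dir, middle),
    }


def fetch_entry(ident_code, email, release_dir="current_release"):
    """Download one uncompressed entry by anonymous FTP (untested sketch)."""
    loc = entry_location(ident_code, release_dir)
    with FTP(PDB_HOST) as ftp:
        # anonymous as the user name, e-mail address as the password
        ftp.login(user="anonymous", passwd=email)
        ftp.cwd(loc["uncompressed_dir"])
        with open(loc["file_name"], "wb") as out:
            ftp.retrbinary("RETR " + loc["file_name"], out.write)
    return loc["file_name"]
```

For the example in the text, entry_location("1abc") gives the file
name pdb1abc.ent and the subdirectory ab under both compressed_files
and uncompressed_files. Only the uncompressed file name is built here
because the Newsletter does not state the suffix used for compressed
files.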

----------------------------------------------------------------------
FTP Access Help
----------------------------------------------------------------------

Useful FTP commands:
-------------------

ascii
    - Set file transfer type to network ASCII.
binary
    - Set file transfer type to support binary image transfer.
bye
    - Terminate FTP session with remote server and exit FTP.
cd remote-directory
    - Change working directory on remote machine to
      remote-directory.
cdup
    - Change remote machine working directory to parent of current
      remote machine working directory.
dir [ remote-directory ] [ local-file ]
    - Print listing of contents of remote-directory, optionally
      placing output in local-file.
get remote-file [ local-file ]
    - Retrieve remote-file and store it on local machine.
help [ command ]
    - Print informative message about meaning of command.
ls [ remote-directory ] [ local-file ]
    - Print listing of contents of directory on remote machine.
put local-file [ remote-file ]
    - Store local file on remote machine.
pwd
    - Print name of current working directory on remote machine.
quit
    - Synonym for bye.

Useful directory and file descriptions:
--------------------------------------

directory: all_entries
    - Contains up-to-date full-release entries.
    - All entries together in single directory (not divided by
      2-character code).
directory: crystallographer_info
    - Informational files of interest to crystallographers.
directory: current_release
    - Contains up-to-date full-release entries.
    - Made up of last quarterly full-release entries and updated and
      additional full-release entries.
    - Always current full release (last quarterly plus updates).
    - Divided into 2-character directories.
directory: fullrelease
    - Contains last quarterly full-release entries.
    - Divided into 2-character directories.
directory: index
    - Index files that cross-reference ident codes to various
      parameters.
directory: new_uploads
    - Uploads to PDB accepted here.
directory: newly_released
    - Contains all full-release entries updated or added since the
      last quarterly full release.
    - Divided into 2-character directories.
directory: newsletter
    - PDB Newsletters and Full Tables.
directory: nmr_restraints
    - NMR restraint files.
directory: pub
    - Various useful items.
directory: structure_factors
    - Contains last quarterly full-release structure factor files.
file: README
    - FTP login message and README file.
file: advisory.doc
    - PDB Advisory Notice. This notice should be signed and returned
      if you intend to download files.
file: contents.lis (same as ls-lR)
    - Listing of files and directories on FTP.
file: datestamp.txt
    - Description of datestamping method used for entry files.
file: expert.txt
    - Quick instructions for FTP experts.
file: ftp.hlp
    - Help on using FTP.
file: how2dnld.txt
    - Information on how to download a file.
file: how2find.txt
    - Instructions on how to find a file (entry).
file: how2upld.txt
    - Information on how to upload a file.

----------------------------------------------------------------------
How to Download Files from PDB using FTP
----------------------------------------------------------------------

Once you are logged into PDB using FTP, you can download files to
your computer using the "get" command. This example shows how you can
download a file named oct93newsletter.txt to your computer.

******************************
****************************** ftp to pdb.pdb.bnl.gov
******************************
****************************** do not use rsh, rlogin or
****************************** telnet -- they won't work
******************************
prompt> ftp -n pdb.pdb.bnl.gov
Connected to pdb.pdb.bnl.gov.
220- PDB Network Login to
220 pdb.pdb.bnl.gov FTP server ready.
Remote system type is UNIX.
Using binary mode to transfer files.
ftp> user
(username) anonymous
331 Guest login ok, type your name as password.
Password: ********
230-
......................................................................
Welcome to the PDB ANONYMOUS FTP account.
Unauthorized access to this computer system is prohibited.
Please limit the number of files you download each day.
If abused, access to this service will be restricted as needed.
If you have not done so, please complete and return the advisory
notice (advisory.doc).
......................................................................
... rest of README deleted ...
230 Guest login ok, access restrictions apply.
******************************
****************************** do a listing to see what's
****************************** at the top level of the FTP
****************************** directory
******************************
ftp> ls -l
200 PORT command successful.
150 Opening ASCII mode data connection for `/bin/ls'.
total 2402
-rw-r--r--  1 root sys        0 Mar  9 08:21 PRERELEASE_FILES_ARE_GONE
-rw-r--r--  1 root sys     3492 Mar 12 10:21 README
-rw-r--r--  2 root sys     1884 Mar  9 15:59 advisory.doc
drwxr-xr-x  4 root sys      512 Jan 31 07:39 all_entries
drwxr-xr-x  2 root user     512 Feb  3 23:01 bin
-rwxr-xr-x  1 root sys  1199235 Mar 10 11:56 contents.lis
drwxr-xr-x  3 root user    1024 Feb 14 08:51 crystallographer_info
drwxr-xr-x  4 root sys      512 Jan 31 07:39 current_release
-rwxr-xr-x  1 root sys      505 Mar 12 09:50 datestamp.txt
drwxr-xr-x  2 root user     512 Feb 24  1993 etc
-rwxr-xr-x  1 root sys     1935 Mar 10 11:29 expert.txt_NOT_READY
-rwxr-xr-x  1 root sys      273 Mar 10 11:29 ftp.hlp_NOT_READY
drwxr-xr-x  4 root sys      512 Jan 31 07:55 fullrelease
-rwxr-xr-x  1 root sys      273 Mar 10 11:29 how2dnld.txt_NOT_READY
-rwxr-xr-x  1 root sys      273 Mar 10 11:29 how2find.txt_NOT_READY
-rwxr-xr-x  1 root sys      406 Mar 10 11:30 how2upld.txt_NOT_READY
drwxr-xr-x  2 root sys      512 Mar 10 12:10 index
drwx-wx-wx  2 root user   13312 Mar 12 10:18 new_uploads
drwxr-xr-x  4 root sys      512 Jan 31 07:39 newly_released_EMPTY
drwxr-xr-x  5 root sys      512 Mar 10 07:17 newsletter
drwxr-xr-x  2 root sys      512 Feb  3 10:14 nmr_restraints
drwxr-xr-x  9 root user     512 Mar 10 07:25 pub
drwxr-xr-x  3 root user     512 Apr 30  1993 structure_factors
226 Transfer complete.
******************************
****************************** remember, you want file
****************************** oct93newsletter.txt
******************************
****************************** it's not visible here, but
****************************** there is a "newsletter"
****************************** directory
******************************
****************************** change to the "newsletter"
****************************** directory
******************************
ftp> cd newsletter
250 CWD command successful.
******************************
****************************** do another listing of the
****************************** directory
******************************
ftp> ls -l
200 PORT command successful.
150 Opening ASCII mode data connection for `/bin/ls'.
total 3
drwxr-xr-x  2 root sys      512 Mar 10 07:17 newsletter93apr
drwxr-xr-x  2 root sys      512 Mar 10 07:17 newsletter93jul
drwxr-xr-x  2 root sys      512 Mar 10 07:17 newsletter93oct
226 Transfer complete.
******************************
****************************** you still have to go into
****************************** another directory
******************************
****************************** change to "newsletter93oct"
****************************** directory
******************************
ftp> cd newsletter93oct
250 CWD command successful.
******************************
****************************** do another listing of the
****************************** directory
******************************
ftp> ls -l
200 PORT command successful.
150 Opening ASCII mode data connection for `/bin/ls'.
total 2671
-rw-r--r--  1 root sys     1251 Dec 22 14:37 oct93avail_on_tape.txt
-rw-r--r--  1 root sys     4256 Dec 22 14:37 oct93biblio.txt
-rw-r--r--  1 root sys     1939 Dec 22 14:37 oct93corrections.txt
-rw-r--r--  1 root sys   144076 Dec 22 14:37 oct93entries.txt
-rw-r--r--  1 root sys   772028 Dec 20 12:02 oct93full.ps
-rw-r--r--  1 root sys   290398 Dec 22 14:18 oct93newsletter.ps
-rw-r--r--  1 root sys    33336 Dec 22 14:05 oct93newsletter.txt
-rw-r--r--  1 root sys     3236 Dec 22 14:37 oct93nmr.txt
-rw-r--r--  1 root sys    10526 Dec 22 14:37 oct93onhold.txt
-rw-r--r--  1 root sys    70671 Dec 22 14:38 oct93pending.txt
-rw-r--r--  1 root sys     2294 Dec 22 14:38 oct93prog_misc.txt
-rw-r--r--  1 root sys    30016 Dec 22 14:38 oct93struct_fact.txt
226 Transfer complete.
******************************
****************************** there's the file you want
******************************
****************************** go ahead and "get" it
******************************
ftp> get oct93newsletter.txt
local: oct93newsletter.txt remote: oct93newsletter.txt
200 PORT command successful.
150 Opening BINARY mode data connection for `oct93newsletter.txt' (33336 bytes).
226 Transfer complete.
33336 bytes received in 0.10 seconds (325.55 Kbytes/s)
******************************
****************************** return to your local
****************************** computer
******************************
ftp> quit
221 Goodbye.
******************************
****************************** check for the file
******************************
prompt> ls -l oct93newsletter.txt
-rw-r--r--  1 root sys    33336 Mar 12 10:39 oct93newsletter.txt
******************************
****************************** view the file locally
******************************
prompt> head oct93newsletter.txt
......................................................................
Protein Data Bank Quarterly Newsletter Number 65 October 1993
......................................................................
October PDB Release Includes 523 New Full-Release Entries
We are pleased to announce that the October 1993 PDB release
includes 523 new full-release entries. In addition, all
...rest deleted...
******************************
****************************** the FTP program has
****************************** many options
******************************
****************************** see the FTP manual page
****************************** for more information
******************************
****************************** Send e-mail to skora@bnl.gov
****************************** with questions.
******************************

----------------------------------------------------------------------
How to Upload Files to PDB using FTP
----------------------------------------------------------------------

Once you are logged into PDB using FTP, you can upload files from
your computer to PDB using the "put" command. This example shows how
you can upload a file named mydata123.dat to PDB.

******************************
****************************** ftp to pdb.pdb.bnl.gov
******************************
****************************** do not use rsh, rlogin or
****************************** telnet -- they won't work
******************************
prompt> ftp -n pdb.pdb.bnl.gov
Connected to pdb.pdb.bnl.gov.
220- PDB Network Login to
220 pdb.pdb.bnl.gov FTP server ready.
Remote system type is UNIX.
Using binary mode to transfer files.
ftp> user
(username) anonymous
331 Guest login ok, type your name as password.
Password: ********
230-
======================================================================
Welcome to the PDB ANONYMOUS FTP account.
Unauthorized access to this computer system is prohibited.
Please limit the number of files you download each day.
If abused, access to this service will be restricted as needed.
If you have not done so, please complete and return the advisory
notice (advisory.doc).
======================================================================
... rest of README deleted ...
230 Guest login ok, access restrictions apply.
******************************
****************************** do a listing of the directory
******************************
ftp> ls -l
200 PORT command successful.
150 Opening ASCII mode data connection for `/bin/ls'.
total 2402
-rw-r--r--  1 root sys        0 Mar  9 08:21 PRERELEASE_FILES_ARE_GONE
-rw-r--r--  1 root sys     3492 Mar 12 10:21 README
-rw-r--r--  2 root sys     1884 Mar  9 15:59 advisory.doc
drwxr-xr-x  4 root sys      512 Jan 31 07:39 all_entries
drwxr-xr-x  2 root user     512 Feb  3 23:01 bin
-rwxr-xr-x  1 root sys  1199235 Mar 10 11:56 contents.lis
drwxr-xr-x  3 root user    1024 Feb 14 08:51 crystallographer_info
drwxr-xr-x  4 root sys      512 Jan 31 07:39 current_release
-rwxr-xr-x  1 root sys      505 Mar 12 09:50 datestamp.txt
drwxr-xr-x  2 root user     512 Feb 24  1993 etc
-rwxr-xr-x  1 root sys     1935 Mar 10 11:29 expert.txt_NOT_READY
-rwxr-xr-x  1 root sys      273 Mar 10 11:29 ftp.hlp_NOT_READY
drwxr-xr-x  4 root sys      512 Jan 31 07:55 fullrelease
-rwxr-xr-x  1 root sys      273 Mar 10 11:29 how2dnld.txt_NOT_READY
-rwxr-xr-x  1 root sys      273 Mar 10 11:29 how2find.txt_NOT_READY
-rwxr-xr-x  1 root sys      406 Mar 10 11:30 how2upld.txt_NOT_READY
drwxr-xr-x  2 root sys      512 Mar 10 12:10 index
drwx-wx-wx  2 root user   13312 Mar 12 10:18 new_uploads
drwxr-xr-x  4 root sys      512 Jan 31 07:39 newly_released_EMPTY
drwxr-xr-x  5 root sys      512 Mar 10 07:17 newsletter
drwxr-xr-x  2 root sys      512 Feb  3 10:14 nmr_restraints
drwxr-xr-x  9 root user     512 Mar 10 07:25 pub
drwxr-xr-x  3 root user     512 Apr 30  1993 structure_factors
226 Transfer complete
******************************
****************************** see the directory named
****************************** "new_uploads"?
******************************
****************************** that's where you need to
****************************** go
******************************
****************************** change to "new_uploads"
****************************** directory
******************************
ftp> cd new_uploads
250 CWD command successful.
******************************
****************************** do a listing of the directory
******************************
ftp> ls -l
200 PORT command successful.
150 Opening ASCII mode data connection for `/bin/ls'.
cannot access directory.
total 26
226 Transfer complete.
******************************
****************************** you cannot view the files
****************************** in this directory
******************************
******************************
****************************** you can only "put" files
****************************** here
******************************
******************************
****************************** go ahead and "put" your
****************************** file
******************************
ftp> put mydata123.dat
local: mydata123.dat remote: mydata123.dat
200 PORT command successful.
150 Opening BINARY mode data connection for `mydata123.dat'.
226 Transfer complete.
373227 bytes sent in 0.00 seconds (6835.94 Kbytes/s)
******************************
****************************** return to your local
****************************** computer
******************************
ftp> quit
221 Goodbye.
******************************
****************************** the FTP program has
****************************** many options
******************************
******************************
****************************** see the FTP manual page
****************************** for more information
******************************
******************************
****************************** Send e-mail to skora@bnl.gov
****************************** with questions.
******************************

----------------------------------------------------------------------
Access to PDB using Listserver
----------------------------------------------------------------------

PDB has a mailing list devoted to discussions concerning its
operation, contents and access to the Data Bank. If you would like to
subscribe, please send e-mail to:

   LISTSERV@PDB.PDB.BNL.GOV

with the message:

   subscribe PDB-L Firstname Lastname

To find out what you can do with this mailing list, send e-mail to
the same address (LISTSERV@PDB.PDB.BNL.GOV) with the one-line message
of:

   help

To send a message to all PDB-L subscribers, e-mail the message to:

   PDB-L@PDB.PDB.BNL.GOV

----------------------------------------------------------------------
Access to PDB using Gopher
----------------------------------------------------------------------

PDB is accessible using Gopher software which follows a simple
protocol to "tunnel" through a TCP/IP Internet. Gopher is recommended
for obtaining information and files quickly and easily from PDB. As a
Gopher client, you can navigate through a hierarchy of directories
and documents or ask an index server to return a list of all
documents that contain one or more specified words.

You can choose "The PDB Anonymous FTP" after reaching PDB's Gopher
server in order to search and download the same information and
coordinate files as through FTP. Alternatively, you can select "An
(almost) full-text search of the PDB Bibliographic Headers", in order
to search PDB using any keyword, such as an ident code, author or
compound name.

Users running a Gopher client can access the PDB server by including
the following link:

   Name = Protein Data Bank FTP site
   Type = 1
   Host = pdb.pdb.bnl.gov
   Port = 70
   Path = 1/

Information for setting up an Internet Gopher client including source
files for different machines is available from anonymous FTP at
boombox.micro.umn.edu (134.84.132.2) under the directory /pub/gopher.
For more information or help in searching the PDB from Gopher, send
e-mail to oeder@bnl.gov.

----------------------------------------------------------------------
Call for Sequences of Structures to Predict
----------------------------------------------------------------------

To all protein crystallographers and NMR spectroscopists:

Methods for obtaining information about protein structure from
sequence appear to be advancing rapidly. But just what can these
methods currently deliver? A meeting to assess the state of the art
is being organized by CARB (University of Maryland and NIST),
Lawrence Livermore National Laboratory and Sandia National
Laboratories, with additional generous financial support from NIST.

Modelers will be asked to make blind predictions of structures using
comparative modeling, threading, or ab initio methods and deposit
their structures before experimental reality is available. The aim is
to have approximately ten structures predicted in each of these
categories before the meeting (which will take place in Asilomar in
December) to evaluate the predictions.

This will not work unless extensive help is obtained from the
community of experimentalists. Seventeen prediction targets have so
far been collected. This is a very good start, but more are needed.

If you are working on a new protein structure and would like to have
the structure predicted or you would like to obtain additional
information, contact John Moult at CARB, University of Maryland
Biotechnology Institute, 9600 Gudelsky Drive, Rockville, MD 20850 USA
(telephone: 301-738-6241; facsimile: 301-738-6255; e-mail:
jmoult@iris4.carb.nist.gov).

----------------------------------------------------------------------
Distribution of Obsolete Entries
----------------------------------------------------------------------

Over the years a number of PDB entries have been withdrawn by
depositors, usually because improved data was available or, less
frequently, because a serious problem was discovered with the data.
Entries that were withdrawn before their public release will not be
distributed by PDB except with the express permission of the
depositor. Entries that were withdrawn following their release exist
on old PDB distribution tapes and are stored in numerous laboratories
that have received these data from Brookhaven.

In the near future, we plan to make these obsolete entries available
to all those who may wish to make use of them. We anticipate that the
obsolete entries will be of interest to individuals studying new
approaches to solving or refining structures and needing `problem'
cases with which to evaluate their methods, and to those interested
in the evolution of the PDB database over time.

Watch FTP, Gopher and Listserver for announcements on how to access
the obsolete entries. All obsolete entries will be prominently
flagged to distinguish them from current PDB entries. Any publication
that may result from the use of obsolete entries should include an
explicit statement that the entries in question have been withdrawn
and/or replaced.

----------------------------------------------------------------------
New Programs Available from PDB
----------------------------------------------------------------------

Christopher Marzec and Loren Day of the Public Health Research
Institute in New York City have contributed two Fortran programs that
are now included in PDB distributions. Program BREAKRING will
generate pseudo-rotation parameters (q, P, S, Gamma) and the five
bond lengths from the coordinates of the atoms in a five-membered
ring.
The program MAKERING will generate coordinates of the atoms in a
five-membered ring from these pseudo-rotation parameters and the bond
lengths.

These programs are available from FTP and Gopher in subdirectory
program_tape. They are also included in distribution items DATAPRTP
and PDBPGMTP. If you have any questions relating to these programs,
please contact Dr. Loren Day at the Public Health Research Institute
(telephone: 212-578-0828; e-mail: day@phri.nyu.edu).

----------------------------------------------------------------------
Procheck Software Package
----------------------------------------------------------------------

The Procheck software package is being made available for electronic
distribution from PDB. Oxford Molecular, Ltd. and PDB have agreed
that upon receipt of a signed license agreement at PDB, the source
and documentation for Procheck will be made available free of charge.
The Procheck software package, which was created by J. M. Thornton,
M. W. MacArthur, R. A. Laskowski and D. S. Moss, performs evaluations
of the stereochemical quality of protein structures.

To acquire a copy of Procheck, you must obtain the license agreement,
copies of which are available from FTP, Gopher and Listserver
archives in the file called procheck-license. You must complete and
sign this license agreement and return it to PDB (Protein Data Bank
Procheck License, Chemistry Department, Building 555, Brookhaven
National Laboratory, P.O. Box 5000, Upton, NY 11973-5000 USA). Once
we have your signed agreement, we will either e-mail the source to
you or place it on the machine of your choice by FTP. Your signed
license agreement will be forwarded to Oxford Molecular, who will
keep you up to date about further developments.

All queries concerning the software should be directed to Steve
Gardner, Macromolecular Product Manager, Oxford Molecular, Ltd., The
Magdalen Centre, Oxford Science Park, Sandford-on-Thames, Oxford,
England OX4 4GA (telephone: +44-865-784600).

----------------------------------------------------------------------
PDB-Shell Update
----------------------------------------------------------------------

Version 1.3 of PDB-Shell, the Windows PDB browser, is included in the
latest release of PDB on CD-ROM. Several new features requested by
users have now been incorporated. Results of previous queries can be
used to narrow subsequent searches. Screens have been modified to
improve readability and the reporting of errors. In addition, a
number of options are available that permit PDB-Shell to access entry
files using various directory/disk arrangements. The program is
shipped with a default configuration that uses the new PDB directory
structure.

----------------------------------------------------------------------
Deposition Form and Guidelines for Deposition
----------------------------------------------------------------------

A set of helpful guidelines to use when preparing an entry for
submission to PDB is available from FTP. These guidelines address
issues of representation that often arise in the preparation of
coordinates for distribution. For example, the document includes an
explanation of how PDB represents entries with multiple chains. Also
included is advice on how to avoid errors commonly found in new
depositions. Submitted entries that follow these guidelines take less
time to process, which expedites the issuance of ident codes.

PDB strongly recommends that depositors review the documented
guidelines in order to facilitate the deposition process.

The latest version of the Deposition Form is available from FTP in
the /pub subdirectory.
We are requesting that depositors discard all old versions of the
Deposition Form (printed and/or electronic) and that they pick up the
latest electronic version each time they are preparing data for
deposition.

Documents are available from FTP, Gopher and Listserver as well as in
printed form upon request.

----------------------------------------------------------------------
Depositing Data with PDB
----------------------------------------------------------------------

PDB accepts depositions of atomic coordinates, bibliographic
citations, primary sequence and secondary structure information, as
well as crystallographic structure factors and NMR experimental data
on biological macromolecules.
These may include proteins, RNA, DNA, viruses and carbohydrates.
Deposited data are processed with PDB verification and checking
programs, converted to a standard format, archived and distributed
worldwide.

A deposition has three essential components which must be received
before a submission can be processed: the completed Deposition Form,
relevant reprints and preprints, and the actual data in PDB format.
The Deposition Form can be submitted electronically or on paper. Data
must be submitted in machine-readable form using FTP, e-mail,
magnetic tape or disk.

A Deposition Form and more information on depositing entries are
available in the FTP directory /pub and from the Listserver archives.
A printed copy of this document may also be obtained upon request.
For additional information about the Deposition Form, see the
preceding article.

----------------------------------------------------------------------
Assignment of Ident Codes
----------------------------------------------------------------------

Each entry in PDB is uniquely identified by a four-character ident
code (also sometimes referred to as an accession code). Present PDB
practice assigns ident codes without regard for the structure name.
However, we recognize that many depositors would like to have ident
codes that are related mnemonically to the names of their structures.
Should you have a preference for a particular ident code, PDB
requests that you indicate it on your Deposition Form. We promise
that all reasonable suggestions will be considered. Of course, if the
ident code that you are suggesting has already been used for an
existing entry, then an alternative code will have to be assigned by
PDB.

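Depositors who wish to suggest a mnemonic ident code may find it
convenient to check their preference against a list of codes already
in use before completing the Deposition Form. The Python fragment
below is a minimal sketch of such a check and is not a PDB-supplied
program. It assumes that a valid code consists of exactly four
letters or digits, that codes are compared without regard to case,
and that the list of codes in use has been gathered by the user (for
example, from one of the index listings). The function name
check_preferred_code and the sample codes are hypothetical.

```python
def check_preferred_code(preferred, used_codes):
    """Return (ok, reason) for a depositor's preferred ident code.

    Assumptions (not stated PDB policy): a code is exactly four
    letters or digits, and codes are compared case-insensitively.
    """
    code = preferred.strip().upper()
    if len(code) != 4:
        return False, "ident code must have exactly four characters"
    if not (code.isascii() and code.isalnum()):
        return False, "ident code must contain only letters and digits"
    if code in {c.strip().upper() for c in used_codes}:
        return False, "ident code is already used by an existing entry"
    return True, "code appears free; PDB makes the final assignment"


if __name__ == "__main__":
    # Hypothetical list of codes already in use, for illustration only.
    in_use = ["1ABC", "2XYZ"]
    for wish in ["1abc", "3AVI", "AVIDIN"]:
        print(wish, check_preferred_code(wish, in_use))
```

A code that passes this check is only a suggestion; as noted above,
the final assignment rests with PDB.
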
----------------------------------------------------------------------
Obtaining a New Entry's Ident Code
----------------------------------------------------------------------

The ident code of a new entry will be issued only after the complete
deposition is received and initial screening verifies the correctness
and integrity of the entry. After processing is complete, a letter
providing the ident code and requesting approval for release is sent
to the depositor.

To facilitate assignment of an ident code, it is necessary that the
deposition include all applicable information, including any
preprints or reprints of journal articles referenced. Data must be in
PDB format, and the Deposition Form must be legible and complete.

In the future, PDB will be implementing procedures giving depositors
direct access to PDB processing and checking programs via the
Internet. Depositors using these new procedures can expect to receive
confirmation of their deposition along with the entry ident code
within a few days of submission. When the new procedures are
implemented, further instructions will be available from FTP, Gopher
and Listserver archives, as well as in future Newsletters.

----------------------------------------------------------------------
Finding an Existing Entry's Ident Code
----------------------------------------------------------------------

Each PDB entry is uniquely identified by an ident code. Therefore,
retrieving the file for a particular structure requires this ident
code.

Lists of newly received entries are published quarterly in the PDB
Newsletter and Full Tables document. Tables of all PDB entries can be
obtained from FTP, Gopher and Listserver archives or by normal mail
upon request.

FTP and Gopher contain two subdirectories that are useful for
locating ident codes. The first is /index.
This contains the following files generated from the latest release:

   author.lst      ident codes and authors
   compound.lst    ident codes, resolutions and compound names
   crystal.lst     ident codes, unit-cell dimensions, space groups
                   and Z's
   resolu.lst      ident codes and resolutions
   source.lst      ident codes and sources

The second useful subdirectory is /newsletter. This contains text
(.txt) and PostScript (.ps) files of the Full Tables document listing
all currently available entries and pending entries that are in
preparation for future release.

Retrieving files from these two subdirectories, as well as the
directory listings which are in files named 'ls-lR', allows you to
determine whether the molecule of interest is available, what its
ident code is and where it is located.

Please be aware that files can be downloaded while using FTP, but
they cannot be viewed on the terminal while within this program.
Therefore, it is sometimes helpful to download tables or directory
listings, quit FTP, determine which additional files you want to
retrieve and then reconnect to FTP to get them. If you are logged in
from a UNIX computer, after retrieving a file you may view its
contents by escaping to the shell with the command '! cat filename'
or '! more filename', where cat and more are UNIX commands.

   ftp> get ls-lR      (retrieves the file named ls-lR)
   ftp> !cat ls-lR     (types the local file 'ls-lR')
   ftp> !more ls-lR    (types the local file 'ls-lR', one page at
                        a time)

The PDB Listserver archives and Gopher are other methods to locate
PDB ident codes. Please see detailed information about these services
in other articles in this Newsletter. Finally, in some cases, journal
articles reporting results of structural analyses of biological
macromolecules provide their PDB ident codes.

----------------------------------------------------------------------
CD-ROM Information
----------------------------------------------------------------------

PDB releases are available on CD-ROM in ISO 9660 format.
The layout of files on the CD-ROM mirrors the PDB UNIX tape
distribution and uses the aa, ab, ac, ..., zz subdirectories. The
entry files are in ASCII format and are readable by software able to
process text files.

PDB-Shell, a facility for Windows users to access and display
structures from the PDB database, is available on our CD-ROM.
PDB-Shell allows the user to search the database for various criteria
such as ident code, accession date, compound name and author (see
article on page 7 on the latest PDB-Shell update).

The PDB CD-ROM also includes the MAGE and PREKIN structure display
and manipulation software by David Richardson and Jane Richardson of
Duke University [The Kinemage: A Tool for Scientific Communication.
Protein Science 1, 3-9 (1992)] in both Windows and Macintosh
versions.

VAX/VMS systems currently do not directly support access to ISO 9660
CD-ROMs. The PDB CD-ROM may be accessed on VAX/VMS systems using
either of the following approaches:

   1. There is an ISO 9660 compliant device driver available from
      Digital Equipment Corporation (DEC) that allows direct access
      to the CD-ROM (driver part number YT-GS001-01). Please contact
      your DEC sales representative for further information.

   2. There is a public utility for accessing ISO 9660 CD-ROMs,
      called CD_ACCESS, written by Peter Stockwell, University of
      Otago, New Zealand, that will allow all the files on the CD-ROM
      to be copied to a magnetic disk drive. This utility can be
      obtained from the EMBL e-mail server (for additional
      information you may contact DataLib@EMBL-Heidelberg.DE). When
      copying files using CD_ACCESS, be sure to use the /BINARY
      qualifier to the copy command.

The CD-ROM does not mount properly on Silicon Graphics systems
running IRIX version 4.0.1. To resolve this problem, you need to
upgrade to version 4.0.2 or higher.

Because of ISO 9660 limitations on symbolic links, we were unable to
provide a directory on the CD-ROMs pointing to all entry files in the
subdirectories.
Therefore, we recommend that you create such a directory of links
yourself, on one of your local disk filesystems. Further detailed
instructions are included with each CD-ROM order.

----------------------------------------------------------------------
Affiliated Centers
----------------------------------------------------------------------

Ten affiliated centers offer DATAPRTP information for distribution.
These centers are members of the Protein Data Bank Service
Association (PDBSA). Centers designated with an asterisk (*) may
distribute DATAPRTP information both on-line and on magnetic or
optical media; those without an asterisk are on-line distributors
only.

  CAN/SND
     Canadian Scientific Numeric Data Base Service
     Ottawa, Ontario, Canada
     Roger Gough (613-993-3294)
     cansnd@vm.nrc.ca

  CAOS/CAMM
     Dutch National Facility for Computer Assisted Chemistry
     Nijmegen, The Netherlands
     Jan Noordik (31-80-653386)
     noordik@caos.caos.kun.nl

  CINECA
     NE Italy Interuniversity Computing Center
     Casalecchio di Reno (BO), Italy
     Salvatore Rago (39-51-598411)
     argo@icineca.bitnet

  EMBL
     European Molecular Biology Laboratory
     Heidelberg, Germany
     Peter Rice (49-6221-387-247)
     peter.rice@embl-heidelberg.de

* JAICI
     Japan Association for International Chemical Information
     Tokyo, Japan
     Hideaki Chihara (81-3-5978-3608)

  NCSA
     National Center for Supercomputing Applications
     University of Illinois at Urbana-Champaign
     Champaign, Illinois
     Marcia Miller (217-244-0634)
     mmiller@ncsa.uiuc.edu

* Osaka University
     Institute for Protein Research
     Osaka, Japan
     Yukiteru Katsube (81-6-877-5111 ext 3912)

  Pittsburgh Supercomputing Center
     Pittsburgh, Pennsylvania
     Hugh Nicholas (412-268-4960)
     nicholas@cpwpsca.bitnet

  SDSC
     San Diego Supercomputer Center
     San Diego, California
     Lynn Ten Eyck (619-534-8189)
     teneyckl@sdsc.bitnet

  SEQNET
     Daresbury Laboratory
     Warrington, United Kingdom
     User Interface Group (44-925-603351)
     uig@daresbury.ac.uk

----------------------------------------------------------------------
To Contact PDB
----------------------------------------------------------------------

Protein Data Bank
Chemistry Department, Building 555
Brookhaven National Laboratory
P.O. Box 5000
Upton, NY 11973-5000 USA

Telephone:  +1 516-282-3629
Facsimile:  +1 516-282-5751
e-mail:     pdb@bnl.bitnet or pdb@bnl.gov

Please include your telephone number, facsimile number, mailing
address and e-mail address in all correspondence.

----------------------------------------------------------------------
PDB Staff
----------------------------------------------------------------------

Enrique E. Abola, Science Coordinator
Frances C. Bernstein
Judith A. Callaway
Minette Cummings
Betty R. Deroski
Pamela A. Esposito
Arthur Forman
Thomas F. Koetzle
Patricia A. Langdon
Michael D. Libeson
Nancy O. Manning (Oeder)
John E. McCarthy
Regina K. Shea
John G. Skora
Karen E. Smith
David R. Stampf, Sr. Project Mgr.
Joel L. Sussman, Head
Dejun Xue

----------------------------------------------------------------------
Statement of Support
----------------------------------------------------------------------

PDB is supported by a combination of Federal Government Agency funds
(work supported by the U.S. National Science Foundation; the U.S.
Public Health Service, National Institutes of Health, National Center
for Research Resources, National Institute of General Medical
Sciences and National Library of Medicine; and the U.S. Department of
Energy under contract DE-AC02-76CH00016) and user fees.