Crystallographic database

Source: Wikipedia, the free encyclopedia.

A crystallographic database is a database specifically designed to store information about the structure of

molecules. They are characterized by symmetry, morphology, and directionally dependent physical properties. A crystal structure describes the arrangement of atoms, ions, or molecules in a crystal. (Molecules need to crystallize into solids so that their regularly repeating arrangements can be taken advantage of in X-ray, neutron, and electron diffraction based crystallography
.)

Crystal structures of crystalline material are typically determined from

single-crystal diffraction data and stored in crystal structure databases. They are routinely identified by comparing reflection intensities and lattice spacings from X-ray powder diffraction data with entries in powder-diffraction fingerprinting
databases.

Crystal structures of nanometer sized crystalline samples can be determined via

crystallites. They are stored in crystal structure databases specializing in nanocrystals and can be identified by comparing zone axis subsets in lattice-fringe fingerprint plots with entries in a lattice-fringe fingerprinting
database.

Crystallographic databases differ in access and usage rights and offer varying degrees of search and analysis capacity. Many provide structure visualization capabilities. They can be browser based or installed locally. Newer versions are built on the relational database model and support the Crystallographic Information File (CIF
) as a universal data exchange format.

Overview

cross-referencing
of tables. Cross-referencing serves to derive additional data or enhance the search capacity of the database.

Data exchange among crystallographic databases, structure visualization software, and structure refinement programs has been facilitated by the emergence of the Crystallographic Information File (CIF) format. The CIF format is the standard file format for the exchange and archiving of crystallographic data.[1] It was adopted by the International Union of Crystallography (IUCr), who also provides full specifications of the format.[2] It is supported by all major crystallographic databases.

The increasing automation of the

Portable Document Format (PDF
) files.

Crystal structure data in CIF format are linked to scientific articles as supplementary material. CIFs may be accessible directly from the publisher's website, crystallographic databases, or both. In recent years, many publishers of crystallographic journals have come to interpret CIFs as formatted versions of open data, i.e. representing non-copyrightable facts, and therefore tend to make them freely available online, independent of the accessibility status of linked scientific articles.

Trends

Trends of crystal structures in databases over the last decade.[3]

As of 2008, more than 700,000

databases. The publishing rate has reached more than 50,000 crystal structures per year. These numbers refer to published and republished crystal structures from experimental data. Crystal structures are republished owing to corrections for symmetry errors, improvements of lattice and atomic parameters, and differences in diffraction technique or experimental conditions. As of 2016, there are about 1,000,000 molecule and crystal structures known and published, approximately half of them in open access
.

Crystal structures are typically categorized as

metalloproteins
tend to be attributed to organics or biological macromolecules, respectively. Nucleic acids are a subset of biological macromolecules.

Comprehensiveness can refer to the number of entries in a database. On those terms, a crystal structure database can be regarded as comprehensive, if it contains a collection of all (re-)published crystal structures in the category of interest and is updated frequently. Searching for structures in such a database can replace more time-consuming scanning of the open literature. Access to crystal structure databases differs widely. It can be divided into reading and writing access. Reading access rights (search, download) affect the number and range of users. Restricted reading access is often coupled with restricted usage rights. Writing access rights (upload, edit, delete), on the other hand, determine the number and range of contributors to the database. Restricted writing access is often coupled with high data integrity.

In terms of user numbers and daily access rates, comprehensive and thoroughly vetted

mineralogical databases and educational offshoots of the COD
.

and predicted crystal structures are collected in tailored special structure databases.

Search

Search capacities of

in search strings. If supported, the scope of the search can be constrained by the exclusion of certain chemical elements.

More sophisticated algorithms depend on the material type covered.

molecular structures (biological macromolecules). Search algorithms used for a more complex analysis of physical properties, e.g. phase transitions or structure-property relationships, might apply group-theoretical
concepts.

Modern versions of crystallographic databases are based on the

search engines
.

Crystal phase identification

molecules is defined by a single crystal structure in one orientation. Twin crystals, on the other hand, consist of single-crystalline twin domains, which are aligned by twin laws and separated by domain walls
.

Polycrystals are made of a large number of small single crystals, or

crystallites, held together by thin layers of amorphous solid
. Crystal powder is obtained by grinding crystals, resulting in powder particles, made up of one or more crystallites. Both polycrystals and crystal powder consist of many crystallites with varying orientation.

Crystal phases are defined as regions with the same crystal structure, irrespective of orientation or twinning. Single and twinned crystalline specimens therefore constitute individual crystal phases. Polycrystalline or crystal powder samples may consist of more than one crystal phase. Such a phase comprises all the crystallites in the sample with the same crystal structure.

Crystal phases can be identified by successfully matching suitable crystallographic parameters with their counterparts in database entries. Prior knowledge of the chemical composition of the crystal phase can be used to reduce the number of database entries to a small selection of candidate structures and thus simplify the crystal phase identification process considerably.

Powder diffraction fingerprinting (1D)

Applying standard

structure determination
process more difficult, if not impossible.

Powder Diffraction File (PDF) database. The list of d-I data pairs is highly characteristic of a crystal phase and, thus, suitable for the identification, also called ‘fingerprinting’, of crystal phases.[16]

Search-match algorithms compare selected test reflections of an unknown crystal phase with entries in the database. Intensity-driven algorithms utilize the three most intense lines (so-called ‘Hanawalt search’), while d-spacing-driven algorithms are based on the eight to ten largest d-spacings (so-called ‘Fink search’).[17]

X-ray powder diffraction fingerprinting has become the standard tool for the identification of single or multiple crystal phases and is widely used in such fields as

pharmaceutical sciences
.

Lattice-fringe fingerprinting (2D)

reciprocal space
, i.e. by applying
single-crystal electron diffraction
techniques.

High-Resolution Transmission Electron Microscopy (HRTEM) provides images and diffraction patterns of nanometer sized crystallites. Fourier transforms of HRTEM images and electron diffraction patterns both supply information about the projected reciprocal lattice geometry for a certain crystal orientation, where the projection axis coincides with the optical axis of the microscope.

Projected lattice geometries can be represented by so-called ‘lattice-fringe fingerprint plots’ (LFFPs), also called angular covariance plots.[18] The horizontal axis of such a plot is given in reciprocal lattice length and is limited by the point resolution of the microscope. The vertical axis is defined as acute angle between Fourier transformed lattice fringes or electron diffraction spots. A 2D data point is defined by the length of a reciprocal lattice vector and its (acute) angle with another reciprocal lattice vector. Sets of 2D data points that obey Weiss's zone law are subsets of the entirety of data points in an LFFP. A suitable search-match algorithm using LFFPs, therefore, tries to find matching zone axis subsets in the database. It is, essentially, a variant of a lattice matching algorithm.[19]

In the case of electron diffraction patterns, structure factor amplitudes can be used, in a later step, to further discern among a selection of candidate structures (so-called 'structure factor fingerprinting'). Structure factor amplitudes from electron diffraction data are far less reliable than their counterparts from X-ray single-crystal and powder diffraction data. Existing precession electron diffraction techniques greatly improve the quality of structure factor amplitudes, increase their number and, thus, make structure factor amplitude information much more useful for the fingerprinting process.[20]

Fourier transforms of HRTEM images, on the other hand, supply information not only about the projected reciprocal lattice geometry and structure factor amplitudes, but also structure factor phase angles. After crystallographic image processing,[21] structure factor phase angles are far more reliable than structure factor amplitudes. Further discernment of candidate structures is then mainly based on structure factor phase angles and, to a lesser extent, structure factor amplitudes (so-called 'structure factor fingerprinting').[22][23]

Morphological fingerprinting (3D)

The Generalized

Steno Law[24] states that the interfacial angles between identical faces of any single crystal of the same material are, by nature, restricted to the same value.[25] This offers the opportunity to fingerprint crystalline materials on the basis of optical goniometry, which is also known as crystallometry.[26] In order to employ this technique successfully, one must consider the observed point group symmetry of the measured faces and creatively apply the rule that "crystal morphologies are often combinations of simple (i.e. low multiplicity) forms where the individual faces have the lowest possible Miller indices for any given zone axis
". This shall ensure that the correct indexing of the crystal faces is obtained for any single crystal.

It is in many cases possible to derive the ratios of the crystal axes for crystals with low symmetry from optical goniometry with high accuracy and precision and to identify a crystalline material on their basis alone employing databases such as 'Crystal Data'.

databases: the 'Bestimmungstabellen für Kristalle (Определитель Кристаллов)'[28] and the 'Barker Index of Crystals'.[29]

Since Steno's Law can be further generalized for a single crystal of any material to include the angles between either all identically indexed net planes (i.e. vectors of the reciprocal lattice, also known as 'potential reflections in diffraction experiments') or all identically indexed lattice directions (i.e. vectors of the direct lattice, also known as zone axes), opportunities exist for morphological fingerprinting of nanocrystals in the transmission electron microscope (TEM) by means of transmission electron goniometry.[30]

The specimen goniometer of a TEM is thereby employed analogously to the goniometer head of an optical goniometer. The optical axis of the TEM is then analogous to the reference direction of an optical goniometer. While in optical goniometry net-plane normals (reciprocal lattice vectors) need to be successively aligned parallel to the reference direction of an optical goniometer in order to derive measurements of interfacial angles, the corresponding alignment needs to be done for zone axes (direct lattice vector) in transmission electron goniometry. (Note that such alignments are by their nature quite trivial for nanocrystals in a TEM after the microscope has been aligned by standard procedures.)

Since transmission electron goniometry is based on

diffraction pattern or from the Fourier transform
of a high resolution TEM image that shows crossed lattice fringes.

Lattice matching (3D)

polycrystal
diffraction data via profile fitting without structural model (so-called 'Le Bail method').

Arbitrarily defined unit cells can be transformed to a standard setting and, from there, further reduced to a primitive smallest cell. Sophisticated algorithms compare such reduced cells with corresponding database entries. More powerful algorithms also consider derivative super- and subcells. The lattice-matching process can be further sped up by precalculating and storing reduced cells for all entries. The algorithm searches for matches within a certain range of the lattice parameters. More accurate lattice parameters allow a narrower range and, thus, a better match.[31]

Lattice matching is useful in identifying crystal phases in the early stages of single-crystal diffraction experiments and, thus, avoiding unnecessary full data collection and structure determination procedures for already known crystal structures. The method is particularly important for single-crystalline samples that need to be preserved. If, on the other hand, some or all of the crystalline sample material can be ground, powder diffraction fingerprinting is usually the better option for crystal phase identification, provided that the peak resolution is good enough. However, lattice matching algorithms are still better at treating derivative super- and subcells.

Visualization

Newer versions of

crystallographic databases may provide morphology or tensor
visualization output.

Crystal structures

The

molecules in a crystal. The unit cell represents the simplest repeating unit of the crystal structure. It is a parallelepiped containing a certain spatial arrangement of atoms, ions, molecules, or molecular fragments. From the unit cell the crystal structure can be fully reconstructed via translations
.

The visualization of a crystal structure can be reduced to the arrangement of atoms, ions, or molecules in the unit cell, with or without cell outlines. Structure elements extending beyond single unit cells, such as isolated

molecular
or polyhedral units as well as chain, net, or framework structures, can often be better understood by extending the structure representation into adjacent cells.

The space group of a crystal is a mathematical description of the symmetry inherent in the structure. The motif of the crystal structure is given by the asymmetric unit, a minimal subset of the unit cell contents. The unit cell contents can be fully reconstructed via the symmetry operations of the space group on the asymmetric unit. Visualization interfaces usually allow for switching between asymmetric unit and full structure representations.

ionic, hydrogen, or other bonds including hybrid forms. Bond angles can be deduced from the bond vectors in groups of atoms or ions. Bond distances and angles can be made available to the user in tabular form or interactively, by selecting pairs or groups of atoms or ions. In ball-and-stick models
of crystal structures, balls represent atoms and sticks represent bonds.

Since

Protein crystallographers are interested in molecular structures of biological macromolecules, so that provisions need to be made to be able to represent molecular subunits as helices, sheets, or coils
, respectively.

Crystal structure visualization can be integrated into a

Web-based crystallographic databases can integrate crystal structure visualization capability.[34] Depending on the complexity of the structure, lighting, and 3D effects, crystal structure visualization can require a significant amount of processing power, which is why the actual visualization is typically run on the client
.

Currently, web-integrated crystal structure visualization is based on

web browsers, often supporting wide color spectra (up to 32 bit) and window size adaptation. However, web-generated crystal structure images are not always suitable for publishing due to issues such as resolution depth, color choice, grayscale contrast, or labeling (positioning, font type, font size).[36]

Morphology and physical properties

Mineralogists, in particular, are interested in morphological appearances of individual crystals, as defined by the actually formed crystal faces (tracht) and their relative sizes (habit). More advanced visualization capabilities allow for displaying surface characteristics, imperfections inside the crystal, lighting (reflection, shadow, and translucency), and 3D effects (interactive rotatability, perspective, and stereo viewing).[37][38]

physical properties of crystals. The directional dependence of a crystal's physical property is described by a 3D tensor and depends on the orientation of the crystal. Tensor shapes are more palpable by adding lighting effects (reflection and shadow). 2D sections of interest are selected for display by rotating the tensor interactively around one or more axes.[39]

Crystal morphology or physical property data can be stored in specialized databases or added to more comprehensive crystal structure databases. The Crystal Morphology Database (CMD) is an example for a web-based crystal morphology database with integrated visualization capabilities.

See also

References

  1. PMID 12037350
    .
  2. .
  3. ^ Sources:
  4. PMID 12037354
    .
  5. PMID 12037357.{{cite journal}}: CS1 maint: multiple names: authors list (link
    )
  6. .
  7. .
  8. .
  9. .
  10. .
  11. .
  12. .
  13. doi:10.1016/s0925-8388(98)00605-7.{{cite journal}}: CS1 maint: multiple names: authors list (link
    )
  14. .
  15. S2CID 29840379.{{cite journal}}: CS1 maint: multiple names: authors list (link
    )
  16. .
  17. .
  18. .
  19. ].
  20. .
  21. .
  22. .
  23. ^ US patent 8131481, Peter Moeck, "Database Supported Nanocrystal Structure Identification by Lattice-Fringe Fingerprinting with Structure Factor Extraction" 
  24. ^ N. Steno (1923) [1669]. De solido intra solidum naturaliter contento dissertations prodromus [Vorläufer einer Dissertation über feste Körper, die innerhalb anderer fester Körper von Natur aus eingeschlossen sind]. Translated by Karl Mieleitner. Florence.{{cite book}}: CS1 maint: location missing publisher (link)
  25. ^ J. B. L. Romé de l’Isle (1783). Cristallographie, ou description des formes propres à tous les corps du règne minéral. Vol. 4. Paris.
  26. ^ P. Terpstra; L. W. Codd (1961). Crystallometry. New York: Academic Press.
  27. ^ J. D. H. Donnay, C. Donnay, E. G. Cox, O. Kennard, M. V. King, Crystal Data, Monography 5, American Crystallographic Association, Washington, William and Heintz, 1963.
  28. ^ A. K. Boldyrew and W. W. Doliwo-Dobrowolsky, Bestimmungstabellen für Kristalle (Определитель Кристаллов), Vol. I, Part 1, Einleitung, Tetragyrische Syngonie; W.W. Doliwo-Dobrowolsky and G.P. Preobraschensky, Vol. I, Part 2, Trigyrische and Hexagyrische Syngonien allgemeine Ergänzungen zu den mittleren Syngonien, Zentrales Wissenschaftliches Institut der Geologie und Schürfung, Leningrad and Moscow, 1937 and 1939.
  29. ^ M. W. Porter and R. C. Spiller, The Barker Index of Crystals, Vol. I and II, W. Heffer and Sons, Cambridge, 1951 and 1956; M. W. Porter and W. L. Codd, The Barker Index of Crystals, Vol. III, W. Heffer and Sons, Cambridge, 1964.
  30. .
  31. ^ Frank H. Allen, Günther Bergerhoff, Rolf Sievers (1987). Crystallographic Databases. Chester: IUCr.
  32. PMID 12037360
    .
  33. .
  34. ^ Moeck Peter, Čertík Ondřej, Upreti Girish, Seipel Björn, Harvey Morgan, Garrick William, Fraundorf Philip (2006). "Crystal Structure Visualizations in three Dimensions with Support from the Open Access Nano-Crystallography Database". Journal of Materials Education. 28 (1): 83–90.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  35. .
  36. .
  37. .
  38. .
  39. ^ Kaminsky Werner (2000). "Wintensor: Ein WIN95/98/NT Programm zum Darstellen tensorieller Eigenschaften". Zeitschrift für Kristallographie Supplement. 17: 51.

External links

Crystal structures

Crystal phase identification

Specialized databases