Rights / License: Research Collection In Copyright - Non ... · 2.1 Periodic MinimalSurfaces...

Research Collection

Doctoral Thesis

Structure envelopes and their application in structuredetermination from powder diffraction data

Author(s): Brenner, Simon

Publication Date: 1999

Permanent Link: https://doi.org/10.3929/ethz-a-003839470

Rights / License: In Copyright - Non-Commercial Use Permitted

This page was generated automatically upon download from the ETH Zurich Research Collection. For moreinformation please consult the Terms of use.

ETH Library

https://doi.org/10.3929/ethz-a-003839470

http://rightsstatements.org/page/InC-NC/1.0/

https://www.research-collection.ethz.ch

https://www.research-collection.ethz.ch/terms-of-use

Dissertation ETH Nr. 13280

Structure Envelopes and their Application in Structure

Determination from Powder Diffraction Data

Dissertation

submitted to the

Swiss Federal Institute of Technology

Zurich

for the degree of Doctor of Natural Sciences

Presented by

Simon Brenner

Dipl. Kristallograph (University of Leipzig)

born July 1, 1965 in Germany

Accepted on the recommendation of

Prof. Dr. W. Steurer

Dr. L.B. McCusker

Dr. Jordi Rius

1999

I

Abstract

In this study, the possibility of using periodic nodal surfaces (PNS) to facilitate structure

determination from powder diffraction data was investigated. PNS are three-dimensional

curved surfaces that describe the nodes of a density distribution and they have been used to

describe physical and chemical features of various compounds with known crystal structures.

It was hoped that such surfaces could be derived for materials with unknown crystal structures

using just the data in the measured powder diffraction pattern, and that they could then be used

in the structure determination process.

The reflection intensities in a powder pattern are dependent upon the electron density

distribution within the unit cell. It was found that by assigning the correct phases to the

structure factors of a few (1-5) strong reflections m the low-angle (high «-/-spacing, low-

resolution) region of the powder pattern, a density distribution similar to a low-resolution

electron density map could be calculated. The PNS for that density distribution separates the

regions of high and low electron densities. That is, the PNS envelops the crystal structure. For

this special PNS, the term "structure envelope" has been adopted.

The reflections needed to generate a structure envelope can be selected by following some

simple rules, and this has been demonstrated for a number of zeolite structures. In most cases,

no more than five reflections are required to calculate an envelope that describes the coarse

features of the crystal structure. To calculate the structure envelope, the phases of the

reflections used must be known. In some cases, only the origin-defining reflections (odr),

whose structure factors can phased arbitrarily, are needed, but usually a few additional

reflections arc required and their phases must be determined

A computer program (SayPerm) to estimate the phases of the structure factors needed for the

structure envelope generation has been written. It combines the pseudo-atom approach, the

application of the Sayre equation and phase permutation. Pseudo-atoms replace building units

in a structure (e.g. Si04 tetrahedron in a silicate framework) to simulate atomic resolution at

high d-spacing. SayPerm is a multisolution approach in which a certain number of phase sets

are generated using error correcting codes (ecc's). These codes sample phase space in a very

efficient manner (e.g. for 14 centrosymmetric reflections only 256 of the possible 2J4 = 16384

permutations are examined and one of the phase sets will have at most two incorrect phases).

From each phase set, a phase extension using the Sayre equation is performed. The results are

then ranked by a figure-of-merit that is calculated from the validity of the Sayre equation. II

n-

could be shown that for moderate-sized centrosymmetric structures (< 10 building units

replaced by pseudo-atoms per asymmetric unit), which can be well approximated by an equal

pseudo-atom structure (e.g. tetrahedral and octahedral structures), the phases for the strongest

structure factors can be estimated reliably. SayPerm was tested on two zeolite structures

(RUB-3 and 1TQ-1), and on 0-AlF3, which has octahedral building units. It was then used to

generate quite an informative envelope for a previously unknown tri-ß-peptidc structure

(sa322) with the chemical formula C^NßOgH^.

An alternative to the Sayre-equation approach is to use permutation synthesis. Here, phases of

only seven strong structure factors are permuted. This is done using the Hamming [7,4,3 j error

correcting code. From each of the 16 structure factor sets obtained, a Fourier map is generated.

Using chemical knowledge (e.g. size and shape of an organic molecule) there is a good chance

that the map that corresponds to the correct phase set can be recognized by eye. The method

was demonstrated on the organic structure Cimetidine (CjQNgSjHjg). For the tri-ß-peptide

structure, the results of the envelope generation mentioned above could be confirmed.

For methods of structure solution that work in direct space, a structure envelope is quite useful,

because it limits the volume m a unit cell where the atoms of the crystal structure are likely to

be located. To test its effectiveness, structure envelope masks were implemented in the

program FOCUS, which was written specifically for the determination of zeolite structures

from powder data. Exhaustive grid searches for four zeolite topologies (APD, SGT, RTE, and

MWW) using structure envelopes were performed. For comparison, these tests were repeated

without the structure-envelope mask. The amount of computer time required for the search for

the APD and SGT topologies was reduced by as much as two orders of magnitude when the

envelope mask was used. To find the RTE and MWW topologies, the envelope proved to be

essential. Without the envelope mask, those grid searches were not successful even after two

days of computing time, whereas with the mask they were found in 25 min and 27 min,

respectively.

The structure envelope for the tri-ß-peptide sa322 calculated using SayPerm was used in a

direct-space search for the structure. A simulated-annealing computer program (SAFE) was

developed for this purpose. It combines simulated annealing with a structure-envelope mask.

The molecule is moved within the unit cell via a simulated annealing algorithm to find the

conformation and orientation of the molecule that produces the lowest weighted profile R-

factor. Simultaneously, the molecule is encouraged to he within the structure envelope. This is

done by imposing a penalty function that is also controlled by the simulated annealing

— Ill —

algorithm. Using this approach, the 41 non-H-atom structure with 23 degrees of freedom (three

positional, three orientational, 17 torsion angle parameters) could be solved.

IV

Zusammenfassung

Im Rahmen der vorliegenden Dissertation wurde untersucht ob periodische Knotenflächen

(PNS) die Chancen für die Lösung von Kristallstrukturen aus Röntgenpulverdaten erhöhen.

Periodische Knotenflächen sind Flächen, die die Nullstellen einer DichteVerteilungsfunktion

im dreidimensionalen Raum verbinden. Mit diesen Flächen lassen sich in beeindruckender

Weise verschiedene physikalische und chemische Eigenschaften innerhalb einer vorher

bekannten Kristallstruktur beschreiben. Wenn es gelänge, solche Flächen für unbekannte

Strukturen einzig und allein aus den gemessenen Daten eines Pulverdiffraktogramms, zu

berechnen, könnte das für die Stmkturaufklärung dieser Strukturen sehr nützlich sein.

Die Intensitäten der einzelnen Reflexe m einem Pulverdiffraktogramm werden hauptsächlich

durch die ElektronendichteVerteilung innerhalb der Einheitszelle bestimmt. Wenn man den

Strukturfaktoren einiger der stärksten Reflexe aus dem Niedrigwinkelbereich (mit hohen d-

Werten) die richtigen Phasen zuordnet, kann eine DichteVerteilung berechnet werden, die der

einer sehr niedrig aufgelösten Elektroncndichtekarte ähnlich ist. Die Fläche aus den

Nullstellen der Verteilungsfunktion trennt hohe und niedrige Elektronendichten der

entsprechenden Kristallstruktur. Somit kann man sagen, dass es sich bei diesen speziellen

Periodischen Knotenflächen um Struktureinhüllende (Structure Envelope) handelt.

Die Reflexe, deren Strukturfaktoren in die Berechnung der Struktureinhüllenden eingehen,

können sehr leicht ausgewählt werden. In den meisten Fällen sind nicht mehr als fünf

Strukturfaktoren notwendig, um eine Fläche zu berechnen, die das Wesentliche einer

Kristallstruktur in niedriger Auflösung beschreibt. Um eine Struktureinhüllende zu erzeugen

müssen die Phasen der Strukturfaktoren bekannt sein. Manchmal genügt es ausschliesslich die

Strukturfaktoren der Reflexe zu benutzen, die den Ursprung definieren. Die entsprechenden

Phasen können dann willkürlich festgelegt werden. In den meisten Fällen aber sind zur

Berechnung einer informativen Strukturcinhüllenden zusätzliche Strukturfaktoren notwendig,

deren Phasen zuerst bestimmt werden müssen.

Zu diesem Zweck ist ein Computerprogramm (SayPerm) entwickelt worden, das die

Pscudoatommethode, die Anwendung der Sayrcgleichung und Phasenpermutationen

kombiniert. Pseudoatome ersetzen Baueinheiten der Kristallstruktur (z.B. Si04 Tetraeder in

einem Silikatgerüst) um atomare Auflösung von Daten mit niedriger Auflösung zu simulieren.

Die Phasenpermutationen werden über Fehlerkorrekturcodcs (ecc's) kontrolliert, die eine sehr

effiziente Abrasterung des durch die permutierten Phasen aufgespannten Raumes

(Phasenraum) gewährleisten. So müssen z.B. fur 14 zentrosymmtrische Strukturfaktoren statt

V

der 2 = 16384 möglichen Phasenkombinationen nur 256 geprüft werden. Trotz der

unvollständigen Abrasterung des Phasenraums weicht jede der resultierenden

Phasenkombination in höchstens zwei Stellen von einer entsprechenden Kombination ab, die

aus einer systematischen Permutation resultieren würde.

Von jeder Phasenkombination wird eine Phasenerweiterung auf weitere Strukturfaktoren

ausgeführt. Dazu wird die Sayregleichung verwendet. Die Qualität der Phasierung wird dann

über die Gültigkeit der Sayregleichung beurteilt. Es hat sich gezeigt, dass auf diese Weise die

Phasen für die Strukturfaktoren der stärksten Reflexe für zentrosymmetrische, nicht zu

komplexe Strukturen (< 10 durch Pseudoatome zu ersetzende Baueinheiten) relativ zuverlässig

berechnet werden können. Ausserdem sollten die Strukturen gut durch Pseudoatome

approximierbar sein. Unter anderen wurde die Methode an zwei Zeolithstrukturen (RUB-3 und

ITQ-1) und an einer aus Oktaedern aufgebauten Struktur (8-A3F3) getestet. Zusätzlich konnte

mit Hilfe des SayPerm-Programmes von einer unbekannten Tri-peptide-Struktur (Sa322,

C32N3O6H53) eine sehr informative Struktureinhüllende berechnet werden.

Eine Alternative zur SayPerm-Methode ist die Anwendung der s.g. Permutationssynthese.

Dabei werden nur sieben Phasen von Strukturfaktoren mit grosser Amplitude, gesteuert vom

Hamming[7,4,3]-Fehlerkorrekturcode, permutiert. Von jedem der 16 permutierten Phasensätze

wird eine Fourierkarte aus den entsprechenden Strukturfaktoren generiert. Wenn bestimmte

chemische Informationen (z.B. Grösse und ungefähre Form eines organischen Moleküls)

einbezogen werden, hat man eine gute Chance, aus den 16 Fourierkarten, die richtige

auszuwählen zu können. Diese Methode ist an der Kristallstruktur des organischen Moleküls

Cimetidin ausprobiert worden. Über die Permutationssynthese konnte auch das mit der

SayPerm-Methode erzielte Ergebnis für die Berechnung einer Struktureinhüllenden für Sa322

bestätigt werden.

Weil eine Struktureinhüllende das Volumen in der Einheitszelle, in dem sich die Atome einer

Kristallstruktur aller Wahrscheinlichkeit nach aufhalten, begrenzt, kann es für den Erfolg einer

im direkten Raum agierenden Kristallstrukturbestimmungmethode von entscheidender

Bedeutung sein. In das auf die Lösung von Zeolithstrukturen spezialisierte Programm FOCUS

wurde die Möglichkeit der Volumenbegrenzung über eine Struktureinhüllende implementiert

und bei einer automatischen Abrasterung der Lösungsraumes (exhaustive gridsearch) an vier

Zeolith Topologien (APD, SGT, RTE und MWW) getestet. Zum Vergleich wurden die

gleichen Tests ohne die Struktureinhüllenden wiederholt. Es zeigte sich, dass bei Benutzung

der Struktureinhüllenden die notwendige Rechenzeit bei zwei der Teststrukturen (APD, SGT)

VI

um zwei Grössenordnungen gesenkt werden konnte. Für das Auffinden der RTE- und MWW-

Topologien war die Anwendung der Struktureinhüllenden sogar Grundvoraussetzung. Ohne

sie war die Suche nach der Struktur nach zwei Tagen noch nicht beendet, wohingegen die

richtigen Topologien beim Einsatz der Struktureinhüllenden schon nach 25 b.z.w. 27 min

beendet war.

Die für die Tri-peptidstruktur sa322 mit der SayPerm-Methode berechnete Struktureinhüllende

wurde für die Lösung der Struktur im direkten Raum benutzt. Dazu wurde das

Computerprogramm SAFE entwickelt, dass einen "Simulated Annealing" Algorithmus mit

einer Beschränkung des Volumens durch eine Struktureinhüllende kombiniert. Das Molekül

wird, gesteuert vom "Simulated Annealing" Algorithmus, durch die Zelle bewegt, um die

Orientierung und Konformation zu finden, aus der der niedrigste gewichtete Profile-/?-Wert

resultiert. Gleichzeitig wird ein Gütefaktor, der die Lage des Moleküls innerhalb der

Straktureinhüllenden beschreibt, ebenfalls über einen "Simulated Annealing" Algorithmus

minimal gehalten, so dass das Molekül dazu tendiert, seine Einhüllende nicht zu verlassen. Mit

dieser Methode konnte die Tri-peptidstruktur mit seinen 41 Nichtwasserstoffatomen und 23

Freiheitsgraden (drei für die Position, drei für die Orientierung und 17 für die freien

Torsionswinkcl) gelöst werden.

— VII —

Table of contents

1.1 Structural investigations of chemical compounds 1

1.2 Stinctural investigations using X-ray powder diffraction techniques 1

1.3 Structure determination from powder data 2

1.3.1 Single-crystal methods applied to powders 2

1.3.2 Structure determination in direct space 4

1.4 Zeolites 5

1.5 Periodic minimal and nodal surfaces 6

1.6 Overview of the project 6

2 Structure envelopes ,7

2.1 Periodic Minimal Surfaces (PMS) and crystal structures 7

2.2 Periodic Nodal Surfaces (PNS) and crystal structures 7

2.3 From a PNS to a crystal structure? 9

2.4 Generation of a structure envelope 11

2.5 Reflection selection for the calculation of a PNS 13

2.6 Application to non-zeolite structures 15

3 Solving the phase problem for structure envelope generation 17

3.1 Introduction 17

3.2 The Sayre equation 20

3.3 The Pseudo-atom method 20

3.4 Phase permutations 22

3.4.1 Sampling the pliase space with error correcting codes (ecc's) 23

4 Phase estimation using the Sayre equation 26

4.1 Introduction 26

4.2 Data collection and reduction 26

4.3 The SayPerm procedure 28

4.3.1 Data preparation 28

4.3.2 Phase extension 29

4.3.3 Phase set evaluation 29

4.4 Test structure RUB-3 (RTE topolpgy) 30

4.4.1 Data measurement and preparation 30

4.4.2 SayPerm inputfile 30

4.4.3 SayPerm run 32

4.4.4 Results 34

4.5 Test structure TTQ-1 (MWW topology) 35

4.5.1 Data measurement and preparation 35


4.5.3 Results 37

4.6 Test structure 9-AlF^ 39



4.6.3 Results 40

4.7 Tri-ß-peptide C32N,06H5, (sa322) 42

4.7.1 Measurement, data preparation, andfirst attempts at structure solution 42

4.7.2 SayPerm run and map evaluation 44

VIII

4.7.3 Results 44

4.8 Limitations of the SayPerm Approach 47

5 Phase estimation by the method of permutation synthesis 49

5.1 Introduction 49

5.2 Test structure Cimetidine 50


5.2.2 Application of the permutation synthesis 51

5.2.3 Selection of the best Fourier map 54

5.3 Tri-ß-peptide C32N306H53 (sa322) 55

5.3.1 Selection of the best Fourier map 55

5.4 Conclusions 56

6 Determination of zeolite structures using structure envelopes 60

6.1 Introduction 60

6.2 Topology search for zeolite structures with a structure envelope 60

6.3 Test examples 62

6.4 Structure envelopes with the full FOCUS approach 64

6.5 Conclusions 65

7 From structure envelopes to organic crystal structures 66

7.1 Introduction 66

7.2 Direct-space approaches to crystal-structure determination 66

7.2.1 Model generation techniques for organic structures 67

7.2.2 Model modification control 67

7.2.3 Comparison of diffraction data 69

7.3 Simulated annealing, fragment search and structure envelopes 70

7.4 The program SAFE 70

7.4.1 Input 71

7.4.2 Variation of the trial structure 71

7.4.3 Model construction 71

7.4.4 Checking for a chemically reasonable structure 73

7.4.5 Evaluation of the fit to the powder pattern and/or the structure envelope 74

7.4.6 Acceptance ofmoves and temperature control 75

7.5 Structure of the tri-ß-peptide sa322 75

7.5.1 Combination of chemical information with a structure envelope 75

7.5.2 SAFE inputfile ! 77

7.5.3 SAFE run 81

7.5.4 Refinement of the crystal structure 81

8 Conclusions 84

9 Possible developments of the structure envelope approach 86

10 References 88

1 Introduction

1.1 Structural investigations of chemical compounds

The properties of a material are determined primarily by its atomic structure. To elucidate a

structure, various methods are available, but the most powerful methods for structural

investigations are nuclear magnetic resonance spectroscopy (NMR) and X-ray diffraction

techniques. To use the latter in a routine manner, single crystals of sufficient quality and size

are required. Then the whole chemical structure can be derived from the measured data.

Thousands of crystal structures are determined m this way every year. Single-crystal X-ray

diffraction is one of the most commonly used analytical methods in chemical laboratories. If

single crystals are not available, NMR can still provide a considerable amount of structural

information. In particular, connectivity information can be gleaned, even if the compound is in

solution or in the form of a powder. In the last ten years, a rapid development in NMR

techniques has taken place, but even so it is not usually possible to obtain a complete three-

dimensional structure from NMR data.

1.2 Structural investigations using X-ray powder diffraction techniques

A possible solution to this problem is to use powder diffraction techniques. Even if single

crystals cannot be grown, quite often a polycrystalline powder can be obtained. With some

effort, surprisingly detailed structural information can be extracted from the diffraction

patterns of such materials.

In a polycrystalline powder, millions of small crystallites are present in different orientations.

The diffraction pattern from a powder then is simply a superposition of millions of single

crystal diffraction patterns. A powder pattern can be described as a projection of three-

dimensional single crystal diffraction data onto one dimension. A consequence of this

projection is that symmetrically independent reflections with similar öf-spacings (similar

diffraction angle 29) overlap, and this results m a loss of information. Nevertheless, the one-

dimensional powder pattern can be used as a fingerprint of a crystalline compound. It is

possible to compare two compounds or to carry out a quantitative analysis of mixtures, if the

powder patterns of the components are known.

The positions of the reflections in a powder pattern are determined by the lattice constants and

the relative intensities of the reflections by the position of the atoms in the unit cell. The shape

of the peaks in the pattern depends upon the geometry of the instrument and the quality of the

sample. Peak shapes are usually described in terms of a symmetric function (e.g. pseudo-Voigt

1

which is a linear combination of a Gaussian and a Lorentzian function) with a certain full-

width at half maximum (FWHM) and a geometry-dependent asymmetry. From these

parameters (unit cell, atomic positions, and peakshape), a powder pattern can be simulated.

This can be done for any structural model, and the better the match between simulated and the

measured pattern, the more probable the model.

Rietveld (1969) used this concept to develop a whole-pattern structure refinement method.

That development was a crucial step in the advance of structural analysis using powder data.

Given a structural model, the positions of the atoms can be refined using a least squares

algorithm to obtain a better fit between the calculated and the observed patterns. The problems

in this approach are the exact descriptions of the peak shape and the background. Furthermore,

impurities in the powder sample or the presence of disorder can severely hinder or even

prevent a successful Rietveld refinement. The advantage is that overlapping reflections do not

have to be deconvoluted, because each point of the observed and calculated pattern, and not

the individual structure factors, are compared. The better the resolution of the pattern and the

higher the information content, the more reliable the refinement. Structures with up to 100

parameters can be refined routinely, and with more effort up to 200 parameters can be

handled. The Rietveld method is the standard technique for structure refinement using powder

data.

1.3 Structure determination from powder data

For a Rietveld refinement, a starting structural model close to the final structure is required.

This model must first be determined, so the development of methods to solve crystal

structures using the structural information in a powder pattern is of high interest. Many groups

are actively pursuing this theme, and impressive developments in structure solution from

powder data have been achieved. Two basic approaches have been used to date:

(1) Extraction of individual reflection intensities from the powder pattern followed by an

application of conventional single-crystal methods, such as direct methods or Patterson

methods

(2) Direct-space approaches in which powder patterns calculated for (computer generated)

structural models arc compared with the observed data

1.3.1 Single-crystal methods applied to powders

For the application of traditional single-crystal methods, the extraction of the reflection

intensities to obtain single-crystal-like data is the key. The extraction approaches available

use a Rictveld-Jike whole pattern refinement, in which the reflection intensities rather than

3 —

atomic positions are "refined". Pawley (1981) developed a computer algorithm (ALLHKL) for

a least-squares approach, but instabilities arise for strongly overlapping reflections. A more

robust extraction procedure was proposed by Le Bail (1988), in which the intensities are

adjusted in an iterative manner. This approach has been implemented in various Rietveld

packages such as GSAS (von Dreele, 1990), and EXTRAC (Baerlocher, 1990) in XRS

(Baerlochcr & Hepp, 1982).

However, because of the overlap problem, the extracted data are of much lower quality than

single-crystal data, so the chances of a successful application of Patterson or direct methods

decreases dramatically with the degree of overlap. To overcome this problem, various methods

to improve the quality of the extracted data have been developed. The simplest approach is to

equipartition the reflections that overlap, but this necessarily leads to incorrect intensities for

these reflections. In the FIPS approach developed by Estermann & Grämlich (1993), Patterson

maps, generated from extracted structure factor amplitudes are squared and then back

transformed to obtain a better partitioning of the overlapping reflections. This is done in an

iterative manner until an optimum is found.

Sivia & David (1994) used Bayesian statistics to improve the Pawley extraction approach. By

using prior information in the profile-fitting process, the instabilities in the least-squares

algorithm can be eliminated and overlapping reflections can be deconvoluted. An important

aspect of this algorithm, is that a standard deviation can be calculated for each reflection

intensity. A special approach for the deconvolution of systematically (exactly) overlapping

reflections was described by Rius, Miraviriles, Gies & Amigo (1999). Using a modified direct

method sum function, phase refinement and peak deconvolution is performed simultaneously.

Approaches that allow the estimation of the structure-factor amplitude of a reflection using

other reflections in the data set, such as direct-method approaches (e.g. Jansen, Peschar &

Schenk 1992, Cascarno, Favia & Giacovazzo, 1992, Dorset, 1997) or maximum entropy (e.g.

Gilmore, 1996) can also be used for the estimation of the relative intensities of overlapping

reflections.

Despite these advances in the development of analytical deconvolution techniques, a general

solution to the problem of peak overlap has not been found. An alternative to the

computational methods described above, is to address the problem experimentally.

Synchrotron radiation offers X-ray beams with a higher intensity and smaller divergence than

laboratory sources. This allows data to be collected at higher resolution, so overlapping peaks

can be better deconvoluted. Anisotropic thermal expansion (Shankland, David & Sivia, 1997)

_4

can also be exploited. Several data collections are performed at slightly different temperatures.

If the expansion of the three unit-cell axes occurs anisotropically as the temperature increases

and no phase transition occurs, the relative positions of the reflections will change without

affecting the relative intensities significantly. Thus, by extracting reflection intensities from all

patterns simultaneously, a better data set can be obtained.

Another possibility is to exploit texture. One of the most disturbing effects on powder data is

the corruption of the relative intensities as a result of a preferred orientation of the crystallites

in the powder. However, this effect can also be used to advantage. By measuring several

powder patterns of a carefully textured sample at different orientations, more intensity

information can be extracted (Bunge, Dahms & Brokmeier, 1989; Cerny, 1996; Lasocha &

Schenk, 1997; Wessels, Baerlocher & McCusker. 1999). The extraction and treatment of

overlapping reflections is done in such a way that the result is a pseudo-single-crystal data set,

which can be used in a conventional structure solution procedure (e.g. Patterson or direct

methods). The structural model can then be verified with a Rietveld refinement.

For the straightforward application of Patterson or direct methods, atomic resolution is

required. Unfortunately, the crystallites in a polycrystalline material are often full of defects

and the scattering power of such a powder is poor. The intensities of the reflections fall off

dramatically with decreasing d-values (increasing 28), and only relatively low-resolution data

(i.e. d-values larger than a bond distance) can be obtained. Some special low-resolution

approaches in reciprocal space have been developed. These include the application of the

Sayre equation (Dorset, 1997), the use of the tangent formula derived from Patterson

arguments (Rius, 1993), and the use of the maximum entropy method (e.g. Gilmore, 1996).

However, structure solution at low resolution cannot yet be done routinely.

1.3.2 Structure determination in direct space

An alternative to performing structure solution in reciprocal or vector space, is to use structure

solution strategies in direct space. The crucial point for this method is the active use of

chemical information to compensate for the lower information content of the powder pattern

(overlapping reflections and the lack of data at higher resolution). For example, for organic

molecules, the connectivity is almost always known. Chemically reasonable trial structures

can be generated (by hand or by computer) without reference to the powder data (Harris et al.,

1994). From the trial structure, a powder pattern can be generated, and a figure-of-merit

calculated from the match between the calculated and the observed powder patterns.

Thousands or even millions of such models can be generated and evaluated by computer.

5

Different techniques for the modification of a model are available. For example, the Monte

Carlo method modifies the model randomly (Harris et al., 1994), simulated annealing is a

more sophisticated Monte Carlo approach, where a structural fragment can be translated and

rotated through the asymmetric unit (David, Shankland & Shankland, 1998; Andreev & Bruce,

1998) or a genetic algorithm based on survival-of -the-fittest rules derived from life sciences

(Shankland, David & Csoska 1997; Harris. Johnston, & Kariuki, 1998) can be used. These

rather time consuming approaches have become possible with the increase in computer power

available. Fast computers are able to generate millions of trial structures within a reasonable

time frame and a number of structures have been solved using the methods mentioned above.

Nevertheless, for complex structures (e.g. more then 15 atoms per asymmetric unit or more

then 15 free torsion angles), the number of parameters exceeds the limits given by the

computer power currently available.

A way out this dilemma is to combine methods in reciprocal space with those in direct space.

One example of such a combination is the FOCUS approach (Grosse-Kunstleve, McCusker &

Baerlocher, 1997), which was especially developed for zeolite structures. Here diffraction data

are used in a Fourier recycling loop and the chemical information is used to interpret the

Fourier maps automatically. Other techniques where reciprocal and direct space methods are

combined are Patterson search methods where a Patterson map generated from a model

modified in direct space is compared with that calculated from the measured data (e.g. Stout &

Jensen, 1989). This comparison can also be done in reciprocal space (e.g. Rossmann & Blow,

1962). It is also possible to use a combination of both (Rius & Miravitlles, 1987).

1.4 Zeolites

Zeolites are microporous materials with three-dimensional four-connected framework

structures with the composition T02 (where T is a tetrahedrally coordinated atom such as Si,

Al, Ga, P etc.). The tetrahedrally coordinated T-atoms are connected to four neighboring T-

atoms via oxygen bridges to form a framework with channels and/or cages, which arc filled

with cations, water, and/or organic species. These guest species can be exchanged or removed,

and this is a crucial property for the application of zeolites in industrial processes (e.g. as

adsorbents, exchangers, molecular sieves, catalysts). Usually, zeolites can only be synthesized

in form of a polycrystalline material, so their structures have to be determined without the

benefit of single-crystal diffraction data. Structural information is essential to the

understanding of a zeolite"s technologically important properties, so there is considerable

interest in developing powder methods for zeolite structure analysis. Interest in extending the

6

limits of zeolite structure determination from powder data was the initial motivation for this

project.

1.5 Periodic minimal and nodal surfaces

Various classes of crystal structures can be described using periodic minimal surfaces (PMS)

(Anderson, Hyde, Larsson & Lidin, 1988) or periodic nodal surfaces (PNS) (Schnering &

Nesper, 1987). Both can be used to highlight chemical, physical, or structural properties, and a

PNS can be generated using just a few parameters (a PMS generation requires more

sophisticated mathematics). In the case of zeolites, for example, a PNS can be used to delineate

the form of the framework. That is, all of the T-atoms lie on the surface, so the channel system

and/or cages of the zeolite can be clearly discerned. If such a surface could be generated for an

unknown zeolite structure, solving the structure would be reduced from a three-dimensional to

a two-dimensional problem (i.e. the decoration of the curved surface).

1.6 Overview of the project

The starting point of this study was the investigation of the feasibility of using PNS in the

determination of complex zeolite structures. Unfortunately, preliminary studies revealed no

rules for the generation of the PNS that would be appropriate for subsequent decoration.

However, the investigation did show that a different kind of PNS, which enveloped the zeolite

framework structure could be generated using just the information in the powder diffraction

pattern. Once this had been established, this "structure envelope" was used in combination

with direct-space algorithms to accelerate the structure solution process.

7

2 Structure envelopes

2.1 Periodic Minimal Surfaces (PMS) and crystal structures

Periodic minimal surfaces (PMS) were first derived by Gergonnc, Riemann, and Schwarz

(Schwarz, 1890) in the last century. These surfaces are the simplest of the hyperbolic surfaces

and are defined as having a mean curvature of zero at each point on the surface. That is,

K, +k2 = 0 (2-1)

where iq and K2 are the principal curvatures or maximum curvatures of opposite sign at one

point. They are described in differential geometry as objects in non-Euclidean space. Neovius

(1883) discovered that certain PMS are related to one another by the Bonnet transformation,

which bends the surface without stretching it. A PMS transformed in this way is also a PMS,

and several new PMS have been so derived.

In 1976, Scriven ( 1976) suggested that PMS could serve as models for liquid crystal structures.

This idea was further developed by Larsson, Foutell and Kragh (1980). Later, the relationships

between minimal surfaces and solid crystal structures were recognized. For example, a PMS

can be used to separate the interpenetrating networks in Cu20 or ice VII (Mackay, 1979) or to

indicate the diffusion pathways in crystal structures (Andersson, Hyde, Larsson & Lidin,

1988).

The relationship between PMS and zeolite structures was first recognized by Mackay (1979).

Hyde (1993) later established that the atoms of a framework structure lie on or near a minimal

surface. That is, a zeolite framework can be considered to be a decoration of a two-

dimensional non-Euclidean object. Even the Bonnet Transformation can be applied to

transform a PMS decorated by a particular zeolite structure to a PMS describing another

zeolite structure. For example, the crystal structures of analcime and sodalite are connected by

such a transformation of decorated surfaces.

2.2 Periodic Nodal Surfaces (PNS) and crystal structures

Unfortunately, the determination and mathematical description of the PMS for an arbitrary

space group is a nontrivial exercise. However, the concept of periodic nodal surfaces (PNS),

introduced by von Schnering & Nesper (1987). provides an alternative to the complicated

mathematics. These surfaces can closely resemble PMS and have the advantage of being

8

somewhat easier to calculate. While they no longer have the elegant description within non-

Euclidean space, they do provide a straightforward link between hyperbolic surfaces and

common crystallographic formalisms. They can be generated using a few structure factors with

I •» * ^

magnitudes \F(h) and phases a(h) in a Fourier summation over all equivalents of a few h

(von Schnering & Nesper, 1991).

t}(!) = J2 F(/o|cos[2ic(/! t-)-a(/01 (2-2)

Equ. 2-2 is used to produce a density distribution r)(x). Normally, the structure-factor

I * Iamplitudes \F(h)\ are simply set to an arbitrary value of 1.0. The points at which the density is

zero describe the PNS. Such a surface can (but does not necessarily) closely resemble a PMS.

In this way the well-known Gyroid PMS (Fig. 2-1), for example, can be approximated if the

structure factor amplitudes LF{110}I and IF{TlO}l are set to 1.0 and the phases a{H0}and

a{ lTO) to 7t/2. However, in most cases, a PNS is not an approximation to a PMS. A PNS is not

even necessarily a hyperbolic surface.

Figuie 2-1 : Simulation ol the gyioid pciiodic minimal surface (PMS) usinga periodic nodal suitace (PNS) calculated using equation (2-2) with

\F{ 1 10}!= ÏF{ 170}I = 1 and a(l 10) = a( l70) = nil

Von Schnering and Nesper (1987) provided many beautiful examples of the description of

structures using PNS. For example, PNS can be used to characterize phase transitions (Leone,

9

1998), they can depict a Coulomb zero potential surface in ionic structures, and they can

describe pathways of ions in an ion conducting process (Fig. 2-2).

In all these cases, the PNS are used to describe different features of materials with known

structures, so the structures could be used as a guide in the calculation of the appropriate

surface. Such surfaces can be very useful to emphasize features of a structure which cannot be

discerned immediately or to overlay a chemical structure with a surface that highlights a

certain property (e.g. equipotential surfaces).

2.3 From a PNS to a crystal structure?

If a PNS is closely related to a crystal structure, it would appear that the PNS should contain a

substantial amount of information about the structure. Thus, if the form of the PNS were

known, it could be used to facilitate the determination of an unknown crystal structure. The

question is whether or not it is possible to generate the appropriate PNS when the structure is

not known and only a powder diffraction pattern is available.

In an attempt to answer this question, a series of known zeolite structures (Table 2-1), which

are known to lie on or near PNS (Hyde. 1993), was investigated. It was hoped that some

objective criteria for the selection of reflections to be used in the summation (2-2) to create an

10

Name* Space group {hkl} IE(hkl)l Phase(°)

ANA la3d 112 1.02 0

APD Cmca 131#

021#

1.07

1.52

0

DDR R3m 101#

003

1.01

0.53

0

180

DOH P6/mmm 11I#

002

021

0.77

0.53

1.20

0

180

180

EDI P4m2 010#

001#

1.44

0.72

0

0

FAU FcBm lll# 0.98 0

GOO CZZA i U0#

111#

002

1.48

0.53

0.78

0

270

180

LEV P3m 012#

110#

1.07

0.94

0

180

MF1 Pnma 011#

102#

301#

200

020

1.19

1.40

1.29

1.16

1.29

180

0

180

180

0

PAU Im3m 033

134

1.95

1.13

180

0

RTE Cllm 110#

111#

201

1.13

1.06

1.01

180

0

180

SGT Mfamd 121#

116

L.50

L.43

0

180

SOD Im3m 011

002

0.91

0.49

0

180

VFI Formern 221#

010#

002

1.49

1.49

1.70

180

180

0

Table 2-1 Zeolite structures tested and the refleetion(s) used to generate the structure

envelope. '•Three-letter codes taken Irom the Atlas ot Zeolite Structure Types. (Meier,01son& Baerlocher, 1906), #Rcflection used to define the origin

_ ll_

appropriate PNS could be established. In each case, a surface that fitted the structure could be

generated, but a set of generally applicable rules for the reflection selection could not be

discerned.

It was obvious that the reflections and their symmetry equivalents must describe all

dimensions of reciprocal space, and that their indices should be low, but the exact combination

needed to generate the surface that best described the framework could not be determined

without using the structural model as a guide. In some cases it was even found that

systematically absent reflections yielded the best surface, (e.g.{100} for sodalite, which is

body-centered cubic). Furthermore, some structures were found to fit a surface with a higher

density level better than they did the surface at zero density. Consequently, the concept of

using the data from the powder diffraction pattern to create a surface that could be decorated

with a 3-connected net was abandoned.

2.4 Generation of a structure envelope

Fortunately, during the testing phase described above, an alternative approach became

apparent. By assigning the correct phases to the strongest low-order reflections, a well-defined

PNS could be generated. For a better estimation of whether a reflection is really strong, the

structure factors F{h) were transformed to normalized structure factors E(h) and these were

used in the summation:

p(x) = ^JE(h)\cos\2n(ti x)-a(ti)\ (2-3)>

h

Within a small tolerance, the PNS connecting points where the density p(x) is equal to zero,

calculated from the roots of this equation, and using only a few strong reflections, was found to

separate the framework atoms of the zeolite test structures from the void space. To verify the

general validity of this observation, a number of zeolites with different symmetries were

examined (Table 2-1). In all cases, the framework was found to lie on just one side of the PNS.

A few examples are shown in Fig. 2-3.

These surfaces, which enveloped the zeolite frameworks, did not resemble PMS, but they

could be generated in a rational manner from the powder data, and they did have the

potentially exploitable property of partitioning the unit cell into regions where atoms were

likely to be found and those where they were not. For CsCl, for example, the PNS generated by

a summation over {100} (Space group Pnßm) using \ElO0\ = 1 and a100 = 0 in Eqn. 2-3

resembles the well-known PMS P-Surface (Schwarz, 1890). This surface (Fig. 2-4a) separates

12

the anions fiom the cations (1 e the zero potential smlaee) That is, the smface has a strong

relationship to the ciystal stiuctuie Howevei, the 100 icllection m the X-tay diffiaction

pattern is extiemely weak If, on the othei hand, 1 10, which is the stiongest of the low-oidei

leflections, is used, a diffeient PNS is pioduced (Fig 2-4b) In this case, the cations and the

anions aie located on the same side of the suiface

The summation (2-3) can be viewed as a seveiely tiuncated Founei seues, and the lesultmg

density distiibution as a veiy-low-iesolution election-density map Howevei, a true E-map

would be calculated using all leflections up to a ceitam <r/-spacing including the E0a0 term, and

not just a handiul of stiong leflections Ihe PNS simply sepaiates the legions of high election

density from those ot low election density In this sense, it snmlai to the molecular envelope

13

used m piotem ciystallogiaphy (Biicognc, 1976) to define the appioximate boundaiy between

the piotem molecule and the solvent in a low-iesolution election-density map (see, fot

example Coulombe & Cyglei, 1997 oi Subbiah, 1993 and tefeiences fheiem) Of couise, the

numbei of leflections needed to genet ate the PNS descnbed heie is significantly lowei than

that used m the piotem case and the suiface does not necessanly have a closed form However,

the two aie closely îelated, so the teim stiuctuie envelope" (Biennei, McCuskei &

Baeilochci, 1997) has been adopted foi this paititionmg PNS

The stiuctuie envelope îeduces the space m the as\mmetnc unit in which the atoms of a ciystal

structme aie likely to be located by a factoi of appioxunately two and its shape imposes seveie

geometnc constiamts on the atomic auangements possible It was hoped that such a stiuctuie

envelope could be used to lacihtate the deteiruination of unknown ciystal structmes for which

no single ciystals aie available

2.5 Reflection selection for the calculation of a PNS

To put the idea into piactice, the limitation of powdei diffiaction data (oveilappmg reflections)

had to be taken into consideration In 12 of the 14 zeolite sttuctuies tested (Tab 2-1), the

leflections used to geneiate the PNS piesent no difficulties They aie the strongest low-index

leflections and they aie at least 0 5 FWFIM (full width at half-maximum) fiom neighboring

leflections, so then intensities could be extiacted leliably fiom the powdei pattern An

advantage of using low-mdcx leflections is the fact that they tend to he m the low-angle (high

14

rf-spacing) region of the powder diffraction pattern, which is less prone to reflection overlap.

In the cases of MFI and PAU, alternatives would have to be considered. Either the relative

intensities of the overlapping reflections would have to be estimated using other methods (e.g.

David, 1987, 1990; Jansen, Peschar & Schenk, 1992; Cascarano, Favia & Ciacovazzo, 1992;

Estermann & Grämlich, 1993; Hedel, Bunge & Reck, 1994), or a different set of reflections

would have to be used. Since there are usually several reflections with \E\ > 1 in the high d-

spacing region of the diffraction pattern, this is not a problem. Fortunately, the exact set of

reflections used to generate the surface proved to be not too critical.

A few rales-of-thumb for the selection of reflections emerged from the preliminary tests:

(1) The lEl-valucs of the reflections selected should be strong (usually \E\ > 1).

(2) The d-values of the chosen reflections should be in the same range as the expected electron

density fluctuations (e.g. the pore size of a zeolite, or the "thickness" of an organic

molecule).

(3) All directions in reciprocal space must be repiesented.

The selection of reflections is demonstrated for the RUB-3 structure (RTE-topology). In Table

2-2, the first ten symmetry-independent reflections are listed. For the envelope calculation,

three strong reflections are needed. Withm these ten reflections, four reflections have an \E\

value larger then 1.0 and are suitable for the envelope generation. Correct phases were

assigned to these reflections and and two possible combinations of three reflections selected

arbitrarily. They were used to calculate two different surfaces, one from the set {110}; {1 lT},

-15

{021} and the other from the set {110}, {111}, {201}. The two resulting surfaces are shown in

Fig. 2-5. Although the surfaces look quite different from one another, in both cases the T-

atoms of the framework are located on only one side of the surface.

hkl d -value IE(h)l

110 9.70 1.48

001 7.25 0.62

200 6.88 0.58

020 6.83 0.79

llT 6.28 1.42

201 5.63 0.04

HI 5.42 0.01

02Ï 4.97 1.27

220 4.85 0.49

201 4.53 1.34

310 4.35 0.98

Table 2-2 Listing ol the first ten symmetry-independent reflections of the RUB-3 pattern

2.6 Application to non-zeolite structures

To test whether or not the concept of structure envelopes could be applied to other classes of

materials, a few organic and inorganic structures were also examined. As for the zeolites, the

atoms were found to be situated on only one side of the curved surface. The envelope

generated using the strongest low-index reflections separates regions of high electron density

from those of low electron density, whatever the chemical composition of the material.

The higher the fluctuations in electron density within the asymmetric unit, the easier it is to

generate the structure envelope. That is, fewer reflections are needed. From this point of view,

zeolite framework structures with their large voids are ideal, but the difference between

bonding and non-bonding contacts in organic compounds is also sufficient for the generation

of a useful structure envelope. To illustrate this fact, the envelope calculated for the organic

molecule Cimetidine (Hädicke, 1978) using only four reflections is shown in Fig. 2-6. All non-

hydrogen atoms of the molecule lie on the positive side of the surface.

For non-centrosymmetric cases, it can be useful to use centrosynimetric projections to limit the

values of the phases to 0° and 180°. In this way Periodic Nodal Lines (PNL) which are the

-16

two-dimensional equivalents of PNS, can be generated within the centrosymmetric planes. The

PNL for a cyclic tetramer of a beta peptide (Seebach et al., 1997) generated from the 220 and

310 reflections in the non-centrosymmetric space group 14 is shown in Fig. 2-7. The presence

of the ring structure and the approximate location of the methyl groups are easily discerned

from the shape of the PNL.

Figure 2-6 The crystal structure of the organic molecule

Cimetidine with the structure envelope calculated using four

strong low-order reflections

17

3 Solving the phase problem for structure envelope generation

3.1 Introduction

If a few strong non-overlapping reflections are present within the low-angle region of the

powder diffraction pattern, the selection of reflections suited for the structure envelope

generation can be done in a straightforward manner. The major problem is the estimation of

the phases for these reflections. Tn some cases, the phase problem can be overcome by using

only origin-defining reflections. Depending on the space group, up to three reflections can be

selected and their phases assigned arbitrarily. Sometimes these reflections alone suffice to

produce a useful structure envelope (see Tab. 2-1). The number of reflections needed for the

envelope calculation is dependent not only on the space group, but also on structural features.

Given the same symmetry, the higher the electron density fluctuations within the unit cell, the

fewer reflections needed to calculate the partitioning surface. Thus, zeolites, with their large

pores, arc particularly well-suited for this approach. For example, the framework structure of

AIPO4-D (Fig. 2-3d) can be enveloped by a surface generated using only the two origin

defining reflections. In this case, just two strong reflections in the low-angle area of the

powder pattern provide surprisingly detailed structural information.

Nevertheless, further phases must usually be determined. Either fewer reflections are needed

to define the origin than are required for the calculation of the envelope, and/or the strongest

reflections cannot be used as origin defining reflections (e.g. structure semi-invariants). This

phase determination is not trivial. For a routine phase determination by direct methods, atomic

resolution is needed. However, the degree of reflection overlap in a powder pattern increases

with diffraction angle, so only in the low-angle (low-resolution) region can the intensities be

extracted reliably. Of course, if this were not the case, structure determination from powder

data would not be a problem. Consequently, alternative techniques that are less dependent on

resolution are required. For example, two approaches have been described:

(1) Maximum Entropy (e.g. Gilmore, 1996). The structure factors of a basis set of reflections

{H} are used as constraints to generate an entrop-maximized map. Fourier transformation

of the map then allows, the amplitudes and phases of other reflections outside the set {H}

to be estimated (extrapolation). A certain number of phases in the basis set are permuted to

generate several possible sets, which are the nodes in the phasing tree. As a figure-of- merit

for the correctness of a phase set, the "likelihood gain", which is indicative of the

agreement between the observed and calculated structure amplitudes of reflections not

included in {H}, and the student-t-test are used.

18-

(2) Modified tangent formula (Rius, 1993; Rius, Sane, Miravitlles, Amigo & Reventes, 1995a;

Rius, Sane, Miravitlles, Gies, Marier, & Oberhagemann, 1995b; Rius, Miravitlles &

Allmann, 1996; Rius, Miravitlles, Gies & Amigo 1999). Phases of a randomly phased set

of structure factors can be refined using a modified tangent formula derived from Patterson

function arguments. This formalism is similar to that used in a Patterson search for finding

the orientation of a given strucural fragment. Instead of rotation angles, the phases of

normalized structure factors are refined. This shows a stable behavior even under

conditions of low resolution (about 2.2 A). The best refined phase sets are selected using

conventional combined figures-of-merit (CFOM).

Both are multisolution approaches. With the amount of computer power now readily available,

the generation and examination of thousands of phase sets is possible within a reasonable time

frame. Besides the space group ambiguity, the major problem is that the lower the resolution

and the number of reflections, the more difficult it is to find criteria to identify the best phase

set. Common figures-of-merit used in structure-determination algorithms developed for higher

resolution data are often inappropriate for very low-resolution data. Consequently, other

criteria must be found.

Currently, the likelihood gain mentioned above seems to be one of the most powerful figures-

of-merit for low-resolution data. It has yielded impressive results in the evaluation of electron

density maps in protein crystallography, where the resolution is much lower than the distance

between two carbon atoms. Another criterion can be the "peakiness function" (Stanley, 1986),

where the integral of the cubed electron density should reach a maximum for the map

generated with the best phase set. An alternative approach to evaluating electron density maps

is to judge them by eye. This can be quite a powerful method if the appearance of structure

fragments (e.g. small organic molecules) is known.

Despite advances in finding new criteria, the techniques currently available do not provide a

general solution to the problem of low-resolution data.

For the structure envelope generation, phasing approaches that function with a relatively small

number of structure factor amplitudes at very low resolution are needed. The combination of

direct methods (Sayre equation) with the pseudo-atom technique has been succesfully applied

to very-low-resolution electron-diffraction data from proteins (Dorset, 1997). An adaptation of

this method to powder diffraction data in combination with a multisolution approach (phase

permutation) appeared to be a promising avenue to achieve a reliable phase determination of a

19

few structure factors with strong amplitudes. This would allow the generation of a correct

structure envelope that could facilitate the determination of of an unknown crystal structure.

20

3.2 The Sayre equation

In 1952, a paper was published by Sayre with the title: "The Squaring Method: a New Method

for Phase Determination" (Sayre, 1952). For a structure of identical and non-overlapping

atoms, the electron density p(x) and its square p"(x) arc quite similar. The only difference is

that the peaks of the latter function are sharper then those of the former. The structure factors

SCI*

of the "squared" structure F (h) can be calculated using the equation

Fiq(h) = Q(h)F(h)(3""1}

where Q(h) is a function describing the change in the atomic shape from the true to the

squared atom. Using this concept, Sayre derived the fundamental relationship

F(|) = -^..pr^F^-fc) (3-2)

Q(h)V I

where Fis the volume of the unit cell. By applying this equation, it is possible to calculate each

structure factor F(h) from all other structure factors having a triplet relationship with F(h).

Furthermore, the phase of a structure factor can be obtained from other structure factors, and

this is the basis for phase extension from a starting set of phases.

An advanced form of the Sayre equation was developed for crystal structures consisting of

two kinds atoms by adding a cubic term (Woolfson, 1954). Woolfson demonstrated its validity

for a one-dimensional test structure, but so far no useful application to a three-dimensional

structure has been reported.

In most cases, the Sayre equation is transformed into the Sayre~Hugb.es equation (Sayre 1980),

where the structure factors are replaced by normalized structure factors. Strictly speaking, the

Sayre equation is valid only for structures of identical and resolved atoms, but it holds

reasonably well over a large range of conditions (Glover et al. 1983). Sayre (1972) showed that

the atomic shape function 6 can be modified to compensate for data incompleteness. This

equation, in combination with other approaches, has proved to be a powerful tool in the

structure determination of polymer structures from powder data (Dorset, 1996), or even

protein structures from electron-diffraction data (Sayre, 1972; Dorset, 1997).

33 The Pseudo-atom method

To use the Sayre equation as a phase extension tool for data with much lower then atomic

resolution, the data have to be transformed m such a way that atomic resolution is simulated.

21

The assembly of globular subunits in a protein, for example, can be treated as pseudo-atoms

for the normalization of the observed electron diffraction intensities (Harker, 1953). It was

demonstrated by Dorset (1997), with data from bacteriorhodosin, that a multisolution approach

via the Sayre-Hughes equation could be used to estimate phases to 6 A resolution. A two-

dimensional pseudo-atom with the carbon scattering curve was used to describe the projection

of a-helices along their axes in a two-dimensional projection of the unit cell. To compensate

for the difference between the diameter of a carbon atom and that of an a-helix. the unit cell

axes were scaled down by a factor of 10.

Figure 3-1 Simulated powder patterns (CuKa) ol the zeolite ITQ-t structure (thin black line)and the corresponding pseudo atom structure (thick gray line). Each tetrahedron of ITQ-1 is

replaced by a pseudo-atom with 30 electrons. In the low angle area (up to 4 Â, 22 °26), the

patterns are roughly the same. At higher 28-angles. the intensities of the pseudo-atom structure

pattern decrease faster, because the scattcnng curve of the pseudo-atoms falls off faster than

does that of an SiO 4 tetrahedron.

The pseudo-atom strategy can also be used to facilitate structure determination from powder

data. The Si04 tetrahedra in a zeolite, for example, can also be treated as pseudo-atoms. The

diameter of the pseudo-atom corresponds to the distance between two Si atoms. If the powder

pattern of the original structure is compared with that of the pseudo-atom structure (Fig. 3-1),

it can be seen that the intensity relationships are roughly the same in the low angle region. At

higher angles (<7-values smaller than the diameter of the pseudo-atoms), the intensities in the

powder pattern of the pseudo-atom structure decrease faster than do those of the real structure.

By assuming that the structure consists of pseudo-atoms and omitting data with d-values lower

TO

than the pseudo-atom diameter, the requirement of atomic resolution is met and the pseudo

atom structure can be solved by direct methods. The pseudo-atom approach is well suited for

the estimation of the phases of low-resolution data for structures consisting of one kind of

building unit (e.g. octahedra, tetrahedra). Fragments in organic molecules whose non-

hydrogen atoms have a similar scattering power (e.g. C. N. O), can also be replaced by pseudo-

atoms with an appropriate diameter.

3.4 Phase permutations

For phase extension, a basis set of reflections whose phases are known (from the origin

defining reflections and enantiomorph) is defined. Additional phases can also be obtained by

other techniques. For example, phase information of reflections from a particular projection of

a crystal structure might be obtained from a Fourier transformation of a HRTEM (high-

resolution transmission electron microscopy) image. Any additional phases in the starting

phase set have to be permuted. The disadvantage of this method is the dramatic increase in the

number of possible phase sets with the number of permuted reflections. In the centrosymmetric

case, sign permutations are sufficient. So, if N phases are permuted systematically, 2 phase

sets are generated (e.g. 16384 permutations for 14 reflections). From each starting set, a phase

extension and a calculation of a figure-of-merit have to be performed. This is very time

consuming, but for fast computers still feasible within a reasonable time. However, for a non-

centrosymmetric structure, permutation of fourteen acentric phases using quadrant

permutation (tc/4, 3n/4, 5n/4, 7rc/4), the computer must calculate and evaluate billions of phase

extensions, and that exceeds the computer capacity of most laboratories. An alternative to

systematic permutation, is the use of a random number generator to produce the starting phase

set. Given a sufficiently large number of random sets, phase space can be sampled rather

exhaustively. Currently this is the favored technique for large sets of reflections.

If the phases are permuted systematically, all points in phase space, which has the dimensions

of the number of phases permuted, are visited. If only a portion of the points are to be visited,

it is of interest to sample the space in steps large enough to be efficient, but small enough to

see all relevant features. If, for example, the phase space is filled with closest-packed spheres

of a certain size, each sphere covers a piece of phase space, and a visit to each sphere is the

most efficient way of sampling the phase space.

At least two techniques have been developed to perform such efficient sampling.

(1) Magic integers. This method was first used by White and Woolfson (1975) and later

23

refined by Main (1977). By using the Fibonacci sequence as "magic integers" 16385

systematically permuted phase sets, each consisting of 14 signs or of quadrant phases from

7 non-centrosymmetric reflections, can be reduced to 128. The gain in efficiency is

obviously considerable. The magic integer representation of the phases is implemented in

several widely used computer program e.g. in the MULTAN (Debaerdemaker, Tate, &

Woolfson, 1985), XTAL (Hall, King & Stuart, (1995), and SHELX (Sheldrick, 1993).

(2) Error-correcting codes (ecc's). Where data in digital form are transmitted, errors occur and

to keep these to a minimum, ecc's were developed (Hamming, 1947; Shannon, 1948;

Golay, 1949). These codes are widely used and are implemented in modern digital

telephones, CD-devices, and receivers for satellite signals.

3.4.1 Sampling the phase space with error correcting codes (ecc's)

An error correcting binary code consists of a subset of 2 combinations of n binary digits (0 or

1) among the 2n possible n-bit words (codewords). If an information source only uses the

codewords of a given ecc, a receiver can cheek whether or not an n-bit word received is a

legitimate one. If the received codeword is corrupted, the codeword in the code, which differs

in the fewest places (closest codeword) from the received word is used to correct the error.

Since transmission errors, in which the fewest bits have been corrupted, are the most likely to

have occurred, this correction is usually quite successful.

A very important group of the ecc's are linear. They consist of the linear span of the 2 linear

combinations of k //-dimensional binary vectors (generators) formed with coefficients 0 or 1

under modulo 2 arithmetic (Fig. 3-2). The code is denoted by [n, k, d], where d is the

"minimum distance" (Hamming distance) of the code (i.e. the lowest number of differences

between any two codewords of the code).

The Hamming [7, 4, 3| code can be produced using a generator matrix. Each binary value in

the ecc is obtained from the product under modulo 2 of the generator matrix and the matrix

from all combinations of four binary digits (Fig. 3-2).

An error correcting code can also be used as the basis for an efficient phase-permutation

procedure (Bricogne, 1997). Woolfson (1954) demonstrated such a method (called

permutation synthesis), in which sixteen combinations of signs of structure factors were

generated for 7 centrosymmetric reflections. This was done in such a way that none of the 128

possible combinations of the signs differed from the sixteen combinations in more than one

place. In the sixteen resulting two-dimensional Fourier maps, expected features of a structure

24

00 0 0 0 0 0 0 0 0 0

00 0 1 1110 0 0 1

0 0 10 0 1 10 0 10

0 10 0 1 0 1 0 1 1 0

10 0 0 110 10 0 0

0 0 11 10 0 0 0 1 1

0 10 1 1 10 10 0 0

mod(2)0 10 0 10 1

10 0 1 I 0 1 0 1 0 0 0 0 1 10 0 0

0 110 0 110 0 10 110 0 110

10 10 J l i o 0 0 1_ 10 1 10 10

110 0 generator matrix 0 11110 0

1110 0 0 0 1110

110 1 1 0 0 I 1 0 1

10 11 0 I 0 1 0 1 1

0 111 0 0 10 111

.1 1 I i 1 1 I 1 I 1 1

Matrix from all combinations

consisiting of four binary digitsHamming[7, 4, 3] code

Figure 3-2 Generation of the Hammmg|7,4,3] code using the corresponding generator matrix.

1 10 0 0 0 0 0 0 0 0 0 0 1 10 1 1 10 0 0 10

1 0 1 0 0 0 0 0 0 0 0 0 0 0 i 1 0 1 1 1 0 0 0 1

10010000000 0 0 101 10 1110 00

I 0 0 0 I 0 0 0 0 0 0 0 0 0 1 0 1 1 0 1 1 1 0 0

1 0 0 0 0 i 0 0 0 0 0 0 0 0 0 1 0 1 1 0 1 1 1 0

10 0 0 0 0 10 0 0 0 0 0 0 0 0 10 110 111

1 0 0 0 0 0 0 1 0 0 0 0 0 I 0 0 0 1 0 1 1 0 1 1

1 0 0 0 0 0 0 0 1 0 0 0 0 1 1 0 0 0 1 0 I 1 0 1

I 0 0 0 0 0 0 0 0 1 0 0 0 I 1 I 0 0 0 I 0 1 10

1 0 0 0 0 0 0 0 0 0 1 0 0 0 1 I I 0 0 0 1 0 I 1

10 0 0 0 0 0 0 0 0 0 10 10 1 1 10 0 0 10 1

0 000000000001 I llllll I lll_

higurc 3-3 Generator matrix ot the Go lay [24, 12, 8] code

OS

(fluorine) were sought, and in one map were found. In the generation of these sixteen sign

combinations, the 7-dimensional phase space was sampled rather completely. Woolfson did

not note the link to the error correcting codes, but his code for the generation of the 16 sign

combinations is the same as the Hamming [7, 4, 3] ecc.

The Golay [24, 12, 8] code (Golay, 1949), one of the most powerful codes, provides a way of

varying the signs of 24 reflections, such that one of the 4096 combinations (of 224 = 16777216

possible sets) will have a maximum of four incorrect signs. The generator matrix for the Golay

[24, 12, 8] code is shown in Fig. 3-3.

The Golay code can also be used to permute the quadrant phases of 12 acentric reflections. In

this case, the phase of each reflection is assigned using two digits rather than one within each

codeword (e.g. 0,0:7t/4; 1.0: 3ir/4: 1,1: 5xc/4; 0,1:77t/4).

A good introduction to this fascinating subject, which is also understandable to non-

mathematicians can be found in Bricogne (1997).

-26

4 Phase estimation using the Sayre equation

4.1 Introduction

The combination of the Sayre equation, the pseudo-atom method and phase permutation using

ecc's has been implemented in the program SayPerm. The program is written in ANSI-C and

all space group information needed is provided by Sglnfo (Grosse-Kunstleve, 1996). The

program uses a multisolution approach, where a large number of phase sets are generated by

phase permutation and, extended and ranked according to various figures-of-merit by

assuming the validity of the Sayre equation. The primary figure-og-merit is the validity of the

Sayre equation itself. It was hoped that the phases of the few reflections required to generate a

structure envelope could be established in this way.

4.2 Data collection and reduction

Before SayPerm can be applied, powder diffraction data must first be collected. Depending on

the absorption coefficient of the material and its tendency towards preferred orientation, either

Debye-Scherrer transmission (capillary) or Bragg-Brentano reflection (flat-plate) geometry

can be used. The positions of the peaks in the diffraction pattern can usually be found with a

second-dcrivative-based automatic peak-search program (e.g. Alexander, 1973). The cell

parameters can then be derived from these positions. To do this, there are a number of different

indexing programs (Visser, 1969; Werner, Erikson & Westdahl, 1985; Taupin, 1989; Boutif &

Louer, 1991), which apply different strategies to the problem. The space group is then

determined from systematically absent reflections. However, because of the overlap of

reflections, indexing and space group determination are often ambiguous and several

possibilities must be taken into consideration. At this stage, the symmetry information and

unit-cell dimensions are used to extract the intensities for all reflections in the pattern. For the

determination of the scale factor, a Wilson plot is used. With intensities extracted from powder

data, the Wilson plot often deviates significantly from the ideal straight line and sometimes

even negative overall displacement (thermal) factors are obtained. In such cases, fixing the

displacement factor at a reasonable value and estimating the scale factor by using this

assumption is recommended (Grosse-Kunstleve. 1996). Because the information content of a

powder pattern is much lower than that of a single-crystal data set, ambiguous results can be

produced. This restricted reliability should be kept in mind in the course of subsequent

evaluations.

For all examples presented in this study, the intensity extraction was performed using the

27

The SayPerm Procedure

'obs

structure factor amplitudes extracted

from the powder pattern

IZ,F -

N FP°mî -ü(Sm20AlVr

obs

MN

B: overall displacement factor

8: Bragg angleA.: wavelength

ff. scattering factor of atom /'

Z. : atomic number of atom j

1|r

' point

Structure factor amplitudes of a

structure consisting of point atoms

'pseudo

\

FT

A

I \ Ppseudo

M/ i

j

^ r^ "

^pseudoStructure factor amplitudes for a

structure consisting of atoms with the

pseudo-atom density curve

phase assignementto a prescribed number of £V>scll(]0 t

I phase permutation

using ecc

phase extension

to next set of reflections using Sayre

equationt

1 output

consistency test

ranking accordingR-value calculation

J

Figme 4-1

28 —

EXTRAC module (Baerlocher, 1990) in the XRS-82 Rietveld refinement package (Baerlocher

& Hepp, 1982). To scale the structure factor amplitudes, the XTAL (Hall, King & Stuart,

1995) module GENEV (Hall & Subramanian, 1995) was used.

4.3 The SayPerm procedure

4.3.1 Data preparation

The whole SayPerm procedure is shown in the flow diagram in Fig. 4-1.The cell parameters,

the space group, the scaled structure-factor amplitudes, and the overall displacement factor are

used as input to the SayPerm program. To satisfy the condition that the data have atomic

resolution and the structure be an equal-atom one, the structure-factor amplitudes must be

modified. First, each is divided by the displacement-factor function e~ and the

variation of the scattering factors with sin 9/A to simulate a point-atom structure. Combining

Eqn. 3-1 and 3-2 gives

F^(h) = ~^F(k)F(h~k)(4-1)

To make this equation valid for lower resolution data, pseudo-atoms having an electron-den¬

sity distribution

,>\ i-bnr) (4-2)

arc used. The parameter b describes the half width of the function (i.e. the size of the pseudo-

atom). From this function, the scattering curve for a spherical pseudo-atom is calculated by a

Fourier transformation of the density curve p PH,ut}0

°° *>

°°

,-,f > ^ j ——TC/ï2 j

, f (2mit >) ,> . r sin271/? • r 2,> 1Kb)

r„,.„j. = p c d r = 4K \ p,„ .,. . -—rdr= ——e

(4-3)Jpseudo }ypseudo J rpseudo

7t > ^ 1^

J

The point-atom data are multiplied by this curve to give F(h)->

FS(t(h) on the same scale, the squared pseudo-atom

on a relative scale. To obtain

P(r-r7.

=e(-2hnr) (4-4)1 v

'pseudo

is Fourier transformed to obtain its scattering curve

r w C~mh i) > f sq su\2nh-r 2 ,> 1p e d r = 471 p

,v—— r d r= -——— e

J 'pwndo J fpseudo , J > F>u fü2nh r *j2b>Jh

1 ,i

and is multphed by the point-atom data.

29

The diameter of the pseudo-atom can be varied in the program and should correspond to the

diameter of the building units of the crystal structure being described by the pseudo-atom.

Furthermore, the pseudo-atom diameter should not be smaller than the minimum d-value of the

data.

4.3.2 Phase extension

The prepared set of reflections is divided into a basis reflection set {Hi} and up to three

additional sets. The basis set consists of structure factors for which the phases are either known

(from origin defining reflection rules, enantiomorph's) or assumed to be known. The latter are

generated by phase permutation. The number of phases permuted depends upon the computer

power and/or on the permutation method. For example, if the phases are permuted using an

ecc, only a prescribed number of phases can be permuted. From a basis phase set {H]}, the

phases are now extended to a second phase set {Kt} using the equation

sei > 1_,

» » > (4-6)

An lvalue describing the discrepancy between the calculated Fs4Sayre

(structure factor

amplitudes of the squared pseudo-atom structure calculated using the Sayre equation) and the

"observed" robs

(structure factor amplitudes of the squared pseudo-atom structure derived

from measured data) is then calculated.

-i^

sq

T,

sq

(4-7)R = —

F

sq

- FS(]

obs Sayre

1fU'

"1(1,

ob\

The |F' (000)1 's are excluded from this calculation. If this i?-value is lower than the value

defined in the input file, the sets {K]} and {Hj} are merged to form a new phase set {H2}. The

phases from {H2} are then again extended to the next phase set {K2}. This procedure is

repeated until each subset of structure factors is phased.

4.3.3 Phase set evaluation

Once all structure factors have been assigned phases, a consistency test follows. Here again,

each phase (excluding the o.d.r.'s and enantiomorph) and structure factor amplitude is

calculated from all other structure factors and the phases are modified if nessessary. This is

repeated for several cycles until the Z?-valuc has converged. This /lvalue is the main criterion

used to rank the resulting phase sets. The best reflection sets are listed in a file, and Fourier

maps are then calculated.

30

The selection of the best phase set follows. If there are several "best" phase sets with similarly

low R-values, each must be evaluated. A good indicator in the centrosymmetric case is the

ratio of negative to positive signs. If no atomic positions are expected at the origin, this ratio

should be about 1:1, otherwise large electron densities will appear at the origin of the Fourier

map. Phase sets can also be discarded if the corresponding Fourier maps contradict known

chemical information (e.g. the experimentally determined size and dimensionality of a

zeolite's channel system, or the approximate shape of an organic molecule). A few test

applications of the SayPerm algorithm are described in the following sections.

4.4 Test structure RUB-3 (RTE topolpgy)

4.4.1 Data measurement and preparation

Powder data from a calcined sample of RUB-3 were kindly provided by Marier, Grünewald-

Lüke, & Gies (1995). They solved the stracture originally by model building and combined it

with a Rietveld refinement.

chemical composition S124^4 8

lattice parameters a = 14.039Â: b == 13.602A;c == 7.428Ä; ß = 102.22°

space group C 21m

asymmetric unit 3 Si, 6 0

Tabic 4-1 Data for the ciystal stiucture of RUB-3

The lattice parameters and the space group (Cllm) were determined from the powder pattern,

and then the structure factor amplitudes were extracted and the scale factor was determined to

be k = 0.47. For this calculation, the overall displacement factor was fixed at B = 2.5Â2.

4.4.2 SayPerm input fde

The input file used for the SayPerm run is shown in Fig. 4-2. The lines beginning with "#" are

comments and are not read. In the first two lines, the space group (SpaceGroup) and the unit

cell parameters (uniteell) are defined.

In the next line, the number of grid points along the a, b, and c directions for the generation of

a Fourier map are defined (GndDimensions). The distance between the grid points should be

about 0.5À. The next three values define the number of the unit cells in each dimension in

which the map should be generated.

The MaxRvaiue line gives the maximum 7?-value (Eqn. 4-7) allowed after a phase extension

from one subset or a group of subsets to another subset. If the R-value exceeds this value, the

31

# RUB-3 measured data

UnitCell 14.098 13.670 7 .431 90.000 102.421 90.000

SpaceGroup C2/m

#

GridDimensi on 24 24 16 1 1 1

#

MaxRvalue 30.0 30.0 50.0

Fscale 0.47

TempFac 2.S

PseudoAtom 3.1

PermCode hamming

#

# h k 1 F(hkl)| phaso

0 0 0 0 720.00 0.0

0 1 1 0 421.103 0.0

1 0 0 1 217.007 0.0

1 2 0 0 272.476 0.0

1 0 2 0 307.76b 0.0

0 i 1 -1 390.415 0.0

2 2 0 -1 80.467 0.0

2 1 1 1 15.969 0.0

1 0 2 1 323.793 0.0

1 2 2 0 138.311 0.0

1 2 0 1 503.206 0.0

1 3 1 0 177.443 0.0

1 2 2 -1 371.249 0.0

1 1 3 0 322.760 0.0

1 3 1 -1 372.827 0.0

2 t 3 -f L69.782 0.0

2 2 0 J 22.89 1 0.0

1 0 0 2 372.082 0.0

2 1 3 1 7.07L 0.0

2 1 1 -2 133.503 0.0

1 2 0 -2 244. L68 0.0

2 3 1 L 254.073 0.0

1 4 0 0 387.085 0.0

1 0 4 0 408.797 0.0

2 4 0 -1 227.077 0.0

2 1 1 2 425.507 0.0

2 3 3 0 208 .931 0.0

2 0 2 2 101.400 0.0

2 2 -1 -2 282.814 0.0

2 3 3 L 164,560 0.0

2 3 L - 2 16.248 0.0

2 0 4 Ï 2 62 .324 0.0

2 4 2 0 2 6 6.816 0.0

2 2 4 0 251.641 0.0

2 4 2 -1 258.818 0. 0

2 2 0 2 130 .077 0.0

2 2 4 -1 342 .254 0.0

2 4 0 1 US. 684 0.0

2 1 3 -2 16.4 52 0.0

2 4 0 z. 275.2 93 0.0

2 3 3 1 212 .73 5 0. )

2 2 4 J 112 .067 0.0

Figure 4-2 SayPerm input file of RUB-3 test calculation, data up to a resolution of 2.73Â were used

— 32

calculation using the current permuted phase set is interrupted and discarded and the next

phase set is generated. The first value is the highest Ä-value allowed for phase extension from

the first two sets (labelled 0 and 1 in the reflection list) to the subset with the flag 2. The

second it-value is the highest value allowed after the extension from subsets 0, 1 and 2 to

subset 3, and the third for phase extension from the first four to any remaining subsets. An

appropriate choice of these values can drastically reduce the run time. The scale factor

(Fscale) is defined in the next line. This factor is usually taken from the Wilson plot and used

to scale the unsealed structure amplitudes. The value for displacement factor B (TempFac)

should be the same as that assumed for the scale-factor estimation. PseudoAtom defines the

diameter (in Ä) of the pseudo-atom and is defined by Jl/nb where b is the ^-parameter from

Eqn. 4-2, and Permcode the code to be used for the permutation. The options for the latter are

hamming, golay 01" permsynth.

A doubled Hamming |7, 4, 3J code is used to generate 256 phase permutations of fourteen

(instead of seven) centrosymmetric structure factors. For the doubled Hamming [7, 4, 3] code,

all combinations of the codewords of two single Hamming [7, 4, 3] are generated (16 = 256).

A maximum of two signs are incorrect if all 214 = 16348 sign combinations are compared with

the 256 codewords of the doubled code. This doubled Hamming [7, 4, 3] code is initialized

with the word " hammi ng ".

In the next lines, the reflection data are given. The first column of the reflection line contains

the flag defining the subset. The phases of reflections having the flag "0" are fixed. This is

used for the phases of F(000), the origin defining reflections (o.d.r.) and any other structure

factors with known phases. The phases of the reflections with the flag "1" are to be permuted.

The first phase extension is done to the reflections labeled "2". the second extension to those

labelled "3" and the third to those with a flag higher than "3". The other values in the reflection

line are the indices hkf the unsealed structure factor amplitudes, and the phase. Unknown

phases can be set at an arbitrary value. They will be replaced in the output file by the phases

obtained from the phase extension.

4.4.3 SayPerm run

The first 41 reflections from the reflection list for calcined RUB-3 (Figure 4-2) were used. This

corresponds to a resolution of 2.73 Ä. The 110 and llT reflections were selected as origin

defining reflections, in view of the low number of reflections, the permutations were

determined by the doubled Hammingl7. 4, 31 code and only one phase extension from a

reflection subset (0 and 1) was carried out to another subset (2). Because the allowed .R-values

-33

(MaxRvalue) were set relatively high, a phase extension was calculated for each permutation.

Fig. 4-3 shows the i?-value obtained for each phase permutation and extension. The lowest R-

value was found for the 112th permutation.

0 80

0.60 -

g 0 40i

0 20

•>.

'»Vf %*

V'-/* *.

. . .a. •.

• *

.%•»

«V /*

a #J

0 00

*•. V*. :•'

/50 100 150 200

permutation number

250

Figuic 4-3 A'-\alues ol all 256 peitnutations and extensions toi RUB-3 The bestÄ-

valuc A' = 0 192 was obtained fi om the 112th peimutation set (marked by the arrow)

Figuic 4-4 RUB-3 stiuctuie and isosuiface at 807r ot the highest Founei peak of the

map calculated itom the estimated phases

34

h k / psq i/viZ F(k)F(h~-k) (j)Sayre Tcorrect

0 0 0 2400.615 2400.615 0 0

1 1 0 575.490 582.169 0 0

0 0 1 285.055 287.220 TC Tt

2 0 0 354.354 2 3 8.677 Tt Tt

0 2 0 399.674 271.777 71 Tt

1 1 _1 497.718 432 .403 0 0

2 0 -1 99.629 31.200 0 0

1 1 1 19.544 29.622 0 Tt #

0 2 1 384.331 475.402 0 0

2 2 0 162.53 5 194.740 Tt Tt

2 0 1 574.250 5 6 6.662 0 0

3 1 0 198.574 2 18.898 Tt TC

2 2 -1 415.354 410.33 0 0 0

1 3 0 360.158 333.409 Tt Tt

3 1 -1 405.402 3 2 9.878 Tt Tt

1 3 -1 17 6.731 92.459 0 Tt #

2 2 1 23.605 3 8.53 6 n Tt

0 0 2 373.211 461.221 0 0

1 3 1 7.066 12.63 6 0 0

1 1 -2 132.636 64.020 0 Tt #

2 0 -2 240.451 351.053 0 0

3 i 1 244.464 208.09 1 0 Tt #

4 0 0 373.023 246. 833 Tt 0 #

0 4 0 391. 687 232.624 Tt Tt

4 0 -1 217.006 265.841 Tt TC

1 1 389.456 322.833 0 0

3 3 0 190.911 2 6.2 60 Tt 0 #

0 2 2 91.9 05 44.376 7X Tt

2 2 -2 251.664 195.532 TC Tt

3 3 -1 146.106 13 4.42 6 0 0

3 1 -2 14.3 42 41.804 0 0

0 4 1 229.733 178.880 Tt Tt

4 2 0 232.340 193.470 0 0

2 4 0 218.133 111.747 0 0

4 2 -1 223.499 2 69.067 Tt Tt

2 0 2 108. 820 22.770 Tt 0 #

2 4 -1 282.519 215.324 Tt Tt

4 0 1 9 6.3 52 32.845 Tt 0 #

1 3 -2 13.330 72.315 Tt Tt

4 0 -2 217.956 i.i J. J . Z 2) J TC Tt

3 3 1 167. 131 143.109 Tt Tt

2 4 1 85.264 128.357 Tt Tt

Table 4-2 Result of the phase extension from fourteen permuted phases for the RUB-3 structure

factors, The R-value is calculated from the values m column four and five. Rows marked with a "#"

have incorrectly estimated phases

4.4.4 Results

The set having the lowest R-value was selected, and a Fourier map was calculated using the

pseudo-atom structure factor amplitudes and the extended phases. An isosurface at 80% of the

highest Fourier peak value structure is shown with the RUB-3 in Fig. 4-4. The T-atom

35

positions are readily apparent. The different diameters of the isosurface enclosures are caused

by the imperfect phase estimation from such low resolution data. By reducing the isovalue for

the isosurface, the bond directions between the T-atoms become visible, and the approximate

positions of the oxygen atoms can be detected. In the selected reflection set, comparison

between the structure factor amplitudes of the squared pseudo-atom structure and the

amplitudes calculated from the Sayre equation gives an ft-value of 0.192 (Fig. 4-3).

All permutations and evaluations of the reflection set took 15 min CPU time. Out of 39 phases

(41 minus two o.d.r.), 32 were determined correctly (Table 4-2). This is sufficient to determine

the topology of the framework, so, of course, a structure envelope could also be generated very

easily from a few strong reflections. This was an encouraging result, but RUB-3 is a simple

structure (three T-atoms in the asymmetric unit). For a more complicated one, such a similarly

reliable phase estimation cannot be expected. However, because the phases of the strong

structure factors are likely to be estimated correctly, it should still be possible to generate a

useful structure envelope.

4.5 Test structure ITQ-1 (MWW topology)

4.5.1 Data measurement and preparation

The first structure with the topology MWW (aluminosilicate MCM-22) was derived by model

building from high-resolution electron micrographs, and refined with synchrotron powder

diffraction data by Leonowicz et al. (1994). Later Camblor et.al (1998) described the synthesis

and refinement of the pure silica MWW-type zeolite ITQ-1 (Table 4-3). Those data, collected

on the Swiss Norwegian beamline (SNBL) at the European Synchrotron Radiation Facility

(ESRF) in Grenoble were also used for the SayPerm test calculations. The intensities were

extracted and the scale factor determined.The displacement factor was fixed at B = 2.0 À".

chemical formula S172OJ44

lattice parameters a= 14.209Ä, c = 24.969Ä

space group P6/mmm

asymmetric unit 8 Si, 13 0

Table 4-3 Data lor the structure of ITQ-1

4.5.2 SayPerm run

Data up to a resolution of 2.7 À were input into the SayPerm procedure. The 72 reflections

were divided into 4 subsets for a stepwise phase extension (Fig. 4-5). Using the doubled

Hamming [7, 4, 3] code, 256 phase permutations were examined and a Fourier map was

36

# Ti tie ITQ--1 measured data

Uni tCe Ll 14 .209 14 .209 24.969 90.000 90.000 120.000

Spa ceGroup P6/mmm

#

Gri dDimension 2t 2 8 48 1 1 1

#

MaxRvalue 0 3 0.3 0. 3

Fscale 0 68

TempFa 2 0

PseudoAtom 3 1

PermCode hamming

#

# h k 1 |F(hkl) [ phase

0 0 0 0 2159.00 0.0

2 0 0 1 79.40 0.0

1 0 0 2 286.83 0.0

1 1 0 0 191.91 0.0

1 1 0 1 15 8.63 0.0

0 1 0 2 17 0.2 6 0.0

2 0 0 3 96.48 0.0

2 1 1 0 36.14 0.0

2 1 0 3 45.73 0. 0

2 1 1 1 63.74 0.0

1 0 0 4 3 3 4.73 0.0

2 1 1 2 88.35 0.0

1 2 0 0 158.75 0.0

2 2 0 1 91.30 0.0

2 1 0 4 62.53 0.0

1 2 0 2 118.59 0.0

2 1 1 3 52 .66 0.0

2 0 0 5 72.97 0.0

2 2 0 3 54.22 0.0

2 1 1 4 0.00 0.0

2 2 1 0 41.7 6 0.0

2 1 0 5 63.80 0.0

2 2 1 1 32.86 0.0

1 2 0 4 102.92 0. 0

2 2 1 2 66.53 0.0

1 0 0 6 221.67 0.0

1 3 0 0 138.91 0.0

2 1 1 5 88.87 0.0

2 2 1 3 27.44 0.0

1 3 0 1 142.54 0.0

1 1 0 6 130.48 0.0

1 3 0 2 215.27 0.0

2 2 0 5 6 1.02 9.0

1 2 1 4 13 6.3 9 0.0

2 3 0 3 3 3.35 0 . 0

2 1 1 6 52.3 7 0. 0

1 0 0 7

•

264.90 0.0

Figure 4-5 SayPcrm input tile for the TTQ-1 test calculation, data up to resolution of 2.7Â were used

calculated from the reflection set having the lowest R-value.The run was repeated using the

Golay [24, 12, 8] code with reflection flags changed accordingly (24 phases were permuted).

Here the word golay is used in the permcode line of the input file.

o>/

h k / Fï? l/V^F(hF(h-k) ^Sayre rcorrect

0 0 2 307.953 352.458 Tt Tt

1 0 0 205.859 143.586 Tt TC

1 0 1 168.873 224.816 0 0

1 0 2 177. 171 132.834 0 0 odr

0 0 4 328.0/5 1 ^ 3.2 n 5 Tt Tt

2 0 0 155.040 1 o 5 .213 Tt Tt

2 0 2 112.353 9 2.5 8 6 Tt Tt

2 0 4 89.014 108.875 0 0

0 0 6 186.646 2 79.695 Tt 0 #

3 0 0 116.02 8 184.017 0 0

3 0 1 118.159 1L4.132 Tt Tt

1 0 6 106.482 30.600 0 Tt #

3 0 2 174.429 L44.45Ï Tt Tt

2 1 4 107.398 (•3 .875 0 Tt #

0 0 7 2 02.07 6 120.578 It TC

Table 4-4 List of reflections for ITQ-1 with permuted phases. The fixed phased odr is indicated. Rows

with a "#" mark the incorrect phases. The calculated and "observed" values (fourth and fifth column)of reflections used to generate a structure envelope generation should agree reasonably well

4.5.3 Results

The highest electron densities were found very close to framework atoms (Fig. 4-6). Four T~

atoms can be seen directly, but if the structure were unknown, it would not be possible to

construct the framework structure by surveying the Fourier map alone. The same test was

repeated using the Golay [24, 12, 81 code. Here more phases (24) were permuted. That is, the

phase space was sampled with a smaller stepsize. In the case of ITQ-1, this more accurate scan

did not provide phases that allowed a more meaningful Fourier map to be generated. Of the 69

stracture factor phases, 23 were incorrectly determined. However, all strong low-order

reflections, which would be used to generate a stracture envelope, were phased correctly

(Table 4-4). A very informative structure envelope (Fig. 4-7) could be calculated from just

four reflections, and this was subsequently used to limit the search volume in a direct space

structure determination procedure (Sect. 6.3). Evaluation of the second best phase set showed

that the corresponding Fourier map, generated using all structure factors, could also be

considered to enclose the ITQ-1 structure, but it had a shift of origin relative to the Fourier map

of the best reflection set. Thus, fixing the origin with origin defining reflections does not

guarantee that all maps calculated from the different phase sets will have the same origin.

Fourteen permuted structure amplitudes have more power to fix the origin than do the few

odr's. Because the phase sets do not necessarily have the same origin, phase statistics (e.g the

most likely phase for a given structure factor estimated from its frequency of occurrence in the

best phase sets) cannot be applied in a straightforward manner.

38

Figure 4 6 ITQ I stiuctuie and the isosuilacc at 80% ol the maximum

Founei peak The highest densities aie located at the positions ol loui Si-

atoms

Figuie 4 7 Stiuctuie tmclopc loi ITQ 1 calculated liom the {002},{100} {101} and {102} leflections whose phases weie estimated using

the SayPeim pioccduic

39

4.6 Test structure 6-AlF3


The pseudo-atom approximation should not only work for structures consisting of tetrahedra.

Pseudo-atoms can also be applied to other coordination polyhedra. To test this, a structure

containing octahedrally coordinated Al atoms was investigated.

In 1995 Herron et al. reported the preparation of two new A1F3 structures consisting of corner

sharing octahedra. One of these structures (9-AlF^) (Tab. 4-5) was selected as a test example.

unit cell formula A1|6F48

lattice parameters a= 10.184 Â, r = 7.172 A

space group P4nmm, choice 2

asymmetric unit 4 Al, 7 F

Table 4-5 Strutural data (or 9 AIF\

No experimental data were available, so a powder pattern was simulated. The relevant

parameters for the simulation are listed in Tab. 4-6. The peak shape in the simulated pattern

corresponds approximately to that of data obtained from a laboratory diffractometer. From this

pattern, the intensities were extracted and the unsealed structure factor amplitudes determined.

In this way, peak overlap could be taken into account. The scale factor (k = 1.8) was estimated

using a fixed overall displacement factor (B = 3.5 Â~).

20-range 3 - 50°

stepsize 0.02°

polarization ratio 1.0

FWHM = U+VtmQ + lTtan""8 (7=0.005; U =0; W=0

Asym = al + tfVtanG + a^/tan"9 a; = -0.005; a2 = 0.003; a3 = 0

profile peak shape pseudo-Voigt

peak range in FWHM 10

Lorentz fraction 0.5

Table 4-6 Parameters used to calculate the powder pattern ot the 0-A1P2,

4.6.2 SayPerm run

The input file is shown in Figure 4-8. The reflections 201 and 310 were selected to be the

odr's, and the doubled Hamming [7,4,3] code was used to permute 14 phases (label 1). The

pseudo-atoms used had a diameter of 3.4 A. which is approximately that of an A1F6

40

octahedron. The permutation having the lowest i?-vaiue (R= 0.202) was selected and a Fourier

map was calculated from the corresponding structure factors.

# Title A1F3 Chem. Mater. 1995, 7, 75-83

#

UnitCell 10.184 10.184 7.173 90.000 90.000 90.000

SpaceGroup P4/nmm .2

#

GridDimension 28 28 M 2 2 2

#

MaxRvalue 0 22 0 .22 0.22

Fscale 1 857

TempFac 3 5

PseudoAt om 3 4

PermCode hamming

#

# h k 1 JF(hkl)| phase

0 0 0 0 639.955 0.0

2 1 1 0 34.22 0.0

2 0 0 1 16.22 0.0

1 1 0 1 73.52 0.0

1 2 0 0 91.52 0.0

1 1 1 1 66.06 0.0

0 2 0 1 169.79 0.0

1 2 1 1 117.84 0.0

1 2 2 0 131.32 0.0

2 0 0 2 55.5 8 0.0

1 1 0 2 143.86 0.0

0 3 1 0 149.67 0.0

1 2 2 1 160.06 0.0

1 1 1 2 103.59 0.0

1 3 0 1 166.76 0.0

1 3 1 1 44.11 0.0

1 2 0 2 86.05 0.0

2 2 I 2 22.96 0.0

1 3 2 1 40.48 0.0

2 4 0 0 31 .54 0.0

2 2 2 2 3 6.15 0.0

1 3 0 2 53 .14 0.0

3 3 3 0 16.7 6 0.0

3 4 0 1 11.79 "1 AJ . \J

4 3 1 2 15.75 0.0

1 0 0 3 109.22 0.0

4 4 1 1 17 .64 0.0

4 1 0 3 16.67 0.0

Figure 4-8 SayPerm input file for the 0-AlF^ test calculation Reflections up to a resolution of 2.5 À

were used.

4.6.3 Results.

The maximum densities of the map were be found at or near (+0.5Â) the positions of the Al

atoms (Fig. 4-9). Only seven, relatively weak reflections of the 25 input were phased

incorrectly (Tab. 4-7). This would be quite a good starting point for the calculation of a

41

h k l psq \/V^F(k)F{h-k) ^Sayre ^correct

0 0 0 824.995 824.995 0 0

1 1 0 21.683 27.932 0 0

0 0 1 10.270 13.681 Tt 0 #

1 0 1 44.473 60.372 Tt Tt

2 0 0 52.929 25.42 0 Tt Tt

1 1 1 3 8.177 48.385 Tt Tt

2 0 1 89.562 107.2 66 0 0

2 1 1 59.385 67.699 Tt Tt

2 2 0 63.271 93.094 0 0

0 0 2 26.702 71. 2 8 6 0 0

1 0 2 66.029 59.747 0 0

3 1 0 65.819 60.025 0 0

2 2 1 70.337 53.219 0 0

1 1 2 45.424 40.348 Tt Tt

3 0 1 70.011 64.33 7 0 0

3 i 1 17.692 22.627 0 Tt #

2 0 2 34.440 47.48 8 0 0

2 i 2 8.779 12.23 3 Tt Tt

3 2 1 14.158 9.660 0 Tt #

4 0 0 10.547 3 0.3 87 0 0

2 2 2 12.053 n c; g cj Tt 0 #

3 0 2 16.927 8.32 6 0 Tt #

3 3 0 5.115 11.750 0 0

4 0 1 3.596 3.023 0 0

3 1 2 4.793 4.605 0 0

0 0 3 33.120 28.557 0 Tt #

4 1 1-L. 5.140 11.415 Tt Tt

1 0 3 4.829 7.13 3 0 Tt #

Table 4-7 List of the phaseset from the SayPerm procedure for e-AlF? with the lowest R-value.

Figure 4-9 Fourier map calculated from all reflections involved in the phasingprocedure. The isosurface has a value of 85% of that of the strongest Fourier peak

42

stracture envelope followed by structure solution in direct space. The positions of the

remaining F atoms could also be deduced by considering the distances between the Al atoms

4.7 Tri-ß-peptide C32N306H53 (sa322)

4.7.1 Measurement, data preparation, and first attempts at structure solution

The tri-ß-peptide sa322 (Tab. 4-8) is highly insoluble and therefore difficult to recrystallize

(Abele, 1999). Consequently, only a polycrystalline material could be obtained. Powder

diffraction data (Fig. 4-10) were collected in transmission mode (0.3 mm capillary) on a high-

resolution laboratory powder diffractometer (Stoe STADI P) using CuKal radiation and a

small linear position sensitive detector. The crystallinity of the powder was sufficient to obtain

data up to a resolution of 1.8 A (50°29). A subsequent measurement using synchrotron

radiation (SNBL at ESRF) did not provide data of higher resolution. Furthermore, the sample

appeared to change during that measurement. The positions of the peaks moved during the data

collection. Consequently, these data were not used.

Sum formula C^N^OgHsj}

structure formula

AI h 1 H ä H \

Table 4-8 Chemical formulae of the sa322 molecule

The pattern collected using the laboratory instrument was indexed using the program TREOR

(Werner, Erikson & Westdahl, 1985), and could be confirmed with the program DICVOL

(Louer, 1992). With both approaches, a high figure-of-merit, indicative of the correctness of

the cell (61.03Â; 11.18Â; 5.08Â; 90.0°; 90.0°; 90.0°) was obtained. The most probable space

groups (P21212; P2i2121) were established by examining the data for systematically absent

reflections and using the information that the compound investigated is enantiomerically pure

(i.e. centrosymmetric space groups could be excluded from consideration). Assuming a density

of about lg/cm3, it could be calculated that the asymmetric unit would contain one molecule.

To test whether or not the structure could be solved using a direct methods program in a

straightforward manner, the intensities for both space groups (P2i2121 and P21212) were

extracted and input to the EXPO (Altomare et al, 1997) program. A crystal structure

consistent with the chemical information available could not be recognized from the solutions

43

120.0

100.0

80.0

C

2- 60.0

>

«

0)

40.0

20.0

0.0

^•J J\

2.5 7.5

NI 1.

vA_

12.5 17.5

2-theta

llllll mu un II m m l IIIIIIIIIIIII NIM m

_____

_____^^22.5

10.0

9.0

8.0

«2 7.0c0)

>6-°

tu

5.0

4.0

3.0

2.0

I'

*

n iv>

111 u 'j

'„. « .»

iiiiiiiiiii iiiiiiiiiiiiiiiiiiiiiiii iiiiiiHiiiiiii'iiiiiiiiiiiiiwiiniiiiiiiiiiiiiiiiniiniiiiiiuniiiHiiiiiiiiHiiiMiiniiHiiiiiii

27.5 32.5 37.5 42.5

2~theta

47.5 52.5

Figure 4-10Measuted powdei pattern and indexing (61 O^A, 11 18A, 5 08A, 90 0°, 90 0°, 90 0°,

7J21212,)oUa322

44

presented by EXPO. This was certainly caused in part by size of the structure and the low

quality and the insufficient resolution of the data.

4.7.2 SayPerm run and map evaluation

The unit cell obtained from the indexing has quite a long (61.03 A) a-axis and a very short

(5.08 A) r-axis. Thus, it could be expected that the projection along the short c-axis would

show the main features of the structure such as the shape and the packing of the molecules.A

further advantage of such a projection is that both possible space groups have identical

projections (plane group p2gg). That meant that the selection of the space group could be

postponed until further information was obtained.

The full set of extracted intensities for the space group P21212j was used for the estimation of

the scale factor (k = 0.40). Then, the data of the reflections hkO up to a resolution of 2.5Â were

input to the SayPerm program (Fig. 4-11). For the SayPerm run, the centrosymmetric space

group Pnmm, which has the same projection along the c-axis as P2]212 and P21212]_, was

assumed. In the case of organic compounds, it is not usually possible to assign a pseudo-atom

to a particular group of atoms in the organic molecule. Nevertheless, assuming a pseudo atom

with a diameter of 2.5 A (approximately the diameter of a methyl group) can succeed.

Fortunately, the non-hydrogen atoms (C. N, O in sa322) have similar scattering power, so an

equal atom structure could be assumed. This approximation is of course of lower quality than

the tetrahedron or octahedron replacement, but it was hoped that the validity of the Sayre

equation would be sufficient to find a phase combination, from which a two-dimensional

molecular envelope could be determined. 24 phases were permuted using the Golay [24,12,8]

code. Phases of origin defining reflections were not fixed. The input file is shown in Fig. 4-11.

Because the procedure for a permutation phase set is interrupted if the #-value after the first

phase extension exceeds the value specified in the input file (Rvalue), only 81 permutation

sets were evaluated in full. The run required only 10 minutes CPU time to complete the 4096

phase sets.

4.7.3 Results

The 81 sets were ranked by the R-value, and a Fourier map was calculated from the best set. In

this Fourier map (calculated using all reflections), it was impossible to recognize any

relationship between the appearance of the map and a reasonable packing of molecules.

Following the rules described in Sect. 2.5, seven strong reflections (Fig. 4-12) were selected

and a two dimensional structure envelope generated (Fig. 4-13). A chemically reasonable form

45

# Title sa322 STOE measurement, sayperm inputUnitCell 61.010 11 191 5.085 90.000 90.000 90.000

SpaceGroup Pnnm

#

GridDimension 120 20 10 12 1

#

MaxRvalue 0.25 0.25 0.3 0

Fsccile 0.405

TempFac 3 .0

Ps eudoAtom 2.5

PermCode golay

#

# h k 1 JF(hkl)| phase

0 0 0 0 1220.00 0.00

2 2 0 0 82.90 0.00

2 4 0 0 81.42 0.00

2 1 1 0 48.03 0.00

2 2 1 0 68.43 0.00

2 6 0 0 123.57 0.00

1 3 1 0 197.65 0.00

1 4 1 0 498.32 0.00

1 5 1 0 367.89 0.00

1 8 0 0 245.88 0.00

1 6 1 0 2 07.2 6 0.0 0

2 7 1 0 74.40 0.0 0

1 8 1 0 291.95 0.00

1 10 0 0 383.80 0.00

1 9 1 0 295.26 0.00

2 0 2 0 26.27 0.00

1 1 2 0 303.23 0.00

1 2 2 0 213.83 0.00

1 3 2 0 167.33 0.00

2 10 1 0 1.41 0.00

1 4 '"i

0 214.29 0.0 0

1 12 0 0 544.82 0.00

1 5 2 0 452.67 0.00

1 11 i 0 313.41 0.00

1 6 2 0 104.32 0.00

1 7 2 0 165.96 0.00

1 14 0 0 137.66 0.00

1 9 2 0 13 0.54 0.00

1 10 2 0 270.27 0.0 0

1 14 1 0 173.46 0.00

1 11 2 0 143 .01 0.00

1 16 0 0 6 5 6.90 0.00

2 12 2 0 151.26 0.00

2 1 3 0 101.55 0.00

3 2 3 0 152.03 0.0 0

3 3 3 0 194.46 0. 0 0

3 4 3 0 122.74 0.00

Figure 4-11 Input file for the SayPerm run for phase estimation of the M0 reflections. Data up to a

rcso lution of 2.5 A were used

46

# Titl

#

61.

e sa322 reflections for envelope generation

010 11 191 5.085 90.000 90.000 90 000

# h k 1 |F(hkl) | phase

# 0 0 0 0 1220 000 0 00

# 2 2 0 0 2 9 293 0 00

# 2 4 0 0 28 770 0 50

# 2 ] 1 0 16 972 ^ 50

# 2 2 1 0 24 180 3 00

# 2 6 0 0 43 664 0 50

# 1 3 1 0 69 841 1 5 0

1 4 1 0 176 085 0 00

1 5 L 0 129 996 0 50

1 8 0 0 86 383 3 5 0

# I 6 1 0 73 2 3 7 ^ 00

# 2 7 1 0 26 >9 0 ot

L 8 1 0 103 163 0 50

1 10 0 0 135 612 3 50

1 9 1 0 L04 332 3 03

# 2 0 2 0 9 283 1 0 0

1 1 2 0 L07 148 3 00

# 1 2 2 3 75 558 T 00

# 1 3 2 0 59 127 0 00

# 2 10 1 0 0 498 ^00

# 1 4 2 0 75 J21 n 00

# L 12 0 0 192 516 0 00

# 1 6 2 0 159 9 5 \ e 5 0

# 1 11 1 0 1 L0 74 6 00

# 1 6 2 0 36 86„ 0 00

# 1 / 2 0 58 643 0 00

Figuie4 12 Beginning of the îeflection list with the lowest R-valuc (0 21) The list is pait of the

SayPeim output fil c The stiuctui e envelope ( Fig 4 13) was calculated fiom the leflections

without #'

Figuie 4 J 3 Two dimensional stiuctuie envelope toi the sa322 molecule

Two unit cells along the b axes aie displa\ed

47

for the sa322 molecule became visible. In this case the structure envelope can be considered to

be a molecular envelope because it has a closed form. This envelope could then be used to set

the molecule at a sensible starting position and to restrain it to a limited region of the unit cell

in a simulated annealing procedure (Sect. 7.5).

4.8 Limitations of the SayPerm Approach

The examples described demonstrate the potential of the SayPerm program. In these favorable

cases, the positions of T-atoms (RUB-3), useful phase information for the calculation of a

structure envelope (ITQ-1 and sa322), and the Al-position (0-AlF-ri could be obtained.

As mentioned at the beginning of this chapter, the Sayre equation holds over a large range of

conditions. However if the approximation is poor, the Sayre equation is no longer valid, so of

course, its validity cannot be used as a sensible figure-of-merit for the best phase set. The

approximation of coordination polyhedra with spherically shaped pseudo-atoms is sufficient

to make direct methods applicable at low resolution (2.5 - 3.0 A) for small-to-medium-sized

structures. The more pseudo-atoms in the asymmetric unit (e.g. the more complex the

structure), however, the lower the quality of the approximation. Tests with veiy complex

zeolite structures (e.g. ZSM-5 with 12 T-atoms in the asymmetric unit) have shown that the

SayPerm approach becomes less reliable as the complexity increases.

Theoretically, the SayPerm approach should work for non-centrosymmetric structures as well.

However, the number of possible phase combinations is much higher than for the

centrosymmetric case, and phase estimations for test cases have not been successful to date. A

number of possible approaches to improving the method to address the problems of non-

centrosymmetric structures remain to be explored. The main problem is to find the best phase

set from the many sets produced, so the development of a more selective figures-of-merit is

necessary to enable the SayPerm approach to handle non-centrosymmetric structures.

In addition to the size of the structure, the chemical composition is crucial to the applicability

of the Sayre equation. For the original form of the Sayre equation (Sayre, 1952) an equal atom

structure is assumed. That means that, the pseudo-atoms should also be equal. This works well

for structures containing groups of atoms with equal or similar scattering factors, but as the

diversity increases, the validity of the Sayre equation decreases. Woolfson (1958) published an

extended form of the Sayre equation valid for structures with two kinds of atoms. He

demonstrated the validity of the extension using a one-dimensional structure. In the scope of

this work, some initial tests with this extension for real three-dimensional crystal structures

48

were carried out. Zeolite structures with different T-Atoms (e.g. Ga and P) were examined,

but no notable improvement in the results of the phasing procedure with respect to those of the

original SayPerm approach could be achieved. Nevertheless, the implementation of

Woolfson's extension to the Sayre equation was left in the SayPerm program.

49

5 Phase estimation by the method of permutation synthesis

5.1 Introduction

Statistical approaches are not feasible if the number of objects used for the statistical

evaluation is too low. Thus, for a phase estimation, the number of reflections involved must

exceed a certain limit. Otherwise, any statistical calculation, however clever it might be, will

fail.

An alternative to statistical arguments is the visual evaluation of Fourier maps, generated using

different phase combinations. If the size and shape of structure fragments are known, objects

having an appropriate shape can be sought. Any chemical information such as the connectivity

of a molecule or a zeolite's pore size can be used in this process. Unfortunately, multisolution

approaches can produce millions of phase sets, so, evaluating by eye is not reasonable. The

selection of the correct phase set must be automated using direct space arguments (e.g.

Stanley, 1986). It would be a challenge to develop a computer algorithm that could replace a

visual map evaluation, and there are encouraging developments in pattern matching

algorithms. However, many problems must be solved before usable algorithms for

crystallographic purposes are available. An alternative way is the evaluation of each phase set

by a subsequent search for the structure in direct space. This concept is used in the FOCUS

approach (Grosse-Kunstleve, McCusker & Baerlocher, 1997). Here the program searches for a

reasonable zeolite framework within the peaks of the Fourier maps generated from randomly

phased structure factors.

As shown in Chap.2, a few reflections with the correct phases are sufficient to describe the

coarse features of a crystal structure. If the phases arc unknown but some features of the

structure can be assumed based on chemical knowledge, there is a chance that the correct

phase set can be found by permuting the phases of just a few reflections and evaluating the

resulting maps. The more phases permuted, the more distinct the features of the stracture in the

map. Ecc's can be used to allow a higher number of phases to be permuted while keeping the

number of maps to be evaluated low.

Woolfson (1954) permuted seven phases using the Hamming [7, 4, 3] code to confirm a

projection the of the p-nitroaniline crystal structure. This approach was called "permutation

synthesis". The Hamming [7, 4, 3] code has 16 phase combinations instead of 128, which

would be produced by a systematic phase permutation. From one of these 16 maps, the

structure projection could be recognized easily. The permutation synthesis has even been

50

applied to obtain a low-resolution structural characterization of a protein structure (Glycos,

1998). Unfortunately, permutation synthesis is not suited for non-centrosymmetric cases,

because the number of maps produced would be too large if the phases of a sufficiently high

number of structure factors were permuted. However, a centrosymmetric projection could be

useful, if the important features can be recognized from the projection.

The advantage of a permutation synthesis is that phase information can be obtained for just the

few reflections from which the structure envelope is generated. For other phasing methods, a

larger number of reflections (up to a certain resolution) are required and this can be an

handicap if the peak overlap is severe. The same rules used for the reflection selection for the

generation of a structure envelope (Chap.2) can be used. The only one difference is that in the

case of a permutation synthesis, the number of reflections is fixed, depending on the ecc used

for the phase permutation. For example, the Hamming [7, 4, 3] code has seven signs per

combination, and this corresponds to the number of the permuted centrosymmetric phases. A

maximum of three odr's are used for the map generation. With the Hamming [7, 4, 3] code, 16

phase sets arc produced and consequently sixteen maps must be examined. In favorable cases

the correct map can be recognized. The permutation syntheses has been implemented in the

SayPerm program.

5.2 Test structure Cimetidine

For historical reasons, Cimetidine has become a standard example for demonstrating new

approaches to structure determination from powder data (e.g. Cernik et al., 1991). It was first

synthesized by Haedicke et al. (1978) and its structure determined from single crystal data

(Table 5-1).

chemical formula H3C

\ H

/^ /\ ^\ /N\ ^N,+Ch6

HN T ||\===N N-CN

lattice parameters 10.699Â. 18.818Â, 6.825À, 111.284°

space group P21/>/asymmetric unit IOC, 6N. 1 S, 16 H

Table 5-1 Data tor the Cimetidine structure

The Cimetidine molecule is quite flexible, with nine free torsion angles. Therefore, structure

determination in direct space is a challenge (e.g. Csoka, 1997). A structure envelope could be

quite useful in limiting the number of possible conformations and orientations of the molecule

51

in trial structures. Attempts to determine the phases at a resolution of about 2.5 Â using the

SayPerm approach failed, because the organic structure does not consist of building units that

can be approximated by a single spherically shaped pseudo-atoms. Furthermore, the scattering

power of the sulfur atom and the carbon or nitrogen atoms are too different to be treated as a

single kind of pseudo-atom. Thus, the permutation synthesis approach appeared to be an

appropriate one.


The powder pattern used for the extraction of the intensities was provided as a test powder

pattern with the EXPO package (Altomare et al, 1997). Reflection intensities were extracted.

A normalization of the structure factor amplitudes was not necessary because only the ratio of

the strongest structure factor amplitudes influences the result of the calculations.

# Cimetidine st rongest extracted reflections

Unite ?11 14.09 8 13. 670 7.43 1 90.000 102.421 90.000

Space

#

OnctD

jroup P2/n

intension 24 24 L6 2 2 2

PermC

#

#

ode permsynth

h k ] |F(hkL) | phase

1 -1 2 1 362.791 0.0 00

1 0 2 1 390.237 0.000

1 2 0 0 490.322 0.000

1 1 L 1 342.079 0.000

1 2 L 0 342.975 0.000

0 -1 4 1 469.635 0.000

0 0 4 1 912.867 0.000

0 1X 3 1 409.342 0.000

J 2 4 0 564.758 0.00 0

1 3 I 0 286.929 0.00 0

Fig ire 5-1 SayPerm input file a permutation synthesis for Cimetidine

5.2.2 Application of the permutation synthesis

For a permutation synthesis, up to 10 strong reflections (odr's plus permuted phases) from the

low angle region of the powder pattern can be used. These reflections should not overlap. The

reflections were selected following the same rules (Sect. 2.5) as for the generation of a

structure envelope. The space group Plln was assumed, and the phases of three origin defining

reflections (141, 041, 131) were fixed to be 0.0. Then seven other reflections were selected for

phase permutation. The input file for SayPerm is shown in Fig. 5-1. To initialize the

permutation synthesis "permsynth" is used in the PermCode line. The program permutes the

phases using the Hamming [7,4.3 ] code and generates a Fourier map for each of them.

52

era

Cr

to

CT

fO

g

ryi

S1

O

Pi-

CD

Ra

s-

3

&

HCi

CT

ST

ito

a

s-

cr

o

er

os

#

iSJ

-J

— 53 —

es so

e

-3

"5

S

5

O

S

1o

<0

60"xf

OS

54

These maps were viewed using the graphics program ISOSURFACE ,a module implemented

in the CERIUS2 (MSI, 1997) package. The fact that chemical structures could be also

generated within the CERIUS2 package was an important reason for choosing to use this

software. The maps resulting from the SayPerm run are displayed in Fig. 5-2 and 5-3.

5.2.3 Selection of the best Fourier map

If three-dimensional maps are to be evaluated, software and hardware that allow a three

dimensional visualization on the screen is highly recommended. The most useful approach is

to look at the isosurfaces. The isovalue should be set such that the coarse features of the

structure can be recognized. For a small organic molecule structure, this value is usually about

50% of the highest positive value on the map. The following criteria proved to be useful to find

the most probable structure envelope:

(1) If only one molecule is expected in the asymmetric unit, the map should show only a single

shape

(2) The envelopes should be packed in an effective manner within the unit cell.

(3) If no atoms are expected at the origin, no density should be present there.

(4) The shapes should be consistent with the known connectivity of the molecule.

If a map fulfilling these requirements can be found, it is probably the correct envelope. The

map should be evaluated for a range of isovalues (e.g. 50% to 70% of the highest map density),

otherwise it is not possible to decide whether or not the isosurfaces tend to describe discrete

entities.

The codewords of the Hamming [7,4,3] code differ in at most one place from one of the 128

possible codewords generated by systematic permutation. Thus, it is possible, that no

acceptable map can be found if the phase of an important reflection (e.g one that strongly

influences the appearance of the map) is wrong. In this case, it can be useful to change the

phases of the origin defining reflections, so that the sign relationships between the reflections

change. This might be lead to a new sign combination that better fits the correct one.

The maps generated by the permutation synthesis for the Cimetidine crystal structure are

shown in Fig. 5-2 and 5-3. The surfaces in all maps have the same isovalue (60% of the highest

density in the map). Maps 1, 2, 3, 9,13, and 16 can be excluded because they show continuous

regions of high density. For the cimetine structure, individual objects can be expected. Maps 4,

7, 11, 12, and 14 can also be discounted, because they show more than one shape, and only a

single shape would be expected. The maps 6. 8 and 15 show objects which cannot

55

accommodate any conformation of the Cimetidine molecule. The decision between the two

remaining maps 5 and 10 is not easy. If the they are viewed in three dimensions (stereo mode)

it can be seen that the packing of the isosurfaces in map 5 is a little more effective (see also

Fig. 5-4), but actually, a definitive decision cannot be made. The phase sets of both maps

should be taken into account in further steps of structure solution.

5.3 Tri-ß-peptide C32N306H53 (sa322)

In Sect. 4.7, the phasing of a few strong low-order reflections for sa322 using all reflections up

to a resolution of 2.5A was described. In this case, where the molecule can be treated as an

equal atom structure, the accuracy of the pseudo-atom approximation for an organic molecule

sufficed for the Sayre equation to be used as a figure-of-merit. All conditions for a successful

application of a permutation synthesis are also given. Therefore, it was used to confirm the

results obtained earlier using the Golay code in the SayPerm program. Information about the

data collection and preparation were given in Sect. 4.7. To obtain results that were more

independent, a slightly different reflection set was used for the permutation synthesis.

5.3.1 Selection of the best Fourier map

All of the 16 maps generated are displayed in the Fig. 5-6 and 5-7. The isolines of densities

lower than 50% of the highest density in the map are not shown. The options for each map are

set at the same values. Following the rules given in Sect. 5.2.3, map 4 appeares to be is a

56

# sa322, inputfil e for the permu tation synthesesUnitCell 61.010 11.191 5.085 90.00 90.00 90.00

SpaceGroup

#

GridDimension

P21/n

120 20 10 12 1

PermCode

#

perms /nth

#

# h k 1 |F(hkl)] phase

1 4 1 0 498.32 0.0 0

1 5 1 0 3 67.8 9 0.00

0 8 1 0 2 91.95 C.5 0

1 10 0 0 3 83.80 0.00

0 9 1 0 295.26 0.00

1 1 2 0 3 03.2 3 0.00

1 4 2 0 214.29 0.00

1 12 0 0 544.82 0.00

1 5 2 0 452.67 0.00

Figure 5-5 SayPerm input file for the permutation synthesis. The origin defining (0 in the first column)reflections were phased according to the phases obtained from the SayPerm ran (Figure 4-12) to obtain

maps with the same origin

promising candidate for a structure envelope for the sa322 molecule. All of the remaining

maps contradict the assumption that the coarse features of a single molecule should be seen in

the map. Either differently shaped objects (maps 5, 9, 14, 16) or continuous regions of high

density (maps 1,2,3,6,7,8, 10, 11, 12, 13, 15) instead of individual objects are present. The

shape and packing of the objects in the fourth map correspond to those of the map generated

using the SayPerm procedure in section 4.7.2.

5.4 Conclusions

In most cases, the connectivity of a small organic molecule is known either from the synthesis

procedure and/or spectroscopic analyses (e.g. NMR). This information can be used to

recognize the appropriate form in a Fourier map. Permutation synthesis corresponds more

closely to the structure envelope approach than does the phase estimation with the Sayre

equation, because, as for the structure envelope generation, just a few strong low-order

reflections are taken into account. For small organic structures, permutation synthesis is a

suitable tool for obtaining information about the molecular packing and the coarse molecular

shape. Unfortunately, solution cannot be achieved in every case, but at least the number of

possible phase combinations can be limited.

For inorganic crystal structures, criteria for map evaluation are more difficult to formulate

because the connectivity cannot usually be predicted as easily as it can for a small organic

molecule. However, it would be interesting to investigate whether chemical information, such

(1)

(5)

(2)

(6)

(3)

(7)

(8)

en

Figu

re5-6The

firstei

ghtFouriermapgeneratedbythepermsynthpr

oced

ure.

The

selectedmolecularen

velo

peismarkedby

ablackframe

(9)

00

|ß^

ffà»«

(16)

b>Ä

<S1»#?|^»i

<

Figure5-7Maps9-16ofthepermutation

synthesesofsa322

59

as, for,l example the pore size or HRTEM images from zeolite structures would suffice to

allow the correct map to be selected.

For low-resolution phasing approaches, the selection of an appropriate starting phase set is a

crucial point. The more phases (in addition to those of the odr's) available, the higher the

probability of a successful phase extension and refinement. Such promising phase sets can be

determined using permutation synthesis.

60

6 Determination of zeolite structures using structure envelopes

6.1 Introduction

To build a zeolite framework model on one side of a structure envelope, an intelligent model-

building algorithm is needed. Chemical information such as the fact that zeolite frameworks

consist of corner-sharing T04 tetrahedra forming a 3-dimensional 4-connected framework, can

be used in an automatic procedure. Grosse-Kunstleve, McCusker & Baerlocher (1997)

presented an algorithm designed especially for this problem. It is implemented as a subroutine

of the program FOCUS (Grosse-Kunstleve, 1996). Given a list of potential atomic positions,

this topology-search algorithm seeks possible framework structures that are consistent with the

space group and the cell parameters. For small structures, the list of potential nodes can be a

simple grid of points within the asymmetric unit, and that is sufficient to find a chemically

reasonable framework. As the size of the structure (i.e. number of T-atoms in the asymmetric

unit) increases, the time required for such a gridsearch becomes formidable. Computers

currently available are not capable of completing such an exhaustive search procedure within a

reasonable timeframe. In the FOCUS program, this difficulty is addressed by implementing a

Fourier recycling loop. It was hoped that by introducing a structure envelope to define the

region within the asymmetric unit in which the framework was to be sought, the topology

search for even more complex structures would become feasible.

6.2 Topology search for zeolite structures with a structure envelope

For an exhaustive gridsearch, each grid point within the asymmetric unit is treated as a

potential node atom (T-atom). The search procedure is divided into two stages. First a list of

potential bonds (bondlist) is prepared for each grid point. Then a backtracking algorithm

operates on these bondlists seeking 3-dimensional 4-connected nets with appropriate

interatomic distances.

To test the effectiveness of using a structure envelope as a mask, an additional subroutine was

implemented in the original algorithm (Grosse-Kunstleve, 1997b).

Only those gridpoints located on the positive side of the structure envelope were considered in

the topology search (in Fig. 6-1, the negative side of the envelope is shaded). This reduces the

the number of possible atom positions by a factor of two. Starting from one Pivot atom (black

point in in Fig. 6-1), a bond list is created with all gridpoints on the positive side of the

envelope and with approximately the correct T-T distance (3.1Ä) from the Pivot atom. In Fig.

6-1 these atoms lie on the dark grey circle. Each point on the positive side of the envelope is

60

6 Determination of zeolite structures using structure envelopes

6.1 Introduction

To build a zeolite framework model on one side of a structure envelope, an intelligent model-

building algorithm is needed. Chemical information such as the fact that zeolite frameworks

consist of corner-sharing T04 tetrahedra forming a 3-dimensional 4-connected framework, can

be used in an automatic procedure. Grosse-Kunstleve, McCusker & Baerlocher (1997)

presented an algorithm designed especially for this problem. It is implemented as a subroutine

of the program FOCUS (Grosse-Kunstleve, 1996). Given a list of potential atomic positions,

this topology-search algorithm seeks possible framework structures that are consistent with the

space group and the cell parameters. For small structures, the list of potential nodes can be a

simple grid of points within the asymmetric unit, and that is sufficient to find a chemically

reasonable framework. As the size of the structure (i.e. number of T-atoms in the asymmetric

unit) increases, the time required for such a gridsearch becomes formidable. Computers

currently available are not capable of completing such an exhaustive search procedure within a

reasonable timeframe. In the FOCUS program, this difficulty is addressed by implementing a

Fourier recycling loop. It was hoped that by introducing a structure envelope to define the

region within the asymmetric unit in which the framework was to be sought, the topology

search for even more complex structures would become feasible.

6.2 Topology search for zeolite structures with a structure envelope

For an exhaustive gridsearch, each grid point within the asymmetric unit is treated as a

potential node atom (T-atom). The search procedure is divided into two stages. First a list of

potential bonds (bondlist) is prepared for each grid point. Then a backtracking algorithm

operates on these bondlists seeking 3-dimensional 4-connected nets with appropriate

interatomic distances.

To test the effectiveness of using a structure envelope as a mask, an additional subroutine was

implemented in the original algorithm (Grosse-Kunstleve, 1997b).

Only those gridpoints located on the positive side of the structure envelope were considered in

the topology search (in Fig. 6-1, the negative side of the envelope is shaded). This reduces the

the number of possible atom positions by a factor of two. Starting from one Pivot atom (black

point in in Fig. 6-1), a bond list is created with all gridpoints on the positive side of the

envelope and with approximately the correct T-T distance (3.1Â) from the Pivot atom. In Fig.

6-1 these atoms lie on the dark grey circle. Each point on the positive side of the envelope is

Figute 6-1 Exhaustive gndseaich limited by tt structure envelope.

treated, in turn, as new Pivot atom, until all possible topologies have been found.

For the tests, a simple circa 0.5A grid with a 0.5 Â tolerance on interatomic distances

(represented by the thickness of the dark grey circle in Fig. 6-1) is used. All calculations were

performed on a Silicon Graphics Solid Impact R10000 computer. The procedure consisted of

the following steps (Fig. 6-2):

(1) A set of origin defining reflections with high lEI-values and large ^-spacing, which do not

overlap in 29 were selected.

(2) If additional reflections were required, their phases were determined using the program

SayPerm.

(3) A structure envelope was generated using these reflections.

(4) An exhaustive gridsearch using the topology search on only one side of the structure

envelope was performed.

(5) The geometries of any topologies found were optimized using a distance least squares

procedure.

(6) If the search was not successful, additional high lEl-value, large <:/~spaeing reflections were

added.

62

Calculation of the

structure envelopeittf

*

1

exhaustive FOCUS grid search

additional

reflections

iAny nets found? no

yes

1Can the framework geometry

be optimized? (DLS)

no

yes

lRefinement of

atomic positions

Figure 6-2 Combination of an exhaustive gudseatch with a structure envelope

6.3 Test examples

Four zeolites structures, A1P04-D (APD topology). Sigma-2 (SGT topology), RUB-3 (RTE

topology), and ITQ-1 (MWW topology), with orthorhombic, tetragonal, monoclinic, and

hexagonal crystal systems, respectively, were used as test examples for the procedure shown in

Figure 6-2. For A1P04-D (APD), two origin defining reflections can be chosen, and the two

with the highest lEI-values and rf-spacings (021 and 131) were selected. These sufficed to

produce a useful structure envelope (Fig. 2-3 d). For Sigma-2 (SGT) the 121 reflection was

selected to define the origin, but the structure envelope generated with just this reflection did

not allow the structure to be found, so the phase of a second strong reflection, 116, was

determined using the SayPerm approach outlined in Chap. 2, and both reflections were then

used to generate a useful structure envelope. The grid searches were completed in 6 min CPU

63

time for APD and in 1.8 min for SGT. In both cases the DLS geometry optimization of the

topologies generated allowed the correct topologies to be singled out very clearly. For

comparison, grid searches were also performed over the whole asymmetric unit without the

structure envelope as a mask, and these took 43 and 123 min CPU time, respectively.

Zeolite Space Number of Number of CPU Time (min)

Topology Group T-Atoms Reflections with Envelope w/o Envelope

APD Cmca 2 2 6 43

SGT 14\/amd 3 2 1.8 123

RTE C2/m 4 3 25 >2880

MWW P6/mmm 8 4 27 >4320

Table 6-1 The effect of using a structure cmelope mask on computing time

In the lower symmetry case of RUB-3 (RTE). more then just the two origin-defining

reflections were required. A list of the ten reflections with the largest û?-spacings is given in

Table 2-2 on page 15. Initially, 110 and llT were selected for the generation of a structure

envelope. Flowever the grid search with this mask did not yield a satisfactory framework

topology. Consequently, a third reflection, 201, was added to the origin defining ones. As an

alternative method to the SayPerm approach, two surfaces were generated: one with the phase

for (201) set at 0° and one with it set at 180°. For the former combination, none of the 56

topologies generated had a satisfactory geometry. For the latter (structure envelope is shown in

Fig. 2-5b), the RTE topology emerged as the only geometrically sensible one out of the 15

generated. The two grid searches with the structure envelope each required 25 min CPU time.

The search without the structure envelope was still not finished after more than 48h.

In Sect. 4.5, the phasing of some strong reflections having large ^/-spacings for the test

structure ITQ-1 (MWW) was demonstrated. The structure envelope generated from just the

origin defining reflection 102 did not allow the structure to be found. New structure envelopes

using more reflections were generated, but the grid searches with these masks did not yield any

reasonable framework topologies. Only when four reflections, 102, 100, 101, and 002

(structure envelope is shown in Fig. 4-7) were used did the grid search yield a sensible result.

Of the 24 topologies generated, only the MWW topology was found to have a satisfactory

geometry. The search was repeated without the structure envelope and was not still finished

after 72h.

-64

6.4 Structure envelopes with the full FOCUS approach

Encouraged by the obvious advantage offered by structure envelopes in an exhaustive grid

search, a more demanding test was performed. The structure envelope mask was used in

conjunction with the full power of FOCUS. The FOCUS program is a combination of Fourier

recycling, topology search, and sorting algorithms.

(1) Fourier recycling. Strong structure factor amplitudes extracted from a powder pattern are

assigned random phases and a Fourier map is calculated. The strongest peaks in this map

are interpreted automatically using the chemical information input. This model (simple

atom assignment, a fragment of a framework, or a complete framework structure) is used

to calculate new phases, which are then assigned to the corresponding observed structure

factor amplitudes. These and the phases, in turn, are used to calculate a new Fourier map.

This is done in several cycles until the phases converge.

(2) Topology search. From the list of peaks from the Fourier map (potential atomic positions) a

list of potential bonds is made and a 3-dimensional, 4-connected framework is sought (as

described in Chap. 6.2). If an exhaustive grid search is to be performed, the peak list is

replaced with a list of grid point coordinates, and no Fourier recycling is done.

(3) Sorting of topologies. The frameworks produced by the topology search are classified and

sorted by evaluation of site multiplicities, loop configurations (Meier & Olson, 1992;

Fischer, 1971), and coordination sequences (Branner, 1979; Grosse-Kunstleve, Brunner &

Sloane, 1996) to decide whether or not two frameworks produced by the search algorithm

are equivalent. For a typical FOCUS run using the Fourier recycling procedure, the

framework that is found most often is usually the correct one.

In contrast to the combination of a structure envelope with a simple grid search where all grid

points outside the envelope were simply excluded from the search, for the combination with

the Fourier recycling loop, those peaks within the envelope were given more weight (i.e.

moved up in the peaklist) than those outside the envelope, but the latter were not excluded

completely. Whether or not a peak is included in the topology search depends upon its position

in the peaklist, because only the highest peaks (to a prescribed level) are used.

This strategy was used to solve the very complex structure ZSM-5 (MFI topology, 12 atoms in

the asymmetric unit, space group Pnma). High resolution synchrotron powder diffraction data

collected on the SNBL at the ESRF in Grenoble were used for the intensity extraction. The

structure envelope was calculated using five reflections, Oil, 102, 301, 200, and 020, three of

which were origin-defining (see also Tab. 2-1). Without the mask, FOCUS found the MFI

topology 50 times in eight hours (no other topologies were found). With the mask, the time for

50 MFI topologies to be found could be reduced to three hours.

6.5 Conclusions

A structure envelope defines a region in the asymmetric unit in which the framework atoms are

likely to be located. This means that a search program has fewer potential atomic positions to

check, and will therefore generate more reasonable models in a shorter period of time. The

search for the correct model becomes much more selective and a large number of calculations

can be saved. In some test cases in which only geometrical knowledge was used (exhaustive

grid search), the structure envelope restriction was crucial to the structure determination (e.g.

RTE, MWW). Other tests on zeolite framework structures demonstrated that the use of

envelopes can reduce the amount of computer time required by as much as two orders of

magnitude (Table 6-1). Weighting the peak positions from a Fourier map using a structure

envelope mask, was also found to reduce the computer time required when Fourier recycling

was applied, and this might be critical for the solution of more complex structures.

66

7 From structure envelopes to organic crystal structures

7.1 Introduction

In view of the very encouraging results obtained for the use of a structure envelope in zeolite

structure solution, their applicability to other classes of materials was considered. From a

structure envelope alone, a structure solution cannot be achieved. The lack of resolution must

be compensated with chemical information such as bond lengths, bond angles, torsion angles

and/or data of higher resolution. So far, in contrast to protein crystallography, the use of

envelopes in combination with direct space search methods has not been used for powder data.

Usually structure determination is attempted either in reciprocal space or in direct space. Both

approaches have advanced rapidly in recent years but neither has become dominant. For this

reason, the development of methods that take advantage of both direct and reciprocal space

concepts is appealing. The program FOCUS discussed in the previous chapter, does just this

for zeolite structure. (Grosse-Kunstleve. McCusker & Baerlocher, 1997).

A structure envelope could be used in structure solution from powder data in the same way a

molecular envelope is used by protein crystallographers. Basically, both fields have the same

major problem: the lack of reliably interprétable data at higher resolution and consequently too

many free parameters for the amount of data available. Although the immediate impression is

that the world of protein crystallography must be completely different from that of powder

diffraction, there are commonalities. In the last few years, there have been some advances in

low-resolution phasing (e.g. Gilmore, Henderson & Bricogne, 1991, Dorset, 1997, Rius,

Miravitlles & AUmann, 1996) and its combination with direct-space structure determination

(Tremayne, Dong & Gilmore, 1998), and these could be relevant to powder diffraction

methodology.

7.2 Direct-space approaches to crystal-structure determination

Instead of determining a structure from the peaks in a Fourier map calculated from reflections

phased in some way (e.g. direct methods), another approach can be applied. An arrangement of

electron densities can be generated without any consideration of the diffraction data. This can

be done without chemical knowledge (electron density modification) (e.g. Subbiah, 1993) or

with chemical knowledge (model building). From the model or the generated electron density

distribution, diffraction data are calculated and compared with the measured data. The better

the two sets match the higher the probability that the model or the electron distribution is

correct. So far, the approach of electron density modification has only been applied to organic

67

macromolecules and single crystal data, but in principle this approach should also be

applicable to other materials and powder data.

The first crystal structure solutions from powder data were done by building models by hand.

With the rapid development in computing power, more automated approaches can now be used

to build models. Not only can models be generated by the computer, they can also be modified

automatically. The diffraction pattern of each model can be calculated and compared with the

observed data simultaneously. In this way, it is possible to build and evaluate thousands or

even millions of models within a reasonable amount of time.

7.2.1 Model generation techniques for organic structures

The technique used for model building depends on the kind and complexity of the structure to

be solved. Generally two approaches are used:

(1) By hand. A chemically reasonable model is built either as a real model made of plastic,

wood or metal or on the computer screen using molecular-modeling software. Some of

these programs allow a fragment to be moved around the cell in real time while the powder

pattern is calculated simultaneously. This allows reasonable models for simpler structures

such as small rigid-body molecular structures to be determined (Seebach et al., 1997).

(2) By computer. The connectivity, bond lengths, and bond angles are input into a computer

program, which is able to move one or more organic molecules around the asymmetric

unit. When the position and orientation of the molecules are similar to those of the real

structure, a good fit between calculated and measured data is achieved. It is also possible to

construct these molecules in a flexible manner, so that orientations of parts of the molecule

can be varied separately by setting flexible torsion angles (David, Shankland, Shankland,

1998, Andreev & Bruce, 1998. Chernychev & Schenk, 1998).

7.2.2 Model modification control

The number of possibilities that have to be evaluated for a structure solution depends on the

complexity of the structure and the previous knowledge about the structure. The aim is to find

a combination of structural parameters providing the best agreement between the measured

diffraction pattern and that calculated for the model. For small problems, it is feasible to

evaluate the possibilities rather exhaustively m a reasonable amount of time. However, if the

number of unknown structural parameters is too high, a more intelligent way of determining

the correct parameter combination must be found. In principle the following, which are also

applied in other fields of science, can be used.

(1) Grid search. A space having the same number of dimensions as unknown parameters is

68-

defined (e.g. three for the position of the structural fragment in the asymmetric unit and

three for its orientation). Then a grid is placed in this space. Each grid point represents a set

of parameters describing a trial stracture. These points can be tested selectively or

exhaustively and each trial structure is evaluated. Thus, exhaustive searches are limited to

relatively small structures. For example, each torsion angle in an organic molecule would

increase the dimensionality of the space, so the number of grid points increases

dramatically with every additional parameter. A grid search has been applied to rigid-body

organic molecules (e.g. Chernychev & Schenk, 1998).

(2) Monte Carlo. Here each structural parameter <j) is modified using a random number m

between 0 and 1

è = m- Aè (7~i)

where A(f> is the range allowed for o. The atoms or atom groups are placed in the

asymmetric unit in a random fashion. This is usually done using chemical knowledge such

as the connectivity of an organic molecule (e.g Harris, Tremayne, Lightfoot & Bruce, ,

1994) or the Coulomb potentials in a structure with charged fragments (Putz & Schoen,

1999). A large number of models are generated, and it is hoped that the structural

parameters of one of these models are close to those of the correct structure.

(3) Metropolis Monte Carlo. Here random numbers also play an important role, but the

structural parameters (f> are not generated from scratch for each new random number m, but

are derived from the previous values. That is

<ta» = told + '» A^ (7-2)

where A<\> is the limit of the change for <]). Tn this way, the parameter <|)oW is changed to §new

and a decision (Metropolis importance sampling technique) is made whether or not this

change will be performed (Metropolis et al., 1953). The criterion used is the value of the

figure-of-merit X (e.g. the agreement between the measured and calculated data or the fit of

the structure to the structure envelope). The parameters are changed from 4>ö/j to §new

if (Xue»<%o,d) or f„<exp^^/^) (7~3)

where n is a random number between 0 and 1 (independent of m) and AT is given. This

second condition enables the system to escape from a local minimum and gives the

Metropolis approach a crucial advantage over minimization using a least-squares

algorithm.

69

(3) Simulated annealing. Simulated annealing involves the same approach as Metropolis

Monte Carlo with the difference that AT, which can be viewed as the temperature in an

annealing process, is slowly decreased during the course of the ran. The probability that

the exponential term in Eqn. 7-3 is larger than the random number n decreases, so fewer

and fewer modifications of the model (Eqn. 7-2) are accepted. At some point, AT is

decreased to a value where the structure becomes "frozen". With an appropriate schedule

for the annealing process, this technique is a powerful method for finding the optimal

combination of structural parameters (David, Shankland, Shankland, 1998).

(4) Genetic algorithm. The genetic algorithm uses strategies borrowed from genetics in life

sciences. First, a "population" of trial structures is generated. From these structures, the

next generation is generated by combining the "genetic material" (structural parameters)

from two parents in the original population to form two "children". From these new

models, the "fittest" (those structures providing the best agreement between calculated and

observed data) are selected as parents for the next generation. Mutations are introduced

randomly to avoid the development of the population to a local minimum. Once a

sufficiently high number of generations have been produced, structural parameters

providing a good agreement between measured and calculated data will be found (Harris,

lohnston & Kariuki, 1998; Shankland, David & Csoka 1997).

(5) Error correcting codes. Theoretically, error correcting codes can also be used to control

model modification (Bricogne, 1997). With the Golay [24,7,4] code, for example, one

could control the variation of torsion angles in a molecule. This approach has not yet been

put into practice, but its potential is obvious.

7.2.3 Comparison of diffraction data

Once a model has been built, diffraction data must be simulated for comparison with the

observed data. There are two principal approaches:

(1) Whole pattern approach. The agreement between the observed and simulated patterns is

calculated point for point. Usually this fit is calculated using the weighted /^-factor {Rwp) or

the goodness-of-fit function %".

(2) Extracted intensities. The observed structure-factor amplitudes are calculated from the

reflection intensities extracted from the powder diffraction pattern. These are then

compared with those calculated for the trial structure. This approach can only be used if

estimated standard deviations (csd's) are available for the extracted intensities (e.g. Sivia

& David 1994), or if overlapping reflections are treated as a single observation. If the esd's

70

are known, the figure-of-merit for the agreement between the simulated and the

experimental data is equivalent to that calculated using the whole pattern approach (Rwp ).

The calculation of the powder pattern for a trial structure is one of the most time-consuming

steps in direct-space approaches to structure determination. For a whole-pattern comparison

the structure-factor amplitudes have to be convoluted with the peak-shape function and the

intensity at each point of the powder pattern calculated. In the case of extracted intensities,

only the structure-factor amplitudes have to be calculated, so the agreement factor can be

evaluated considerably faster (David, Shankland & Shankland, 1998).

All direct-space approaches to structure determination of small organic structures from powder

data are combinations of the methods described above. In each case, models are generated,

modified according to some scheme and evaluated.

7.3 Simulated annealing, fragment search and structure envelopes

The combination of simulated annealing and fragment search has proven to be one of the most

powerful methods for solving relatively complex organic crystal structures from powder data

(e.g. David, Shankland & Shankland, 1998; Andreev & Brace, 1998). The only degrees of

freedom in the search are the torsion angles, the orientation, and the position of the molecule(s)

within the asymmetric unit. The bond lengths and angles are well known from thousands of

other crystal structures and can be included as known chemical information. Though this

approach limits the number of free parameters, it is still a challenge to find the combination of

parameters which describes the global minimum for the profile R-value (Rwp) or the goodness-

of-fit function. Further limitations in the number of tree parameters and/or the range in which

they can vary could be used to advantage. A structure envelope describes the coarse features of

the molecular crystal structure, and could be used to limit the range of variations of most free

parameters. Furthermore, the structure envelope provides information that can be used to find

optimal starting values for the free parameters that arc close to the global minimum of the

parameter space.

7.4 The program SAFE

To implement the structure envelope approach in a simulated-annealing-controlled fragment

search, the program SAFE (Simulated Annealing and Fragment search within an Envelope)

was developed. It is written in ANSI C and linked to the Sglnfo program library (Grosse-

Kunstleve, 1998) for all space-group information. The simulation of the powder pattern from

atomic coordinates and the lvalue calculations are performed by the XRS (Baerlocher &

71

Hepp, 1982) Rietveld package. Fig. 7-1 shows the flowchart of the complete SAFE procedure.

7.4.1 Input

As input, SAFE needs an initial model. This model is either created using a random set of

torsion angles, orientation angles and fractional coordinates, which determine the position of

the molecule, or the model is built using previously obtained information (e.g. stracture

envelope, chemically reasonable hand-built model). SAFE also produces output files that can

be used as input, so a model obtained from a previous run can be used as a starting model for a

new one. The program uses chemical information such as connectivity, known torsion angle

limits, and typical bond lengths and angles. The cell parameters and the space group must be

specified. To control the simulated-annealing process (see Sect. 7.2.2), some parameters such

as the cooling rate are needed. Furthermore, the measured powder pattern and/or the structure

envelope generated, must be input.

7.4.2 Variation of the trial structure

Bond lengths and angles are constrained to be constant, while the torsion angles, the

orientation and the position of the molecule are varied. In this way parameter space is sampled

to find the minimum of a figure-of-merit function. This model modification is described in

section 7.2.2 in point (3) and (4). Each free parameter is changed independently, using its own

random number (m in Eqn.7-2) to generate a new trial structure for evaluation.

7.4.3 Model construction

Because the crystallographic calculations are not performed in this parameter space, the atomic

coordinates must be transformed into those of the crystallographic frame. In the first step, the

coordinates of the parameter space (internal coordinates: /; $; p) are transformed to those of

the orthonormal frame (*£; y£; z„) using a chain-like description. Starting from the origin of

the orthonormal frame, where the first atom of a molecule, consisting of n atoms, is located.

The second atom has a distance / to the first one. The bond between the two atoms lies along

the x-axis. The third atom is connected to the second one (bond length /2). The bond angle (|)2 is

the angle between the first, the second and the third atom. If a fourth atom is connected to the

third one, rp is the torsion angle around the bond between the second and the third atom. In

general :

ll = bond length (atom.(?'-l),atom(/))

<bl = bond angle (atom(/-2), atom(/-1), atom(/))

if = torsion angle (atom(/'-3), atom(/'-2), atom(/-l), atom(/)).

72

Flow diagram for SAFE

XRS

powder data

preparation

CRYLSP

Inputinitial model

SA control parameters

structure envelope and/or

powder data

Move

random torsion angle, position and

orientation variation

T

Construct model

orthonormal and crystallographiccoordinates

ÏTrial structure chemically

reasonable ?

intermolecular

intramolecular d istances

yes

evaluate fitness

powder data envelope

penalty function

Move accepted ?

Metropolis

no

back to previousposition

-no-

lowering of temperatureafter certain number of

accepted moves

yes

Figure 7-1 Flow diagram for the SAFE program

73

The Cartesian coordinates of the n'th atom (x£; y£; z„) in a chain of atoms can then be

calculated using the following recurrent equation first proposed by Annott and Wonacott

(1966)f \

vC

I( = 0

( X\

Il A 0

V, = o )l°l(7-4)

where the transformation matrix A describes the change of orientation of all following bonds,

caused by the torsion angle r\l and bond angle <j)'

-coscV -sino' 0

sin(j)'costV -cos 0'cos p.7 siniy

^ -sin(j)'ship' cos6.'sinr\l cosfl' )

A1 (7-5)

with Z° = 0; (j)°= (j)1=Tc;ri0= ri1 = rp2 = 0. The result of the multiplication of the matrices

A0 to A" gives the rotation matrix which transforms the vector (/", 0, 0) to the bond between

atom(n-l) to atom(w). The orthonormal coordinates ( xc; yc;zc ) are then transformed to

crystallographic ones (xf',yi',:f) via a transformation matrix (e.g Giacovazzo, 1992).

7.4.4 Checking for a chemically reasonable structure

Once the crystallographic coordinates have been calculated, the chemical sense of the model

can be evaluated. Three tests can be performed:

(1) Intramolecular distance. A change in torsion angle can cause non-bonded atoms of the

molecule to come too close to one another.

(2) Intermolecular distance. The molecule changes its position and orientation within the unit

cell, so symmetry equivalent molecules can come too close to one another (< van de Waals

distance). This test is performed only within one unit cell to limit calculation time.

(3) Translational symmetry. The length of a molecule can clearly exceed a lattice constant. In

this case, an inappropriate orientation of the molecule will violate the translational

symmetry.

Trial structures that do not pass the three tests are omitted. These relatively simple checks can

save significant calculation time in subsequent steps. It is possible that a model is so far from

the true structure, that unreasonable intermediate trial structures are required to get to the

global minimum. In this case the tests can be switched off.

4

7.4.5 Evaluation of the fit to the powder pattern and/or the structure envelope

SAFE provides the possibility of minimizing two functions that depend upon free parameters.

The first one is the weighted profile i?-value (Rwp), which describes the fit of the measured

powder pattern to that simulated from the trial structure.

(7-6)R =

'J^wfyfobs^^yfcalc))^rfyfobs-A

Here each point in the measured pattern yfobs) is compared with that of the simulated data

yfcalc). As weighting factor vv-, = y,(obsyl is used. This calculation is done by the XRS

CRYLSP module (See Fig. 7-1). Using other XRS modules (STEPCO, PEAK, SPRING,

DATRDN), the powder data are prepared for their use in the CRYLSP module.

The special feature of SAFFI is the use of a structure envelope for the calculation of a figure-of-

merit. To do this a value from a penalty function P

0 for s, > sUmit

P = s, - y (7-7)V1 limit '

c ^

i limit nun

is calculated, where s, is the density at grid point i in the map used to generate the structure

envelope, and sjimit is the value of the isosurface used to define the envelope, and smin is the

value s,- of the minimum density in the map. The positions of the atoms of the trial structure are

assigned the nearest grid point. Those grid points that lie inside the structure envelope (i.e. S[ >

slimit) are set t0 De 0- The summation of s, over all atoms gives the penalty value P. The lower

this value, the better the model fits the envelope.

The weighted profile /?-value and the envelope penalty value can either be used separately or

in combination to derive the figure-of-merit of a trial structure. If a stracture envelope is used

to find a favorable starting model for a new SAFE run, P alone is used. In the SAFE run itself,

Rwp and P can be combined to calculate the figure-of-merit

F = w}R + w2P (7-8)

where wj and w2 are weighting factors that can be adjusted depending upon the probability that

the envelope is a useful one. In this way, the minimization can be performed while the

molecule is encouraged to stay within the structure envelope.

75

7.4.6 Acceptance of moves and temperature control

Whether a move is accepted or not is decided by the Metropolis importance sampling

technique, described in section 7.2.2. Rwp, P, or F are then used as figures-of-merit in Eqn. 7-3.

If the move is not accepted, a new model is created from the previous one. Otherwise, the

accepted model acts as the starting point for a new modification. After a certain number of

accepted moves, the temperature (AT) in Eqn. 7-3 is decreased to tighten the acceptance

conditions for subsequent moves. As the temperature decreases and fewer models are

accepted, the possibility of escaping from a minimum is more and more difficult. At the end of

the run, a deep minimum is reached, and it is hoped that it is also the global one.

7.5 Structure of the tri-ß-peptide sa322

Application of the SayPerm procedure to the tri-ß-peptide sa322 (Sect. 4.7) allowed a two-

dimensional stracture envelope to be generated. From this projection along the [001] direction,

the coarse shape and the packing of the molecules could be discerned, but that alone did not

suffice to build a model good enough for a Rietveld refinement. The envelope did not provide

information below a resolution of 2.5A, so many conformations of the molecule could fit

inside the envelope. Consequently, an automatic approach to finding the optimal position of

the molecule within the envelope was needed, and the SAFE algorithm was applied.

Furthermore, there were still two space groups (P2|2j2j and P2]2^2) to be evaluated.

7.5.1 Combination of chemical information with a structure envelope

If all 17 torsion angles in the molecule were to be varied freely in the simulated-annealing

process, a large number of chemically unreasonable models would be produced and computing

time wasted. The geometries of many molecular fragments are known from rules of organic

chemistry and from thousands of structure analyses. It is important to include such information

in the input file for the simulated annealing run. In case of the sa322 molecule, the phenyl rings

and the peptide groups must have a planar geometry and so, the corresponding torsion angles

can be fixed.

Furthermore, the structure envelope suggests that the molecule is more or less straight, so

torsion angles or their combinations that would produce a U-turn or a zig-zag form can be

excluded. Torsion angles, not influencing the approximate form and not limited by chemical

rules, were set to vary freely (Fig. 7-2). Altogether, eight torsion angles were allowed to vary

freely (360°) and nine were varied within a limited range (60°). Because of the straight form of

the molecule and the short r-axis, hydrogen bonding between the molecules stacked in this

direction was expected. In order to obtain a chemically reasonable conformation one requires

76

3603-" 60° 6(7 H ?60°

\^

Figure 7-2 Molecule ot sa322 The bonds maiked by an anow aie varied in the SAFE run The others are

fixed. The torsion angles are varied in a range specified by the dcgiee-numbcrs.

that the oxygens are all on the same side (and therefore the three nitrogens on the other).

The starting model for the simulated annealing run was built by hand. A linear sa322 molecule

was moved and rotated withm the cell, until the molecule fitted the two-dimensional structure

envelope (Fig. 7-3). This was done, using molecular modeling software (Cerius2). The starting

parameters input to the SAFE program are shown in Fig. 7-4.

77

7.5.2 SAFE input file

First the space group P212l2l was assumed because it seemed to allow a better packing of the

molecule in the unit cell.

The molecule, consisting of 41 non-hydrogen atoms, was input in terms of internal parameters

(bond lengths, bond angles, and torsion angles). To do that, an input file similar to that for the

MOPAC-program (Steward, 1993) is used. The parameters are listed in the first part of the

input file (Fig. 7-4). AH parameters pertinent to one atom are given in one line:

(1) Atom name. If the description of the molecule begins with an atom located in the middle of

a chain, it is usually necessary to introduce dummy atoms. These are used to define the

initial bond and torsion angles. All atom names beginning with an upper case "D" are

understood to be dummy atoms and are excluded from the powder pattern calculation.

(2) Bond length. The bond length between the current and the previous atom. In the case of the

first atom the bond length is set to 0.0 À,

(3) Bond angle. This is the angle between the two previous atoms and the current one in a

chain. For the first two atoms, the angle is defined to be 0.0°.

(4) Torsion angle. This is defined by the last three and the current atom. The torsion angles for

the first three atoms are set to be 0.0°.

(5) Torsion angle flag. If this is larger than 0, the torsion tingle is varied. Torsion angles having

the same flag are connected and varied in the same manner. For example, the three torsion

angles around the bond connecting a tri-methyl group (-C-(CH3)3), must be changed

simultaneously by the same value, to maintain the geometry of the group.

(6) Maximum torsion angle change. Here the maximum change in the torsion angle in a single

move is fixed. This depends upon the state of the structure solution process. If the current

parameter is thought to be close to the correct one and just a fine tuning is needed, then this

value should be kept low.

(7) & (8) Minimum and maximum torsion angle. These values define the range in which the

torsion angle can be varied. The starting value should be within these limits.

(9) Previous atom in chain. As mentioned in Sect. 7.4.3. the molecule is described in terms of a

main chain (back bone) and connected subchains. In this column the number of the

preceding atom in the chain is given.

(10)Atom where a ring is closed. It is necessary for SAFE two know at which atom a ring is

closed. This information is used, for example, in the internal distance check.

Using this list, it should be possible to describe any connectivity of an organic molecule. It is

oOooooOoooooooooooOooOooOooooOooO

CN

ro^

tn

1X1o

ce

COoO

es

ro

ro

os

CD

r-

r^

asooo

LT,

<1

^Jl

LO

LO

CO

ODo

rH

rH

CD

tJI

rH

^H

rHH

v-irH

rH

rH

rH

r-l

CN

es

CN

CN

es

es

oq

es

CN

ro

ro

ro

CN

ro

ooooooOoOoooooooooOoOOooooooooooO

OooooooOoOOoooooooooooOoooo

CDoooooo

oooooooooOOooOooooOooooooooOooooOo

o

ooooQoro

ooOr-

CS

Oin

rH

oro

ro

olo

ro

OLO

co

orH

OS

or-

(N

ooOSO

ro

oLO

ro

oLO

ro

oLO

CO

ooro

ro

oOs

ooLO

rH

OoLO

rH

oLO

ro

oLO

ro

oLO

ro

CN

oOD

oa-,

ooooooooooOOooOooooo

CDoOoooooOooOoOO

ooooooooOOoOooooooOoOooooooo

oOoOO

ooooooooooooOooooooOoOooaoooooOooO

oooooooooCN

Oai

oOooLO

rH

orH

es

oooOoOoor-

CN

oro

ooa-,

ooCD

oOoOrH

CN

oco

OD

oo

OS

co,

C3

ooooooooooooooo

(73ooOooooooooooo

CDO

<3>

O

CD

ro

o,B

oooooooooooooooooooo

CDoooooooooOOO

oO

ooooooo

oooooooooooOo

CDooo

CDooooo

CDO

CD

LO

o

CD.

CD

<L>

ooooooo

Clo

c-,ooooooo

CDO

CDOo

CDooo

CDooooO

CD

oMl

LO

'~~~

*H

_i

olj

C]

rH

rH

o.)

CN

CN

CN

-H

rH

r<

es

Ol

O]

'v-l

LO

CN

00

oo

r-.

ooooo

ID-oo

^J1

•srjfQ

LQ

too

rooo

ce

D^

r-

LO,

rH

-H

CD

LO

CD

LO

;o

-f

*D?

LO

C\l

OLO

-^

<

rH

—1

rH

*—irH

ri

,—1

^-1

rH

rH

rH

T1

..L

•o

LO

oOoo

r~-o

rH

oCO

oooooooo

CDoOooooooo

CDoOoooooooooo

CDO

orH

rH

CD

.—,

ooooooooooOooooo

CDoOoOooooOooooOOO

oLO

CO

ro

rrj-oooooooo°oooooooooooooOooOoooooOO

LO

o

e^

O

rH

rH

•"ICD

CD

r1-ooooooooooOoooooooOoOoo

OoOooo

CDooo

^LO

o<D

CO

LO

CO

CN

es

LO

es

CO

CN

00

(M

ldo

CO

LO

LO

LO

LO

es

CO

es

LO

MD

ce

e*]

LO

ooo^o

Oj

2r-l

1r^

,-H1

H1

1

rH

1

rH

rH

1^

11

rH

rH

rH

1

1

rH

oro

LO

orH

oCN

ooo

otu

S3rH

CDoO

rH

LO

TOO

»LO

moO

ooooooooooOooooooOooOooOoOooOoooO

^rH

CN

0)

tt)<DoO

(DOooo

r-ooo

r-

r-

r-

i>ooooOO

r-

r-

r^

r~~

r^

r-oOo

r-o

r-

p-

r-

r-

Oo

rH

rH

LO

a.O

>1O]

>!

r\T

>ioj

ro

rHO

sho

oo

"*

-tfooo

T?

•*^

=^

=*

=#oooO^

=*^

^f^

rj<o

oo^

•L}!"vf^

«=#

^jl

oo

ai

as

CM

CN

"O

CD,

cd

o--,en

os

o\

CN

CN

LOO

cd

CD

os

CD

OD

as

es

LO

es

OD

ai

OD

CD

a-,

CD

0)

0Oloo

es

O}

rHoooooo

oq

CN

vi

CNooOoOo

CN

rH

esOoOOoo

U0

-p

rH

vH

rH

rH

r-1

rH

tH

rH

v~i^H

rH

rH

t-1

v-i

rH

rH

rH

,—irH

r-1

rH

rH

rH

rH

rH

rH

rH

rH

.—i

rH

fifi

3M

(0tu

arj

40

Ai

4J

fioooo

I>O

r—.ooH

LO

LO

LO

r-o

r-

CN

rH

LO

LO

LO

00

LO

es

r-~o

ro

roœ

L0

L0

oo

LO

0)to

0m

Ai

-HOoo

es

SO

CN

ro

ro

ro

LO

*?

esM

LO

es

ro

co

l/D

rH

rH

--4

ro

CN

ro

ro

es

LO

es

ro

rH

*H

co

CN

m^H

Q)~H

Uo

in

LO

lo

•&

co

os

LO

LO

LO,

LO

LO

LO

^Jt

ro

CS

^f

•&

to,

LO

LO

LO

LO

LO

es

CO

-*

LO

LO

LO

LO

LO

LO

4-1fi

(ULD

aO

XiQ

<D

H0

Ö!

M4-1

rHu

rH

-fi[hooo

v~irH

tHH

rH

rH

rH

rH

-H

rH

rH

r-l

rH

rH

rH

r-l

rH

rH

rH

rHH

rH

t~i

rH

rHH

^H

^H

T-1,,-J

H-H

r-f-P

mmp

!tiOH

<:0

4J

S0)a

Hg

mg

40

rtSOT

Ol

4-ifi

er-1

0CrH

-Hfi

-H

OHfi

m

-H

CD

fio-q

r-<

00

HSh

40

0H

CnH

Ofi

o;M

HN

rt

•H

<tfO

-Hs

Sfig

rH<

um

-UQ

4-14J

Ph

arj

m(Ö

-H

nM

rH

töcd

H>i

Sh(0

rH

uD

!hMX

OJfi

aLH

OT

ro

•^1

LO

LO

r-

oa

ODo

rH

OS

ro^

LO

IX)

r-

CO

eno

v-i

rH

0*3

asoœ

CO

=*m

SQ

r-

V-1.O)

T3

<tiQ

(Ö4J

rH

<dM

4J

4J

Ü40

(tie

ShrH

ON

CN

oliLDI

OJ

cn

rsiW

f^l

rom

ro

ro

ro

ro

ro

ro

ro

=#

•^1CN

CNH

e^HHHH

v-irH

r-l,rH

fi4J

X4J

0ai

ft

[flfi

fi«i

fiMH

fi*

PO

uU

sO

oU

UOU

Ou

sao

OOUO

OU

aUO

SUU

UO

aUO

WOTQm

Pio

OT

<;H

Ha

HEnW

OT

79

recommended that an atom near the centre of gravity of the molecule is selected as the first

atom in the list of atoms, because this atom is used to define the rotation center and the

position of the molecule. The end of the list is marked with the word "End".

In the next section of the input file, information about the starting conditions and the control of

the model modification is given. The starting position of the molecule is defined after the word

"startingPoint ". This point is the starting position in fractional coordinates for the first atom

(or dummy atom) in the atom list. The maximum translation of the molecule in one move is

given in the "DxDyDz" line. "startRotation" is the starting orientation of the molecule. The

whole molecule is rotated around the first atom in the list. In the next line "RotationRange",

the maximum rotation (in degrees) in one move is given.

The next three lines contain the lattice parameters (CellParameters), the space group

(SpaceGroup) and the region (in fractional coordinates) in which the first atom will be moved

(Assunit). This can be done within the asymmetric unit or another region of the unit cell.

In the next lines, the three geometry checks can be switched on or off. These are the intra¬

molecular distance (intraMoiDistcheck), the packing distance check (PackingDistcheck),

and the translational symmetry check (TransiDistcheck) .The minimum allowed intra- and

intermolecular distance can be fixed (in A) in the "intraMinimalDistance" and

"interMinimalDistance" lines.

In the next line (simulAnneal), five control parameters for the simulated annealing procedure

are given. The first value is the minimum number of moves that must be performed before the

"temperature" is reduced. Multiplying this number by the second value gives the number of

accepted moves that are necessary for a reduction of the "temperature". An inexperienced user

will have difficulty choosing an appropriate starting temperature, so SAFE also performs an

automatic temperature fixing. Starting from a very high test starting temperature, a number of

test moves are performed to obtain an optimal start temperature that produces the acceptance

probability (between 0 and 1) specified in the third value. The test starting temperature is the

fourth value in the line. When a sufficient number of moves, given by the first two values, have

been performed, the temperature is reduced by the fifth value. The last line (surface) defines

the stracture envelope mask. The é?;-value (Eqn. 7-7) of gridpoints having a value higher than

the first value given in this line are set to zero for the penalty function calculation. The second

value defines the weight of the penalty function for the calculation of a figure-of-merit (Eqn.

7-4). If this weight is set to 1.0 the calculation of a weighted profile R-value is switched off.

80

The structure envelope is not used if the "surface" line is omitted. For sa322, the weight for

the envelope penalty value was set to 0.3. For the penalty-function calculation the two-

dimensional envelope was extruded to three dimensions to form a columnar envelope.

Q*

U.W

0.35

iii'fi::,"i\:HjJ

i

-

<N

rt 0.30 W—IlMB ? I.Bii i ihiPib

-

+ timWtM*i ;'

'

'- -

K> B^iJiltii,--ijLfi.?s;i: ,' j

ri 0.25 ICi]'!:S PfPHf BntiKift.k_

II! '' i 11' h 1 >.W

W*f-ttriWiî •

kMi,

k,

0.20

0.15

1

y -.%

i

' ^!^w

20000 40000 60000

accepted moves

80000

Figure 7-5 Trend of the figure-of-mcnl calculated from the weighted profile Ä-value

(Rwp) and the envelope penalty value with the number of accepted moves

yr—

Figure 7-6 Molecule before (light) and after (dark.) the simulated annealing ran

81

7.5.3 SAFE run

First some tests were made to find appropriate parameters for the annealing control. The

temperature was slowly reduced and approximately 80000 moves were performed before the

reduced temperature prevented escape from the minimum and then the figure-of-merit value

was minimized within that minimum. Fig. 7-5 shows the trend of the combined figure-of-merit

(0.7 Rwp + 0.3 P) with the number of accepted moves. The difference between the starting

structure and the final one from the SAFE ran is shown in Fig. 7-6.

7.5.4 Refinement of the crystal structure

The atomic coordinates of the final model obtained from the SAFE run were then refined. Both

the crystallographic and the profile parameters are written to a binary data file that could be

used without change for the Rietveld refinement. As the refinement progressed, it became

apparent that the refinement was more stable if the hydrogen atoms were included in the

model. The final parameters of the refinement are given in Tab. 7-1.

Unit cell

Space group P2,2,2la(A) 61.033

b(A) 11.185

c(A) 5.084

Refinement

29 range (°20) used in refinement 2 -42

Number of observations 2057

Number of contributing reflection 268

Number of geometric restrains 100

Number of structural parameters 123

Number of profile parameters 8

^exp 0.056

KP 0.134

RF 0.128

Table 7-1 Experimental anc structural data for sa322

Fig. 7-7 shows the corresponding Rietveld plot. The relatively high ^-values probably result

from an inconsistency in the data. As mentioned earlier, the sample seemed to change during

the synchrotron measurement. A low temperature data collection might allow this problem to

be circumvented. Tn the refinement only data up to 42° 20 were used, because the data at

higher 29 were not consistent to the those of the low angle region. However the stracture

82

100.0

80.0

60.0

40.0

20.0

0.0

i i t

wS-4. 4~rV ^t~^'ÀJ^yV^Y!,'^---,',-.S -

L» -jt MyJL*Jktök

2.5 7.5 12,5 17.5 22.5

2-theta

27,5 32.5 37.5 42.5

Figure 7-7 Sa322 Rietveld plot

83

refined is very likely to be the correct one. Of course, it cannot be absolutely guaranteed that

the stracture is correct in every detail, but the probability that almost all features are correct, is

rather high. The final refined crystal structure and the structure envelope used in the SAFE run

are shown in Fig. 7-8.

The whole procedure, SAFE run and refinement, had to be repeated, using the other possible

space group P2|2|2, but less satisfactory results were obtained.

84

8 Conclusions

A periodic nodal surface (PNS) that partitions a unit cell into regions of high and low electron

density can be calculated from just the data in a powder diffraction pattern. These surfaces

(structure envelopes) envelop the structure and therefore can be useful in direct-space

structure-determination algorithms.

The reflections needed to generate a structure envelope can be selected by following some

simple rules. The intensities of these low-index reflections can usually be extracted reliably,

because the reflections from the low-angle region of the powder diffraction pattern tend not to

overlap. However, to calculate the envelope, the phases of the corresponding structure factors

are needed.

Sometimes, just the origin defining reflections, whose phases can be assigned arbitrarily, are

sufficient to generate an informative structure envelope, but usually the phases of additional

reflections must be determined. To do this, the computer program SayPerm, which can

estimate the phases of structure factors from low resolution (2.5-3.5 Ä) data, has been written.

It combines the pseudo-atom approach to simulate atomic resolution, the Sayre equation for

phase extension and phase set evaluation, and phase permutation using error correcting codes

(ecc's) to obtain an efficient sampling of the phase space. The SayPerm approach works

reliably for small- to medium-sized structures that can be treated reasonably well as equal

pseudo-atom structures. In such cases, the validity of the Sayre equation can be used as a

figure-of-merit to select the best phase set.

An alternative approach is to (i) permute the phases of seven structure factors using the

Hamming [7.4,31 ecc, (ii) generate a Fourier map from each phase set and (iii) evaluate each

map by eye. If chemical information such as the connectivity of a small organic molecule is

known, there is a good chance of recognizing the coarse features of the structure in one of the

16 Fourier maps.

A structure envelope restricts the volume of the unit cell in which atoms are likely to be

located, so the structure determination process can be accelerated dramatically. Stracture

envelopes used as masks in an exhaustive gridsearch combined with a specialized topology

search, allowed zeolite structures to be solved. For comparison, the searches were repeated

without the envelope mask. The use of the envelope reduced the amount of computer time

required by as much as two orders of magnitude, and in two cases proved to be essential for the

structure solution.

85

The shape of a structure envelope restricts the allowed conformations of a organic molecule

considerably. By combining the envelope with a simulated-annealing-controlled structure-

determination procedure, a previously unknown tri-ß-peptide structure (C32N306H53) with 23

degrees of freedom, could be solved. The envelope could be used to obtain suitable starting

models for the simulated annealing run and/or to accelerate the convergence (fit of the

calculated to the measured powder pattern) during the run.

86

9 Possible developments of the structure envelope approach

The potential of using a structure envelope to facilitate structure determination from powder

diffraction data has been demonstrated in this study. Nevertheless, many improvements in the

method can be envisioned

At the moment, the estimation of phases using the the SayPerm program is only feasible for

centrosymmetric structures. With its efficient phase permutation techniques (error correcting

codes), SayPerm should also be applicable to noncentrosymmetric structures. However,

because of the high number of phase combinations, the selection of the best phase set using

just validity of the Sayre equation as the only figure-of-merit is a problem with

noncentrosymmetric structures. Additional criteria must be found.

If a phase set producing a low R-value is produced and the results of the consistency test are

satisfactory, the further potential of the Sayre equation could be implemented in SayPerm.

After phase extension, a limited number of amplitudes could be extended to stracture factors of

overlapping reflections. At least the information whether a reflection is strong or weak should

be derivable from its relationships to the other structure factors (e.g. Jansen, Peschar &

Schenk, 1992, Dorset, 1997). The sum of the intensities of the overlapping reflections in a

cluster could be fixed during the calculations but the partitioning adjusted. If the Ä-value

derived from the validity of the Sayre equation is decreases after such an amplitude extension,

the new reflection intensities are likely to be closer to the correct ones.

A useful structure envelope divides the unit cell into regions where atoms or fragments of a

structure are likely to be found and those where they are not. Thus, it can serve as a guide in

the evaluation of a Fourier map generated from all stracture factors or be used to perform a

density modification. Theoretically, the envelope can be used directly in the phasing process.

After a SayPerm run, the peaks in a Fourier map (from the best phase set) that lie on the

negative side of the envelope could be omitted or weakened (using a penalty function). This

modified map could then be Fourier transformed in order to obtain a new phase set. Then

more reflections (than in the first SayPerm run) could be selected to have fixed phases in the

next SayPerm run. From the structure factors with fixed phases, a new, more detailed envelope

could be generated and used in the next Fourier map modification. This iterative density

modification and phase extension using a structure envelope mask would be similar to the

Shake and Bake algorithm (Miller, Gallo, Khalak & Weeks, 1994) and the solvent flattening

approach (e.g. Hoppe & Gassmann, 1968) used in protein crystallography.

87

Structure envelopes can be used to find a starting position, orientation and conformation of an

organic molecule for a simulated-annealing run. Nevertheless, for a low-quality envelope or a

very complex structure, this starting model can be ambiguous and several starting models must

be tested. Perhaps, this could be done systematically using error correcting codes (ecc's).

Positional and orientational parameters and torsion angles could be selected and varied under

the control of an ecc to generate starting models. Starting from each of these models, a

simulated annealing run could be performed. The final model that is found most often and

whose powder pattern shows the best match to the measured one, is likely to be correct. For

example, using the Golay[24,12,8] code, 64 positions, 64 orientations, and six torsion angles

with four values each could be varied more or less systematically to give 4096 starting models.

Of course, the parameters for the simulated annealing runs must allow the calculations to be

performed within a reasonable time frame. The computing time could be also reduced by using

a structure-envelope penalty function. Simulated-annealing runs from more than one starting

model are also used in protein crystallography.

Just as a structure envelope is closely related to a molecular envelope approach used in protein

crystallography, some of the methods suggested above are also derived from macromolecular

crystallography. It would definitely be worthwhile evaluating other methods used in this field

for their applicability to powder data more systematically. It may even be possible to use

available or slightly modified software from protein crystallography for this purpose.

88

10 References

Abele, S. (1999). Ph.D. Thesis,ETH Zurich, Switzerland.

Alexander, D. (1973). PEAKFIND, A Program for the determination ofpeak positions in

powder profiles, ICI, U.K.

Altomare, A., Carrozini, B., Cascarno G., Giacovazzo C, Guagliardi A., Moliterni, A. G. G.,Rizzi R., Burla M. C, Polidori, G.& Camalli, M., (1997), EXPO, A packageforfull

pattern decomposition and for solving crystal structures by direct methods.

Andersson, S„ Hyde, S.T., Larsson, K„ Lidin, S. (1988). Chem. Rev., 88, 221-242.

Andersson, K.M. & Hovmöller, S, (1996). Acta Cryst., D52. 1174-1180.

Andreev Y.G. & Brace, P.G.(1998). J. Chem. Soc. Dalton Trans., 4071-4080.

Annot, S. & Wonacott. A.J. (1966). Polymer, 7, 157-172.

Baerlocher, Ch.,& Hepp, A. (1982). XRS-82, Inst, tuer Kristallographie, ETH Zurich,Switzerland.

Baerlocher, Ch. (1990). EXTRAC, Inst, fuer Kristallographie, ETH Zurich, Switzerland.

Brenner, S., McCusker & LB., Baerlocher, Ch. (1997). JAppl. Cryst., 30, 1167-1172.

Boutif, A., & Louer, D. (1991). J. .Appl. Cryst., 24, 987-993.

Bricogne, G. (1997). Meth. Enzymology, 276, 424-448.

Bricogne, G. (1976). Acta. Cryst. A32, 832-847.

Brunner, G.O. (1979). J. Solid Stat. Chem., 29, 41-45.

Bunge HJ., Dahms, M. & Brokmeier FLG. (1989). Cryst Rev., 2, 67-88.

Camblor, M.A., Corma, A. & Diaz-Cabanas, M.-J. (1998)../. Phys. Chem. B, 102, 44-51.

Cerny, R. (1996). Proc. of the 45th Annual Confom Application ofX-ray Analysis Denver, 40,CDJR.OM.

Cascarano, G. Favia, L. & Giacovazzo. C. (1992). J. Appl. Cryst. 25, 310-317.

Cernik, R.J., Cheetham, A.K., Prout. CK., Watkin, DJ., Wilkinson, A.P., Willis, B.T.M.

(1991) J. Appl. Cryst, 24, 222-226.

Cernychev, V.V. & Schenk. H. (1998). Z. Krist, 213. 1-3.

Ciacovazzo, C. (1992). Fundamentals of Crystallography, Chap. 2, Oxford University Press

Coulombe, R. & Cygler, M. (1997). Acta Cryst. D53, 426-433.

Csoka, R., David, W.I.F. & Shankland. K., (1997) Proceedings of the 5th Européen Powder

Diffraction Conference, Parma. Italy.

David, W.I. F. (1987). JAppl Cryst. 20, 316-319.

David, W.I. F. (1990). Nature (London), 346, 731-734.

David, W.I. F.. Shankland K. & Shankland, N. (1998). Chem. Commun., 931-932.

Debaerdemaker, T., Tate.C, & Woolfson. M.M. (1985). Acta Cryst. A41, 286-290.

Dorset, D.L. (1997). Polymer, 38. 247-253.

Dorset, D.L. (1997a). Acta Cryst. A53, 445-455.

89

Dorset, D.L.(1997b). Proc. Natl. Acad. Sei., 94, 1791-1794.

Dreele, von R.B. (1990). GSAS, General Structure Analysis Program, Los Alamos National

Laboratory.

Estermann, M. A. (1991). Ph.D. Thesis, ETH Zurich, Switzerland.

Estermann, M. A. & Grämlich, V. (1993)../. Appl. Cryst. 26, 396-404.

Fischer, W. (1971). Z. Krist. 138, 129-146.

Gilmore, C.J. (1996). Acta Cryst, A52. 561-589.

Gilmore, C.J., Henderson, K., & Bricogne, G. (1991). Acta Ciyst. A47, 830-841.

Glover, I., Haneef, I., Pitts, J., Wood, S., Moss. D„ Tickle, I.,& Blundell, T.(1983).Biopolymers, 22, 293-304.

Glycos, N.M., 1998, Glycos, N.M.. Holzenburg. A., Phillips, S.E.V. (1998). Acta Cryst, D54,215-225.

Golay, M.J.E. (1949). Proc. IRE (IEEE). 37, 23-28.

Grosse-Kunstleve, R.W. (1996). Ph.D. Thesis, ETH Zurich, Switzerland.

Grosse-Kunstleve, R. W„ McCusker L.B. Baerlocher, Ch. (1997). Acta Cryst. A30, 985-995.

Grosse Kunstleve, R. W. (1997b). personal communication.

Grosse-Kunstleve, R. W., McCusker L.B. Baerlocher, Ch. (1999). J. Appl. Cryst. 32, 536-542.

Grosse-Kunstleve, R. W., Brunner, G. O. & Sloane, N. J. A.(1996). Acta Cryst. A52, 879-889.

Grosse-Kunstleve, R. W., (1998). Sginfo, a comprehensive collection ofANSI C routinesfor the

handling of space groups., ETH, Zurich. Switzerland.

Hall, S. R.. King, G. S.D. & Steward, J. M. (Eds.) ( 1995). Xtal3.4 User's Manual, Universityof Western Australia.

Hall, S. & Subramanian, V. (1995). Xtal3.4 User's Manual, University of Western Australia.

Haedieke, E., Frickel, F. & Franke A. (1978). Chem. Ber. Ill, 3222-3232.

Harker, D. (1953). Acta Cryst. , 6, 731-736.

Herron, N., Thorn D.L., Harlow, R.L., Jones, G.A., Panse J.B., Fernandez-Baka, J. A., &Vogt,T., (1995). Chem. Mater. 7, 75-83.

Hamming, R. W. (1947). Memorandum 20876. Bell Telephone Laboratories, Murray Hill. NJ.

USA.

Harris, K.D.M., Tremayne, M., Lightfoot, P. & Bruce, P.G. (1994). J. Am. Chem. Soc, 116,3543-3547.

Harris. K.D.M., Johnston, R.L. Kariuki, B.M. (1998). Acta Cryst. A54. 632-645.

Hedel, R., Bunge, H. J. & Reck, G. (1994). Mater. Sei. Forum, 157-162, 2067-2074.

Hoppe, W. & Gassmann, J. (1968). Acta Crxst. B24, 97-107.

Hyde, S.T. (1993) Defects and Processes in trhe Solid State: Geoscience Applications, The

McLaren Volume, edited by J.N.Boland & J.D. Gerald, pp 317-342. Amsterdam: Elsevier.

International Tables for X-ray Crystallography, Vol. A.( 1995). ed. Hahn, T, Klüver Academic

Publishers, Dordrecht, 856-869.

90

Jansen, J., Peschar, R. & Schenk, H. (1992). JAppl. Cryst. 25, 237-243.

Larrson, K., Fontell, K. & Krogh, N. (1980) Chem. Phys. Lipids. 27, 321-324.

Lasocha, W. & Schenk, H. (1997). J. Appl. Cryst, 30, 561-564.

Le Bail, A. (1988), Mm. Res. Bull., 23, 447-452.

Leone, S. (1998), Ph.D. Thesis, ETH, Zurich, Switzerland.

Leonowicz, M.E., Lawton, .1. A. Lawton S. L., Rubiv. M. K., (1994), Science, 264, 1910-1913

Louer, D. (1992). Proceedings of the International Conference, May 26-29, E. Prince, J.K.

Stalick (Eds.); NIST SP 846, 92-104

Mackay, A. (1979). IUCR Meeting. Copenhagen, Denmark.

Main, P. (1977). Acta Cryst, A33, 750-757.

Marler, Grünewald-Lüke, & Gies, (1995). Zeolites, 15, 388-399.

Meier, W. M., Olson, D. H., & Baerlocher, Ch. ( 1996). Atlas ofZeolite Structure Types; 4th ed.

London: Elsevier.

Metropolis, N., Rosenbluth, A.W., Rosenbluth, M.N. Teller, A.H & Teller, E. (1953). J. Chem.

Phys., 21, 1087.

Miller, R., Gallo, S. M., Khalak, H. G. & Weeks, C. M. (1994). J. Appl. Cryst., 27, 613-621.

MSI, (1997) Molecular simulations Inc.,Cerius23,5.

Neovius, E.R.(1883 ). Bestimmung Zweier Spezieller Periodischer Minimalflaechen, J.C.

Frenckel & Sohn, Helsinki.

Pawley, G.A. (198 l). J. Appl. Cryst, 14, 357-361.

Putz, H. & Schoen, J.C, (1999). Annual Meeting of the German Crystallographic Association,

Leipzig.

Rietveld, H. (1969)../. Appl. Cryst., 2, 65-71.

Rius J. & Miravitlles, C (1987). J. Appl. Cryst., 20. 261-264.

Rius, J. (1993). Acta Cryst, A49, 406-409.

Rius, J., Sane, J., Miravitlles, C, Amigo, J.M. & Reventes, M.M. (1995a). Acta Cryst, A51,268-270.

Rius, J., Sane, L. Miravitlles, C, Gies, FL, Marler. B. & Oberhagemann, U. (1995b). Acta

Cryst, A51, 840-845.

Rius J., Miravitlles, C. & Allmann, R. (1996). Acta Cryst, A52, 634-639.

Rius J., Miravitlles, C, Gies H,& Amigo J.M. (1999). J. Appl. Cryst., 32, 89-97.

Rodriguez-Carvajal, J. (1993). FullProf. Grenoble, France.

Rossmann, M.G. & Blow, D.M. (1962). Acta Cryst. A42, 402-404.

Sayre. D. (1952). Acta Crvst 5, 60-65.

Sayre, D. (1972). Theory and Practice in Direct methods in Crystallography, edited by M.F. C.

Ladd & RA. Palmer, 271-286. New York: Plenum.

Scriven, L.E. (1976). Nature, 266, 123.

91

Schnering, FLG. von & Nesper, R. (1987). Angew. Chem. Int. Ed. Engl 26, 1059-1080, and

references therein.

Schnering, H.G. von & Nesper, R. (1991). Z. Phys. B, 83, 407-412.

Schwarz, H.A. (1890). Gesammelte Mathematische Abhandlungen; Springer, Berlin.

Seebach, D., Mattews, J.L., Meden. T., Wessels. T., Baerlocher, Ch. & McCusker L.B. (1997),Helvetica ChimicaAcla, 80, 173-182.

Shankland, K., David, W.I.F. & Sivia, D.S. (1997). J. Mater. Chem., 7, 569-572.

Shankland, K., David, W.I.F., Csoka, T. (1997). Z. Krist, 212, 550-552.

Shannon, CE. (1948). Bell Syst.Tech. J. 27, 623-656.

Sheldrick, G. M. (1993), SHELX-93, University of Goettingen, Germany.

Sivia, D.S. & David, W.I.F. (1994). Ada Cryst, A50, 703-714.

Stanley, E.(1986). Acta Cryst. A42, 297-299.

Steward, LP. (1993). MOPAC, a general-purpose semi-empirical molecular orbital package,http://jschem.korea.ac.kr/services/mopac/mopac.html

Stout, G.LL, Jensen, L.LI. (1989). X-ray structure determination, A practical guide, John Wiley& Sons, New York, 309-324.

Subbiah, S. (1993). Acta Cryst. D49, 108-119.

Taupin, D. (1973a)../. Appl. Cryst, 6, 266-273.

Taupin, D. (1973b)../. Appl. Cryst, 6, 380-385.

Taupin, D. (1988). J. Appl. Cryst, 21, 485-489,

Taupin, D. (1989). J. Appl. Cryst, 22. 455-459.

Tremayne, M., Dong, W., Gilmore C.J. (1998). BCA Spring Meeting, St.Andrews, Scotland.

Tremayne, M., Kariuki. B. M. & Harris, K.D.M. (1997), Angew. Chemie, Int. Ed. Engl, 36,770-772.

Visser, JW. (1969)../. Appl. Cryst, 2, 89-95.

Weeks, C. M., DeTitta, G. T... Hauptmann, H. A.. Thuman, P., & Miller, R. (1994). Acta Cryst,A50, 210-220.

Werner, P.-E., Erikson, L. & Westdahl. M. (1985). ./. Appl. Cryst. (1985), 18, 367-370.

Wessels, T.. Baerlocher, Ch. & McCusker, L.B. ( 1999). Science, 284, 477-479.

White, P.S. & Woolfson, M.M. (1975). Acta Crvst. A31, 53-56.

Woolfson, M.M. (1954). Acta Cryst, 7, 65-67.

Woolfson, M.M. (1958). Acta Cryst.. 11. 277-283.

92

Curriculum Vitae

Birth

Leipzig, Germany, July 1, 1965

CitizenshipGerman

Marital Status

single

Education

1972 - 1982 Polytechnische Oberschule, Frankfurt (Oder), Freiberg, Leipzig

1982- 1985 Metal-worker apprenticeship

1985 Abitur

1985 - 1987 Military service

1987 - 1989 Employed in a bookshop

1989 - 1994 Stdent at the Universität Leipzig

1994 Diplom in crystallography

1994 - 1995 Research at the Max-Planck-Gesellschaft in Rostock

I started this thesis research in January 1996

93

Dank

Am Ende dieser Arbeit möchte ich allen danken, die direkt oder indirekt, mehr oder weniger,

bewusst oder unbewusst zu deren Gelingen beigetragen haben. Das sind, neben vielen anderen

alle Kollegen des Laboratoriums für Kristallographie an der Eidgenössisch Technischen

Hochschule in Zürich mit denen die Arbeit grossen Spass gemacht hat,

Dr. Lynne McCusker und Dr. Christian Bärlocher die nicht nur begeisternde Chefs, an die ich

mich nicht nur jederzeit mit Fragen und anderen Dingen wenden konnte, sondern die auch

wunderbare Kollegen waren,

Prof. Dr. Walter Steurer, meinem Doktorvater und Gutachter der Dissertation,

Prof. Dr. Jordi Rius für das Begutachten und die wertvollen Hinweise in der Endphase der

Arbeit,

Dr. Thomas Wessels, Dr. Javier de Onate, mit denen es ein Vergnügen war, das Büro zu teilen,

Dr. Tone Meden, der mich mit bewundernswerter Geduld in die Programme, die für die Arbeit

mit Pulverdaten nötig sind, eingeführt hat,

Prof. Dr. Grämlich, Dr. Torsten Haibach, Dr. Michael Estermann und Dr. Jürgen Schreuer, die

auf unzählige Fragen Antworten gegeben haben,

Dr. Ralf Grosse-Kunstleve, der mir bei der Anpassung des FOCUS-Programms für die

StruktLireinhüllenden prompt und ausführlich behilflich war, und dessen Sginfo- und

Atominfomodule unverzichtbare Bestandteile, der im Rahmen dieser Dissertation

entstandenen Programme, sind,

Prof. Dr. Chris Gilmore, Prof. Dr. Kenneth Harris, Dr. Benson Kariuki, die mir Anregungen zu

Ideen, die wichtig für diese Arbeit waren, seseben haben,

Prof. Dr. Hermann Gies für seine ermunternden Worte und das zur Verfügung stellen der

RUB-3 Pulverdaten,

und noch einmal Dr. Lynne McCusker, für das. ganz bestimmt, nicht immer unterhaltsame

Geradebiegen meiner englischen Ausdrucksweise.

Diese Arbeit wurde vom Schweizer Nationalfond (SNF) unterstützt.

Rights / License: Research Collection In Copyright - Non ... · 2.1 Periodic MinimalSurfaces...

Documents

Transcript of Rights / License: Research Collection In Copyright - Non ... · 2.1 Periodic MinimalSurfaces...