Standardizer, canonicalization and chemical business rules for structure database handling: US UGM...

22
Solutions for Cheminformatics Standardizer canonicalization, conversion and registration

description

Standardizer is a popular component of compound registration systems providing customizable functions for the transformations of mesomers, tautomers, salts and solvents in the molecule files and databases. New actions help to convert molecule libraries and to restore the chemical information encoded in old compound databases. For latest details see here: http://www.chemaxon.com/product/standardizer.html

Transcript of Standardizer, canonicalization and chemical business rules for structure database handling: US UGM...

Page 1: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

•Solutions for Cheminformatics

Standardizer

canonicalization, conversion and registration

Page 2: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Introduction to Standardization

• Standardization is the first step of chemical canonicalization, the conversion of functional groups and other structural elements of molecules to a predefined representation.

• Standardizer is a tool for the standardization of structures, and it provides other conversion functions as well.

• Standardizer is available in the form of a integratable class, as a stand alone application, and integrated with ChemAxon’s databases as well.

• Standardizer is configurable by a list of actions to accommodate corporate standards.

Page 3: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Aromatization

Aromatize – Basic MethodConverts bonds to aromatic type according to the current resonant form of the molecule.

Page 4: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Aromatizations

Aromatize – General MethodConverts bonds of rings having aromatic character to aromatic type .

DearomatizeConverts aromatic bonds to alternating single/double bonds.

Page 5: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Hydrogen Actions

Add Explicit HydrogensConverts implicit hydrogens to explicit ones (adds hydrogen atoms to the graph).

Page 6: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Hydrogen Actions

Remove Explicit HydrogensConverts explicit hydrogens to implicit ones (removes hydrogen atoms from the graph). Special hydrogens (isotope, charged, radical, mapped, lonely) can be handled optionally.

Page 7: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Stereo Actions

Absolute StereoSets the chiral flag if a compound has tetrahedral stereo center.

Clear StereoRemoves stereo features.

Page 8: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Stereo Actions

Convert Wedge InterpretationConverts an wedge between two stereo centers into two separate wedges.

Convert Double BondsConverts the crossed and wiggly representations of unknown stereo double bond

Page 9: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Clean Actions

Clean2DCalculates the atom coordinates for two-dimensional structure representation.

Page 10: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Clean Actions

Clean3DCalculates the lowest energy conformer of the molecule.

Page 11: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Clean Actions

Wedge CleanRealigns wedge bond according to the IUPAC preferences.

Page 12: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Clean Actions

Template based CleaningCalculates the atom coordinates using templates. It is a useful action for the alignment of combichem libraries to the scaffold.

Page 13: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Group Actions

UngroupUngroups superatoms and multiple groups irreversibly.

Expand/ContractThe superatoms and multiple groups can be opened and collapsed with these reversible actions (the group info remains in the structure).

Page 14: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Group Actions

Alias to GroupConverts alias and pseudo atoms to superatom groups according to their symbols.

Page 15: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Transform Actions

TransformThe transform action provides a general interface for user defined conversion of structural elements. Transforms are useful for the standardization of mesomers, tautomers, salts and for the removal of specific counterions and solvents.

Page 16: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Salt Handling Actions

Remove FragmentProvides various options to remove small fragments like counterions.

Page 17: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Salt Handling Actions

NeutralizeNeutralizes ionic functional groups but keeps the formal charges of mesomers and quaternary ammonium ions.

Page 18: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Reaction Mapping Actions

Map ReactionsAssignes map numbers to the corresponding atoms of a reaction scheme.

UnmapRemoves map numbers from the atoms of a reaction scheme.

Page 19: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Other Actions

Remove IsotopesConverts isotopic atoms to elemental atoms.

Page 20: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Standardizer Demo

Page 21: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

Future Plans

Multiprocessor support

Some complex actions will be converted to more smaller actions to improve readability of the configuration.

Structure checker functionality (just check, report, but do not convert).

New actions• Group (autocreating superatoms)• Convert to enhanced stereo

Graphical design improvements.

Page 22: Standardizer, canonicalization and chemical business rules for structure database handling: US UGM 2008

• Thank you for your attention!• For more information please visit

www.chemaxon.com