GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where...

31
IAEA-142 GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS DECEMBER 1971 EDITION PREPARED BY J. HEIJNEN AND C. TODESCHINI A TECHNICAL REPORT PUBLISHED BY THE INTERNATIONAL ATOMIC ENERGY AGENCY, VIENNA, 1971

Transcript of GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where...

Page 1: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

IAEA-142

GUIDELINESDEVELOPMENT

MAINTENANCETHESAURUS

DECEMBER 1971 EDITIONPREPARED BY

J. HEIJNEN AND C. TODESCHINI

A TECHNICAL REPORT PUBLISHED BY THEINTERNATIONAL ATOMIC ENERGY AGENCY, VIENNA, 1971

Page 2: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

The IAEA does not maintain stocks of reports in this series. However,microfiche copies of these reports can be obtained from

1NÍS Microfiche ClearinghouseInternational Atomic Energy AgencyKârntner Ring 11P.O. Box 590A-1011 Vienna, Austria

on prepayment of US $0.65 or against one 1 AEAmicrofiche service coupon.

Page 3: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

PLEASE BE AWARE THATALL OF THE MISSING PAGES IN THIS DOCUMENT

WERE ORIGINALLY BLANK

Page 4: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

FOREWORD

The Board of Governors of the International Atomic Energy Agency(IAEA), meeting in February 1969 in Vienna, authorized the establishmentof the International Nuclear Information System (INIS).

INIS is a mission-oriented system in nuclear science and thepeaceful applications of nuclear technology. It is highly decentralizedfor both input preparation and retrieval operations. INIS outputproducts are multi-purpose; both computer tapes and a printed biblio-graphy are available.

Optimal use of computers for handling information, especiallyin a highly decentralized environment, requires a high degree ofconsistency and compatibility in the preparation of input. The IAEApublishes a set of standards and rules called: The INIS Reference Series.They are based on long-term experience with other information systemsand were prepared to meet the decentralized and specialized INISrequirements. INIS, as a mission-oriented system, seeks to interfacewith other mission- or subject-oriented information systems. In orderto avoid duplication in the work done for subject-oriented systems, itis expected that INIS will attempt to link itself with such systems inan effort to interchange or exchange input or output products. Thisrequires a high degree of compatibility.

This is in keeping with the UNISIST concept to develop and promotethe idea of a world-wide network of information systems.

INIS may be considered as the first decentralized world-widemission-oriented information system serving as an example for thefeasibility of the UNISIST concept. The IAEA fosters this development

Page 5: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

in various ways, such as the careful preparation and maintenance ofits controlled vocabulary to enhance subject retrieval.

Draft guidelines for the development and maintenance of theINIS Thesaurus ' were drawn up by the IAEA Secretariat and with minormodifications were approved and used by the INIS Centre of theCzechoslovak AEC; by Le Service Central de Documentation, CEA de Prance;by the Division of Technical Information Extension of the US AEC andthe Information Research Group, Lawrence Berkeley Laboratory,University of California, in a systematic study of the INIS Thesauruslasting from March to September 1971• Alterations and improvements inthe guidelines resulting from the systematic thesaurus study areincluded in the present document. These guidelines were approved bythe first meeting of the Working Group on the INIS Thesaurus andRelated Technical Problems which met in Vienna on 2-4 November 1971-This approval was confirmed by the INIS Advisory Committee at itsVienna meeting on 18-19 November 1971•

Acknowledgement

The authors would like to thank members of the INIS Section,headed by Mr. Zhan Turkov, for their comments and suggestions for theimprovement of this document. Furthermore we are most grateful to:Messrs J. Hybner, J. Helbich and 0. Vojtech of the Czechoslovak AEC;Mr. J. lung, of Le Service Central de Documentation, CEA de France;Mr. D. Davis of the Division of Technical Information Extension of theUS AECj and to Mrs. J. Herr of the Information Research Group,Lawrence Berkeley Laboratory University of California, for theirvaluable contributions to these guidelines.

1) lAEA-INIS-r? (INIS: Thesaurus)

Page 6: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

I. INTRODUCTION

INI S requires that for each piece of literature reported to thesystem both the bibliographical description and a set of descriptors toidentify the subject content of the document be provided. It was recom-mended by the Study Team which initially designed INIS that co-ordinateindexing be used for the system

Co-ordinate indexing of full-text documents offers high precisionin indexing, i.e. both high quality and consistency, and consequentlygood retrieval performance. For high-quality indexing it is aprerequisite that indexers have good subject knowledge. In order toachieve high consistency the controlled vocabulary (thesaurus) fromwhich descriptors are to be selected needs an understandable andpredictable structure and clear presentation. The easier it is for anindexer to be led to the appropriate terminology in the thesaurus,the more likely he will produce consistent indexing.

Therefore the criteria on which the thesaurus is constructedshould be unambiguous and clear to all contributors and users of INIS.Retrieval operations on INIS output tapes will be performed on adecentralized basis by participants in INIS. Indexers as well asretrievers preparing query profiles using INIS descriptors for SDI orretrospective searches will be scattered all over the world. Further-more indexers at the different centres are not necessarily thepersons preparing query profiles for mechanical searches. Advancedcomputer techniques will make it possible for anyone wishing toretrieve certain information to go to an on-line console and type inhis own query profile, provided the thesaurus from which he selects hisdescriptors is structured in such a way that all concepts containedare unambiguously defined and their presentation self-evident. Whether

1) "INIS Study Team Report", IAEA,

- 5-

Page 7: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

a retriever is an experienced indexer or someone with no knowledge ofinformation science at all, he should not be flooded with informationnot directly pertinent to his question while doing a machine search.An information system will stand or fall with the appeal it has toits users.

The main reasons for establishing these guidelines are:1) to set out clear and unambiguous criteria on which the

structure of the terminology in the INIS Thesaurus is based;

2) to allow the INIS Centre in Vienna to perform a highlycentralized maintenance of the INIS Thesaurus;

3) to have a broad basis for compatibility between the INISThesaurus and the controlled vocabularies of other mission- orsubject-oriented information systems, existing or planned.(See: Foreword)

The latter is also the reason why the Guidelines for theDevelopment and Maintenance of the INIS Thesaurus follow closely thosedeveloped by UNESCO ; Guidelines for the Establishment and Developmentof Monolingual and Technical Thesauri for Information Retrieval.Continued reference was also made to the Guidelines commonly knownas the EJC/Project IEX guidelines !Thesaurus Rules and Conventionswhich are much in parallel with the UNESCO guidelines. On questionsof the hierarchy of the terminology, reference was also made to a

2)letter by Mr. Eugene Wall to the editor of the ASÍS Journal .

In the present document the term "descriptor" will be usedexclusively to signify an accepted term and to differentiate it froma forbidden term or non-descriptor.

The examples used in this document have not necessarily beentaken from the INIS Thesaurus.1) Thesaurus of Engineering and Scientific Terms, US Department of

Defense, Report No. AD 672 000, 196?, pages 673-6792) Journal of the American Society for Information Science - Sept. -Oct.

1970, pages Jft-JJ*- 6 -

Page 8: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

II. THE THESAURUS STRUCTURE

The most important function of a thesaurus is to serve as a toolin information retrieval. Descriptors chosen from the thesaurus mustgive clear indications of the information and date content of adocument. To do this their meaning must be well defined arid completelyunambiguous. This semantic definition must be provided in the thesaurusby means of the structure which is given to the terminology. Itbecomes imperative therefore, that the interrelationship betweenindividual descriptors be brought clearly into evidence.

These interrelationships are of three types: preferential;hierarchical; affinitive. All three have the property of reciprocity,i.e. when two or more descriptors are related in any way, reciprocalentries should be provided. It is one of the purposes of theseguidelines to lay down the criteria which govern the existence orvalidity of an interrelationship between any two descriptors withinany of the three types mentioned above.

Cross references

Interrelationships among terms will be shown by cross referenceswhich are here grouped according to the three types:

Cross References _________Symbol____________TypeUSEUSED FORSEE ... OR

SEEN PORBROADER TERMNARROWER TERM

USEUPSEEORSPBTNT

Preferential

Hierarchical

RELATED TERM RT Affinitive

- 7 -

Page 9: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

1. Preferential relationships

These cross references are employed to refer from a forbiddenterm to a descriptor(s) and vice versa. They are used when themeaning of descriptors overlap substantially; where differentspellings of the same word exist; for synonyms, antonyms andhomonyms and, in general, wherever a choice has been made betweena number of terms, all of which are included in the thesaurus display.

a) USE ReferenceThe USE reference is intended to lead users of the thesaurus froma forbidden term to a descriptor(s) which must be used in its stead.This "Exclusive" reference is employed in a variety of situations:

i) to indicate a preferred synonyme.g. -c olumbium

USE NIOBIUM

ii) to indicate a preference between spelling variationse.g. -sulphur

USE SULFUR

iii) to expand or explain abbreviationse.g. -esr

USE ELECTRON SPIN RESONANCE

iv) to express concepts which can be considered synonyms forpurposes of indexing and retrievale.g. -maxwell-boltzmann distribution

USE BOLTZMANN STATISTICS

v) to bring together different viewpoints of a conceptualcontinuume.g. -softness

USE HAREWESS

- 8 -

Page 10: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

vi) to reflect current terminology, eliminate jargon or exercisea choice for semantic or other reasonse.g. -electric condensers

USE CAPACITORS

-cosmosUSE UNIVERSE

vii) to refer the variations in proper name terminology to onechoicee.g. -fermi diagram

-fermi-kurie plot-kurie plot

USE FERMI PLOT

There are instances other than of the types exemplified above,when a term is entered as a forbidden term either because it isconsidered too specific, or to reduce the number of precoordinateddescriptors or because it is considered poor terminology. In thesecases the forbidden term is referred, whenever possible, to a moregeneral descriptor(s) which is a correct broader term in the sense ofsection 2. below (Hierarchical relationships)

e.g. -dark repairUSE BIOLOGICAL REPAIR

-radioactive gaseous wastesUSE GASEOUS WASTESAND RADIOACTIVE WASTES

b) USED POR ReferenceThe USED FOR reference (UP) is the mandatory reciprocal of the USEreference and accompanies the descriptor to which the USE referencereferse.g. CAPACITORS

UP -electric condensers

Page 11: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

c) SEE Reference

The SEE reference is primarily intended to resolve the difficultyof homographie terms. Since the accurate indication of informationcontent and hence pertinent retrieval cannot permit more than onemeaning to a descriptor, homonyms are not allowed. The thesauruseither identifies which of the possible meanings is the only oneaccepted (e.g. by a scope note) or better the homonym is enteredas a forbidden term and the SEE...OR references invite the user toselect among the two or more alternatives the specific meaning,more clearly spelled out, which he has in mind.

e.g. -processingSEE DATA PROCESSINGOR FABRICATIONOR ORE PROCESSINGOR REPROCESSINGOR WASTE PROCESSING

In some cases the forbidden term may have only one SEE referencewhere the preferred term may or may not be appropriate.

e.g. -level densitySEE ENERGY LEVELS

d) SEEN FOR Reference

The SEEN FOR reference (SP) is the mandatory reciprocal of the SEEreference and accompanies the descriptor to which the SEE referencerefers.

e.g. FABRICATIONSF -processing

e) Alternative methods of resolving the difficulty posed by homographieterms are an extensive use of qualifiers and of scope notes. Thesetwo methods are used in the INIS Thesaurus whenever possible inpreference to the SEE reference. They are discussed in the sectiondevoted to homonyms ( See IH/l/b) .

- 10 -

Page 12: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

2. Hierarchical relationships

Hierarchical relationships are used to exhibit relative degreesof specificity within a category of descriptors all of which belong toa particular generic group. This relationship is not based upon thepossible use or application of an entity, but on the position of theconcept within a given class of concepts. Certain terras may be membersof more than one hierarchical chain. Where any hierarchy has more thantwo levels, the cross-reference for all levels are completed for eachdescriptor. All concepts which are subdivisions of a broader conceptshould form part of a hierarchical chain.

a) BROADER TERM Reference

The BROADER TERM reference (ET) is employed to refer from adescriptor representing a member of a class (or classes) of conceptsto any descriptor(s) in the thesaurus representing that class orclasses.

e.g. HARDNESSBT mechanical properties

Por each BROADER TERM reference there must also be a reciprocalNARROWER TERM reference.

b) NARROWER TERM Reference

The NARROWER TERM reference (NT) is the reciprocal of theBROADER TERM reference and is employed to refer from a descriptorsymbolizing a concept class to all descriptors symbolizing conceptsthat are members of that class.

e.g. MECHANICAL PROPERTIESNT HARDNESS

For each NARROWER TERM reference there must also be a reciprocalBROADER TERM reference.

- 11 -

Page 13: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

c) The validity of hierarchical relationships must be clearlyestablished in each instance. The criterion to be used should bethat of class membership, i.e. a valid hierarchical relationshipexists only if the NT is a subclass of the HT. It should beexplicitly said that a descriptor could be a subclass of more thanone BT1 simultaneously.

The part-whole relationship is not a legitimate hierarchicalrelationship. The class membership rule of hierarchies is waivedonly for part-whole relationships between geographical subdivisionsof a political nature and between anatomical parts.

e.g. AUSTRIABT1 europe

STCMACHBI1 gastrointestinal tract

d) Hierarchies in the terminology are not truncated but in all instancesare carried through to the highest level of generality usingterminology present in the thesaurus. Although this will lead tohigh frequency of usage figures for some descriptors as a resultof automatic generic posting, this is not an unduly adverse resultand useful retrieval can be made with the more general terms in co-ordination with specific ones.

e) Truncated display

There are a number of descriptors in the thesaurus which are in thenature of cumulative terms, each representing a class consistingof a large number of specific descriptors. Each one of thespecific descriptors is a narrower term of the cumulative descriptorin question, e.g. the descriptor RADIOISOTOPES has over l400specific radioisotopes as narrower terms. Since long lists ofnarrower terms of this type displayed with these cumulative

- 12 -

Page 14: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

descriptors are not of great usefulness to indexers but moreso to retrievers formulating query profiles, the display ofthe hierarchies is truncated in the printed version of thethesaurus. This means that whenever one of the specificnarrower terms is used in indexing the cumulative broader termis automatically added by the computerized up-posting procedureas with all other descriptors used in indexing, but the specificterms do not appear as narrower terms in the display of thecumulative term in the printed thesaurus. Those descriptorswhose hierarchical display is truncated in this way are identifiedby an asterisk (*) at the level below which no narrower terms are listedand have a scope note reading:

(For specific terms, consult the Appendix.)

The complete hierarchical display for a few descriptors whosehierarchies are truncated in the printed thesaurus as describedabove, are given with full listings of narrower terms in theAppendix to the thesaurus.

Example, thesaurus display:

* YEARS LIVING ISOTOPES(For specific terras,consult the Appendix.)

Example, thesaurus appendix display:

YEARS LIVING ISOTOPESBT1 radio!sotopesNT1 ACTINIUM 22?NT1 ALUMINIUM 26NT1 AMERICIUM 24lNT1 AMERICIUMETC.ETC.

Page 15: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

Another kind of truncation is applied to long hierarchieshaving many subclasses, each consisting of a high number ofdescriptors. In order to save space, some of these long trees havebeen truncated in the printed version of the thesaurus at theNT1, or NT2, or NT5 level, etc.

Contrary to the previously described truncation, all narrowerterms in long hierarchies are very useful to both indexers andretrievers and should thus be found in the thesaurus itself.

e.g. REACTORS*NT1 BREEDER REACTORS*NTI DUST COOLED REACTORS*NT1 ENRICHED URANIUM REACTORSETC.ETC.

In our example the narrower terms to the descriptors preceded by anasterisk (*) are not printed in the display of REACTORS. However bylooking up the descriptor marked with an asterisk in the directlisting in the thesaurus one will find all narrower terms belongingto that descriptor.

e.g. BREEDER REACTORSBT1 reactorsNT1 FER TYPE REACTORSNT2 AIPFR REACTORNT2 BELOYARSK-3 REACTORETC.ETC.

or: DUST COOLED REACTORSBT1 reactors

or: ENRICHED URANIUM REACTORSHT1 reactorsNT1 ACPR REACTORNT1 ADIEKA-1 REACTORNT1 AFSR REACTORETC.ETC.

- U -

Page 16: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

This kind of truncation can be performed on each level ofspecificity of a hierarchy.

Truncated descriptors in long hierarchies for which narrowerterms can be found in the thesaurus itself have no scope notes.

Affinitive relationships

The affinitive relationship is employed to refer from onedescriptor to others that are related in concept but are neitherconsistently hierarchically nor preferentially related.

a) RELATED TERM Reference

The RELATED TERM reference (RT) is employed as a guide froma given descriptor to other descriptors if it is believed that theuser, when examining one descriptor, may be usefully reminded of theexistence of the other.

Related term references may be used to identify:i) descriptors that are related in meaning or concept including

those in different hierarchical chains.

ii) descriptors that are near synonyms

e.g. FUEL CANSRT fuel elements

iii) descriptors that have view point interrelationships, such asrelationships based on usage

e.g. BREATHRT air

iv) descriptors representing concepts bearing a part-wholerelationship to each other.

- 15 -

Page 17: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

4. Scope notes

Scope notes may not only be used for the purpose as citedunder III/1/d of this document, but can considerably contribute toreinforcing the cross-reference structure of the thesaurus. Besidesthe definition of a term, the scope note could consist of directivesto consult other hierarchies in the thesaurus, which have developedas a result of the restriction in usage of the annotated descriptor.

e.g. BREAKDOWN(Limited to electric discharge phenomena.See also CLEAVAGE or DECOMPOSITION.)

In case a scope note is also used for cross-reference toother concepts in the thesaurus, those descriptors are printed inupper case in the text of the annotation.

- 16 -

Page 18: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

III. SELECTION OP CANDIDATE DESCRIPTORS

General

Descriptors in general consist of terms related to discreteconcepts encountered in the subject field under consideration and inpertinent marginal areas. In the INIS subject control system allindex terms are equally referred to as descriptors.

Descriptors should succintly summarize concepts in as fewwords as possible, preferably one. Grammatical connections such asprepositions or articles should be avoided whenever possible.

In evaluating the utility of candidate descriptors, referenceshould be made to their:

1) frequency as encountered in the literature or in theexisting stocks of information;

2) anticipated incidence in retrieval inquiriesj5) relationship to descriptors already accepted;4) appropriateness and authenticity as current terminology

in the discipline concerned;5) effectiveness and expediency in connoting and denoting

the particular concept.

None of these factors should be considered independently andparticular attention should be paid to areas of peripheral interestwhere the exhaustivity and specificity required of the descriptorsare not the same as for the core subject.

In all cases, descriptors should be selected for inclusionin the thesaurus on the basis of their estimated effectiveness forretrieval purposes, and their significance in the material to beindexed.

- 17 -

Page 19: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

1. Methods of avoiding ambiguity in defining descriptors

In the process of maintenance and development of the INISThesaurus, difficulties will be encountered with candidate descriptorswhich have more than one accepted meaning or whose meaning in a givencontext is different to that commonly encountered. Furthermore, itmay also happen that the same concept is written in two or moredifferent ways (synonyms) or the meaning of two candidate descriptorsmay overlap substantially (near-synonyms). In case of synonyms, nearsynonyms (quasi-synonyms) as well as antonyms a certain preferentialdescriptor will be selected always with cross-reference to thenot accepted terms (and vice versa). In such cases the requiredmeaning may be brought out by the use of the following methods:

a) Compound expressions

Although descriptors are preferably self-contained, single termconcepts, the IWIS Thesaurus makes extensive use of modifyingexpressions to define the different meanings associated with a givenconcept, whenever necessary. For the method of entering theresulting compound expression, see J/a below.

e.g. SEPARATED ORBIT CYCLOTRONSor

ABSOLUTE ZERO TEMPERATURE

b) Homonyms

Whenever possible the difficulty arising from homographie terms willbe solved by entering the homonym as a forbidden term withdifferent qualifiers each referring to different descriptors, whichmore clearly identify the different meanings.

In those cases where it is not possible to clearly differen-tiate between different meanings of a homonym by means ofqualifiers, liberal use should be made of scope notes.

- 18 -

Page 20: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

The method preferred by INIS for handling homonyms is the onedescribed under ]/e below. However entering the homonym as aforbidden term with SEE, OR references to better defined descrip-tors is not ruled out.

e.g. preferred display:

-abundance (chemical) USE CHEMICAL COMPOSITION-abundance (isotopic) USE ISOTOPE RATIO-abundance (mineral) USE ORE COMPOSITION

instead of :

-abundance SEE CHEMICAL COMPOSITIONOR ISOTOPE RATIOOR ORE COMPOSITION

c) Synonyms, quasi-synonyms and antonyms

It is rare that two or more candidate descriptors can be consideredas true synonyms. When one candidate descriptor must be searchedevery time that another is searched, they may be treated as synonyms.Descriptors that overlap significantly or represent differentaspects of the same property may be considered quasi-synonyms.Antonyms should be similarly treated. When all the synonyms,quasi-synonyms or antonyms are included in the thesaurus displaythe preferential cross-reference should be used.

The method adopted by INIS is such that the thesaurus willdisplay the preferential term as descriptor. Their synonyms, quasi-synonyms or antonyms will be included as forbidden terms withreference(s) to the descriptors which should be used in theirplace. See also: point d) Forbidden terms, page 20 of theINIS: Manual for Indexing '.

1) IAEA-INIS-12 (INIS: Manual for Indexing)

- 19 -

Page 21: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

e.g. -columbiumUSE NIOBIUM

-maxwell-boltzmann distributionUSE BOLTZMANN STATISTICS

-electric resistivityUSE ELECTRIC CONDUCTIVITY

d) Scope notes

A scope note is a brief explanation which may accompany thedescriptor in the thesaurus display, but does not form part of thedescriptor. It indicates the way in which the descriptor should beused; it need not necessarily consist of a dictionary definition.Scope notes are sometimes used to restrict the usage of a descriptor.(See II/4).

The total length of a scope note in the INI S Thesaurus islimited to a total of 9 (lines) x 28 characters (per line) includingupper and lower case shift characters. In view of its nature ascope note should be as concise and precise as possible. Only in afew exceptional cases a scope note in the INIS Thesaurus exceedsfour to five lines.

e) Qualifiers

It should be explicitly mentioned that in the INIS Thesaurus noqualifiers exist or are allowed as a part of the descriptors.However, they are allowed as a part of forbidden terms of the typementioned under point b) and c) of this chapter. Qualifiers areused extensively to solve the problem of homonym expressions.(See example under III/1/b)

- 20 -

Page 22: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

e.g. -analysis (activation) USE ACTIVATION ANALYSIS-analysis (gas) USE GAS ANALYSIS-analysis (thermal) USE THERMAL ANALYSIS

or-cortex (adrenal) USE ADRENAL GLANDS-cortex (cerebral) USE CEREBRAL CORTEX

Their purpose is to assure an appropriate cross-reference to thepreferential term (descriptor), the introduction of which wasbased on the philosophy of introducing descriptors as "directentries" into the Thesaurus as in the examples above. (See III/3/a)

2. Form of descriptors

a) Word form

Once it has been decided to include a given term in the thesaurus,care should be taken to ensure that the word form used adequatelyconveys the exact meaning intended.

i) Spelling : the most widely accepted spelling of the word shouldbe used. Cases arise, particularly in English, due to varyingusage on different sides of the Atlantic where more than onespelling of a word is accepted, in which case both forms of theword should be included in the thesaurus. In these cases thepreferential cross-reference should be employed. (See HI/3>/c)

e.g. SULFUR OR: LABELED COMPOUNDSSULPHUR LABELLED COMPOUNDS

SULPHUR written with PH could be introduced as a forbiddenterm like: -sulphur USE SULFUR (and vice versa) or provisionscould be made in the programmes for processing descriptors toautomatically correct different spelling of the same concept of

- 21 -

Page 23: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

the type as mentioned above. It may be said that they aretreated like synonyms, if the spellings in common usagediffer very much from each other. (See III/l/c)

ii) Translation: many current technical terms have arisen bytranslation from other languages, but sometimes a modernforeign language or Latin term is incorporated into the spe-cialized vocabulary for a particular subject. When both theforeign language term and its putative translation coexist,they should both be included in the thesaurus and cross-referenced preferentially.

e.g. BRAKING RADIATIONBREMSSTRAHLUNG

Technical terms as mentioned above will for INIS terminologycontrol purposes be treated like synonymous terms. The spellingmost frequently used in the scope of interest, whether theconcept is written in English or in a foreign language, willbe adopted as the preferential term for indexing the conceptin question. (See III/1/e)

The other spelling, whether in English or a translation ofthis concept, could be introduced as a forbidden term with across-reference to the preferential terme.g.

as BREMSSTRAHLUNG is the preferential term in the INISThesaurus, the cross-reference would look as follows:

-braking radiationUSE BREMSSTRAHLUNG

and

- 22 -

Page 24: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

BREMSSTRAHLUNGUP -braking radiation

iii) Transliteration : the problem is further complicated when theforeign language in question is written in a different alphabet.This is particularly true in the case of very specificdescriptors. The transliteration standards to be used forINIS descriptors and forbidden terms are the INIStTrans-literation Rules for Selected Non-Roman Characters . Wherevera choice exists, the transliteration which does not employdiacritical marks should be selected (See III/2/e).

e.g. SPUTNIK SATELLITES

b) Noun form

The descriptor should be in the form of a noun or that part of theverb which is grammatically equivalent. The gerund in English.

c) Number

In general, the plural form should be used for descriptors, parti-cularly when generic terms are involved. The singular and pluralforms of a word denote different concepts, in this case bothshould be entered.

For examples: See table No.l on page 24

1) IAEA-INIS-10 (INIS: Transliteration Rules for SelectedNon-Roman Characters)

Page 25: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

T A B L E N O . 1

TYPE OP TERM USE SINGULAR FORM USE PLURAL FORMMaterial terms,such as :chemical compoundsmixturesmaterials

When term is specificas:

UREACELLOPHANEASBESTOS

When term representsa group as :AMINESSOLVENTS

Terms representingproperties, condi-tions, characteristics

When term is specificas:VISCOSITYTEMPERATUREOPACITY

When term representsa group as :

MECHANICAL PROPERTIESTENSILE PROPERTIES

Terms representingequipment, devices,physical objects andelementary particles

Do not use singular Use plural, as:ACCELERATORSWELDSMESONSTEETHSTARS

Special purpose terms Do not use singular Use plural, as :ADHESIVASCATALYSTS

Process terms Use singular, as:CATALYSISFABRICATIONWELDINGMODULATION

Do not use plural

Proper names(A proper name isdefined as the namefor a single uniqueitem)

Use singular, as:FERMI-YANG THEORYPLUTO PLANET

Do not use plural

Disciplines, fields,subject areas

Use singular accor-ding to common usage,as:

MECHANICSHYDRAULICSBIOLOGY

Do not use plural(Words such as"hydraulics" areactually singular)

Cumulative termsrepresenting a groupof: events,occurrencesor effects

Do not use singular Use plural, as:CHEMICAL REACTIONSNUCLEAR EXPLOSIONSELECTRIC DISCHARGES

Specific phenomena,effects or events

Use singular, as:DOPPLER EFFECTALTERNATING CURRENTBOXCAR EVENT

Do not use plural

- 24 -

Page 26: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

d) Abbreviations and acronyms

Abbreviated word forms should be used only when their meaning isinternationally established. Both abbreviated and unabbreviatedforms should be displayed and cross-referenced preferentially.Sometimes the necessity for limiting the length of thedescriptor (see e) below) entails the use of less well establishedabbreviations. A scope note (see III/1/d) could be appended.

e.g. -castillejo-dalitz-dyson polesUSE CDD POLES

and: CDD POLESUP -castillejo-dalitz-dyson poles

i) Reactor abbreviations

Abbreviated reactor names accepted as descriptors for theINIS Thesaurus will always have a reciprocal cross-reference tothe full name of the reactor as forbidden term (and viceversa). This is to guarantee the indexer that the abbrevia-tion in question is not meant to be used for another reactor.If the full name of the reactor displayed as a forbiddenterm would exceed the allowed 30 characters set for thesaurusterms, (see e) below) only the first 30 characters of theforbidden term will be displayed.

e.g. -magnox type gas cooled reactor ^ 'USE MAGNOX TYPE REACTORS

ii) Abbreviation of chemical compound names

Abbreviations of long chemical compound names accepted asdescriptors for the INIS Thesaurus will always have a recip-rocal cross-reference to the not accepted full name of thecompound (and vice versa). For the rest the same explanationas above. A chemical compound name displayed in the INIS

Page 27: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

Thesaurus as a forbidden term and exceeding the allowedcharacter set could be:

(tate)e.g. -cyelohexylenedinitrilotetraaceu ^USE CDTA

The cut off name of the chemical compound will still providethe indexer or retriever with sufficient information foridentifying the utilization of the descriptor CDTA. If toomuch of a name is cut off by the JO character limitation,giving rise to possible difficulty in identifying the exactcompound, it may be useful to include that part, which wascut off in a scope note.

The above remarks also apply to acronyms.

e) Character set

The character set for printing the INIS Thesaurus is in upper andlower case letters. INIS descriptors and Narrower Terms (NT's)are printed in upper case whereas Broader Terms (BT's), RelatedTerms (RT's) and forbidden terms are printed in lower case.

The INIS Thesaurus is not only the authority for indexing andquery profiling but also the standard to be used for input processingand query profile compilation. For this purpose and for purposesof thesaurus updating and structure control, it has been stored oncomputer memory devices.

Consequently the limit set on the length of descriptors andforbidden terms is JO characters. Therefore there are exceptionalcases where descriptors or forbidden terms in the INIS Thesaurusare simply cut off after the first JO characters.

- 26 -

Page 28: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

e.g. For a descriptor:

CONTAINMENT RESEARCH INSTALLAT-ZQ

e.g. For a forbidden term:

-cyclopentanediaminetetraacetic

No diacritical marks will be allowed for in INIS descriptorsand forbidden terms. If pertinent they will be transliterated asindicated in: Appendix 1 of the IMIS: Descriptive CataloguingRules '

e.g. mossbauer effectinto MOESSBAUER EFFECT

i) Descriptors

The character set allowed for descriptors is the following:

spaceasterisk *dashRoman alphabet A-ZArabic numerals 0-9

ii) Forbidden terms

The character set allowed for forbidden terms is thefollowing :

1) IAEA-INIS-1 (INIS: Descriptive Cataloguing Rules)

Page 29: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

spaceperiodopen parenthesis (plus +asterisk *closed parenthesis )dashslash /comma ,apostrophy 'Roman alphabet A-ZArabic numerals 0-9

iii) Scope notes

The character set for scope notes is the same as for forbiddenterms plus the following special characters:

semicolon ;percent %colon :equal sign =

Furthermore both upper and lower case Roman characters areallowed.

3- Method of entering descriptors in the thesaurus

a) SyntaxCompound expressions consisting of two or more words should belisted preferably by direct entry i.e. not artificially inverted.Inverted entries may be introduced as a forbidden term referringto a direct entry as the descriptor. When a forbidden term isentered in inverted form, the qualifying expression should beenclosed in parentheses.

- 28 -

Page 30: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

e.g. -conductivity (electric)USE ELECTRIC CONDUCTIVITY

and: ELECTRIC CONDUCTIVITYUP -conductivity (electric)

or: -conductivity (thermal)USE THERMAL CONDUCTIVITY

and: THERMAL CONDUCTIVITYUP -conductivity (thermal)

b) Punctuation

Punctuation marks are not allowed in descriptors. (See III/2/e/i),Candidate descriptors exceeding 30 characters are rare, withexception of compound terms. Long compound terms however couldbe abbreviated and the abbreviation used as the preferential term(descriptor) with cross-reference to (limited by ̂ 0 characters)compound term introduced as a forbidden term. (See III/2/d/ii).

c) i) Specialized vocabularies

Certain fields have highly specific systems of nomenclature,or well-established standardized technical vocabularies.Whenever an internationally agreed nomenclature exists, it isused for standardizing INIS Terminology,

e.g. The ISO Nuclear Energy GlossaryThe ISO Standardized TerminologyThe UPAC rules

ii) Specific names

The proliferation of unrelated specific names would tend toconvert the thesaurus into a simple list of identifiers which

- 29 -

Page 31: GUIDELINES DEVELOPMENT MAINTENANCE THESAURUS · meaning of descriptors overlap substantially; where different spellings of the same word exist; for synonyms, antonyms and homonyms

would be self-defeating. It is therefore recommended, thatthe names of unrelated specific entities be avoided as muchas possible.

d) Alphabetization

The method of alphabetization of terms in the INI S Thesaurus isperformed on the basis of computer sort as follows:

spaceperiodopen parenthesis (plus +asterisk *closed parenthesis )dashslash /comma ,apostrophy 'Roman alphabet A-ZArabic numerals 0-9

Note:

The main points of this document have been introduced into theINIS: Manual for Indexing (IAEA-INIS-12 (Rev.l)).