Table 1 Soybean sample composition

Table 1 Soybean sample composition

Detection of Roundup Ready™ Soybeans by Near-Infrared Spectroscopy

S.A. Roussel*, C.L. Hardy**, C.R. Hurburgh, Jr.*, G.R. Rippke* Iowa State University,. Ames, IA

* Grain Quality Laboratory ** Center for Crops Utilization ResearchDepartment of Agricultural & Biosystems Engineering

The controversy over genetically modified (GMO) grains has created the need for a rapid screening method for grain handlers. Maintaining purity of an otherwise indistinguishable product is not an operation familiar to commodity markets.

Near-infrared analyzers would be well suited to screening inbound deliveries, if there were sufficient spectral differences between GMO and conventional varieties. While NIR units cannot detect DNA, spectral differences may be caused by larger structural changes accompanying the modification.

Roundup Ready™ soybean (RR) was the first genetically modified grain to be widely adopted by US farmers (0% of soybean acres grown in 1996 to more than 50% in 1999).

10th International Diffuse Reflectance Conference (IDRC-2000), August 13-18th, 2000, Chambersburg, PA.

Figure 2 shows the region of greatest spectral response to be 900–950nm.

A preliminary study demonstrated a spectral difference between RR and conventional soybeans (Figure 1). Samples of known varieties from Iowa county

Additional samples (Table 1) were scanned in 1999 on near infrared spectrometers (Figure 3).

The objective of this study is to develop the protocol for distinguishing Roundup Ready™ from conventional soybeans using near infrared spectrometry.

RR Non-RRSample set(nRR/nnon-RR)

FactorAvg. Std. Dev. Avg. Std. Dev.

Moisture 11.5 0.63 10.9 0.81

Protein 37.5 0.93 36.5 0.84Sample Set I

1998 harvest samples(53/61) Oil 18.9 0.48 19.5 0.69

Moisture 11.2 0.63 10.7 1.71

Protein 36.0 1.50 35.8 1.61Sample Set II

1999 harvest, breeder, strip plot samples (308/341) Oil 18.5 0.82 18.3 1.00

Moisture 8.73 0.96 9.2 0.93

Protein 35.7 1.29 35.5 1.45Sample Set III

1999 Iowa SoybeanYield Test samples (3882/3482) Oil 18.2 0.84 18.3 0.80

Basis 13% moisture, predicted by the Infratec, calibration SB00994

After the preliminary study based on Sample Set I, the general progression of evaluation was:

1. New PLS calibrations and ANN models (Figure 5) were created using Sample Sets I & II (Table I). The constituent values as well as the spectra were used as input variables.2. PLS, ANN (Figure 5), and LWR (Figure 6) calibrations were generated using a very large database (Sample Set III in Table I).

3.2

3.3

3.4

3.5

3.6

3.7

850 870 890 910 930 950 970 990 1010 1030Wavelength (nm)

ln (1

/t)

Roundup Ready Soybeans(51 samples)

Non-Roundup Ready Soybeans

(63 samples)

Near-infrared transmittance spectra, Infratec 1229; Roundup Ready vs. Non-Roundup Ready soybeans

strip plots in 1998 comprised a calibration set of 53 RR soybeans and 61 non-RR

soybeans (Table 1). On an independent set of 39 samples, this calibration classified

RR soybeans with 85% accuracy and non-RR soybeans with 95% accuracy.

However, it gave less than 70% accuracy on the following year’s grain (1999).

-300

-200

-100

0

100

200

300

850 870 890 910 930 950 970 990 1010 1030

Wavelength (nm)

Reg

ress

ion

coef

ficie

nt v

alue Water band, 960 nm

Active Region

Regression coefficients for 1998 Roundup Ready soybean calibrations

CH-, CH2-, CH3-, R-OH activity

Figure 2

0.00

0.01

0.02

0.03

0.04

0.05

0.06

0.07

850 870 890 910 930 950 970 990 1010 1030

Wavelength (nm)

Ave

rag

e d

iffe

ren

ce, R

R -

no

n-R

R, l

n (

1/T

) u

nits

Original calibration set (n=114)

Original validation set (n=39)

Early harvest 1999 (n=43)

Wet breeder samples (n=96)

Dry breeder samples (n=270)

1999 strip plots (n=107)

Differences in NIR transmission spectra (Roundup Ready - non-Roundup Ready)

in 6 soybean sample sets, 1998-1999

Figure 4

Figure 4 shows the Infratec spectral differences between RR and non-RR samples. The direction of differences (RR higher than non-RR) remained but the magnitude of the difference was not consistent. Figure 4 indicates that a linear PLS model might not describe the differences accurately.

Figure 3Foss/Tecator Infratec 1229 Grain Analyzer

To prevent confounding, the data was balanced in other constituents (Moisture, Protein, Oil) known to affect the spectra (Table 1).

The classification accuracy of various models was compared: Partial Least Squares (PLS) (UnscramblerTM v. 7.5) Locally Weighted Regression (LWR) (MatlabTM v. 5.3) Artificial Neural Networks (ANN) (MatlabTM v. 5.3)

Materials

Methods

850 875 900 925 950 975 1000 1025 10503.4

3.45

3.5

3.55

3.6

3.65

3.7

3.75

3.8

3.85

3.9

Soybean NIR spectrum

Wavelengths (nm)

ln(1

/T)

Roundup Readysoybeans

Non Roundup Ready soybeans

Moisture

Protein

Oil

Fiber

Pri

ncip

al

Co

mp

one

nts

Figure 5: ArtificialNeural Networks

The best ANN model (Figure 5) used 25 principal components generated from the 100-wavelength spectra and 4 constituent

5 to 8 hidden nodes, and 2 outputs

values (moisture, protein, oil, and fiber) as inputs,

encoding for RR and non-RR classes.

Calibration set Predicted test set% correct

classificationModel and parameters Year (RR/non-RR) Year (RR/non-RR) RR Non-RRPLS 16LVs 66 72PLS+

a17LVs

Sample Set I & II 308 / 341 Sample Set II (cv)

308 / 34171 76

PLS 12LVs 62 53PLS+

a10LVs 59 69

ANN (25+4:8:2)b

Sample Set I & IICalibration

272 / 275Sample Set I & II

Test Set37 / 55

81 87PLS

a16LVs Sample Set III

c4302 / 3880 Sample Set III

(cv)4302 / 3880 89 87

PLS 16LVs 77 79PLS+

a15LVs 77 79

LWR 17LVs, 850ng 93 92ANN (25+4:5:2)

Sample Set III 3882 / 3482 Test Set III 420 / 398

91 89

Table 2 Roundup Ready™ soybean classification results

(cv) -Classification errors computed using cross-validation LVs - Latent variables ng - neighborsa Includes the constituents (moisture, protein, oil and fiber) as independent variablesb (25+4:8:2): Input nodes: 25 principal components + 4 constituents; hidden nodes: 8; output nodes: 2c Sample Set III and Test Set III were combined for cross-validation.

Results

300500

700900

11001300

1500

13 14 15 16 17 18 19

7

7.5

8

8.5

9

9.5

10

10.5

11

Number of neighbors

LWR prediction of the 818-sample test set

Number of latent variables

Pe

rce

nta

ge

of c

lass

ifica

tion

err

or

8.3%

7.6%

Figure 6Figure 6 shows the various parameters tested with LWR:

300 - 1500 neighbors 13 - 19 latent variables α = 0 - 1(proportion of

x- and y-distances) Spectra + constituents

or spectra onlyThe lowest classification error (7.6%) was obtained with 850 neighbors and 17 latent variables and α=0 (Figure 6).

The discrimination between RR and conventional soybeans has been achieved using non linear (ANN) and local (LWR) models a on large NIR spectral database.

The optical standardization of spectra issued from different NIR spectrometers may further improve the GMO classification.

The difference in the organic composition between RR and non-RR soybeans detected by NIR measurements must be more clearly defined.

Among the 3 models tested, Locally Weighted Regression achieved the best classification performance (92.4% accuracy). Artificial Neural Networks classified the RR soybeans nearly as accurately as did LWR. Partial Least Squares produced unsatisfactory classification accuracy.

Figure 1

Conclusions

http://www.iowagrain.org http://www.ag.iastate.edu/centers/ccur/

Table 1 Soybean sample composition

Documents

Transcript of Table 1 Soybean sample composition