Figure (S1). Distribution Pattern of the Date Samples (Soft, Semi Dry ,Dry) in the 3-Dimensional

Figure (S1). Distribution Pattern of the Date Samples (Soft, Semi Dry ,Dry) in the 3-Dimensional

Figure (S1). Distribution pattern of the date samples (soft, semi dry ,dry) in the 3-dimensional PCA-based factor space of their transmittance FT-IR spectra.

(○,●) semi-dry, (□, ■) dry ( ,▼) soft:

(A)

(B)

Figure (S2).(A) Distribution pattern of the Kabkab dates samples in different geographical region in 2-dimension PCA plot.(○,●) Samal, (□, ■) Tangestan ( ,▼) Dashtestan: (B) Distribution pattern of the Zahedi dates samples in different geographical region in 3-dimension PCA-based factor space of their transmittance FT-IR spectra (○,●) Pesh Kooh, (□, ■) Posht Kooh.Open and filled markers denote prediction and calibratioon samples, respectiely.

Figure (S3).CV classification error for class (1,2 and 3) as a function of the latent variable PLS components used in the inner PLS on the transmittance FT-IR of datesamples.

Figure (S4).Distribution pattern of the date samples (soft-semi dry and dry ) in the PLS-DA factor spaces of their transmittance FT-IR spectra.

(○,●) semi-dry, (□, ■) dry ( ,▼) soft:

Figure (S5).Distribution pattern of the 12 date varietiesin the PLS-DA factor spaces of their transmittance FT-IR spectra.

Figure (S6).Number of misclassifications as a function of the number PLS components used in the inner PLS relation of ECVA on the transmittance FT-IR data.

Figure (S7). Distribution pattern of the date samples (12 varieties) in the 2-dimensional canonical variate space of their transmittance FT-IR spectra (whole region).

Figure (S8).Distribution pattern of the wavenumber values for the transmittance FT-IR data set obtained by 4-node self -organizing map.

Figure (S9). The extended canonical weights (for first three ones) obtained from CLoVA- ECVA method, using cluster (S4,4) from network size (n=4).

Figure (S10). Distribution pattern of the date samples (soft, semi –dry and dry) in the 2-dimensional canonical variate space of their transmittance FT-IR spectra(based on cluster (S4,4) of network size (n=4).)

Table (S1).The results of PCA application on the transmittance FT-IR data matrix of all date samples.

Component / Eigen-value / % of Variance / Cumulative % of variances
1 / 220 / 62.60 / 62.60
2 / 114 / 32.18 / 94.78
3 / 11.2 / 3.09 / 97.87
4 / 3.55 / 1.00 / 98.87
5 / 1.50 / 0.42 / 99.29

Table (S2).The percentage of misclassification obtained by different chemometrics methods for discrimination of date types.

Methods
Date type / PLS-DA / ECVA / CLoVA-ECVA
Cal / Pre / Cal / Pre / Cal / Pre
Soft / 14.38 / 14.28 / 0 / 0 / 0 / 0
Semi-dry / 0 / 0 / 0.3968 / 0 / 0 / 0
Dry / 0 / 0 / 0 / 0 / 0 / 0

Table (S3).The percentage of misclassification obtained by different chemometrics methods for discrimination of 12 date varieties

Date Verities / Methods
PLS-DA / ECVA / CLoVA-ECVA
Cal / Pre / Cal / Pre / Cal / Pre
Berehi / 23.81 / 22.22 / 24.80 / 25.23 / 0 / 0
Halavi / 0 / 0 / 59.14 / 45.44 / 0 / 0
Kabkab / 80.95 / 66.67 / 0 / 0 / 0 / 0
Khanizi / 19.04 / 11.12 / 25.5 / 23.4 / 0 / 0
Karoot / 0.11 / 11.11 / 10 / 8.5 / 0 / 0
Medjool / 38.09 / 22.22 / 16.28 / 15.11 / 0 / 0
Mordasang / 76.19 / 66.67 / 0 / 0 / 0 / 0
Mazafati / 9.52 / 11.11 / 0 / 0 / 0 / 0
Piarom / 4.76 / 13.11 / 52.38 / 11.11 / 0 / 0
Rabbi / 0 / 0 / 0 / 0 / 0 / 0
Shahabi / 14.28 / 0.44 / 3.52 / 0 / 0 / 0.11
Zahedi / 14.28 / 13.11 / 0 / 0 / 0 / 0