Introduction

In food analysis, various analytical methods are used to determine or detect the compound characteristics of food. These methods (Jaimez et al. 2000; Nielsen 2010; Nollet 2000; Simo et al. 2005) show good precision, accuracy, and reliability. However, they demand expensive equipment, complex pre-treatment of a sample, and are time-consuming. To overcome these shortcomings, electrochemical methods combined with multivariate data analysis have emerged as rapid and promising tools for evaluation of food quality, especially in the situations where only qualitative or semi-quantitative information is required. This is the so-called electronic tongue, which is mostly for liquid analysis based on the taste sensors, and the first system was presented by Otto and Thomas (1985).

Sensors that have wide selectivity and consequently reveal overlapping signals for different species are used so the data analysis is of great significance in the system. The taste sensors proposed by Toko and his co-workers (Toko 2000) consist of potentiometric electrodes with lipid membranes (PVC membranes with lipid derivatives) and claimed to be capable of discriminating several kinds of foodstuffs such as beer, coffee, and even tomatoes (Kikkawa et al. 1993) with the help of principal component analysis (PCA). Another version of the potentiometric electronic tongue composed of solid-state crystalline ion-selective electrodes based on chalcogenide glass (Vlasov et al. 1994; Vlasov et al. 2000) was applied for the discrimination of beverages (Lvova et al. 2002). The pattern recognition block of that system involved mainly PCA and some kinds of artificial neural networks (ANN) (Vlasov et al. 1997). Winquist’s team (Winquist et al. 1997) were pioneers in the development of voltammetric electronic tongues, and the system had also been used to evaluate several food samples (Ivarsson et al. 2001; Winquist et al. 2005) under the combination of PCA. Also, with the data analysis methods of PCA and ANN, a hybrid taste sensor based on voltammetric metallic electrodes and potentiometric ion-selective electrodes was employed to classify six types of fermented milk (Winquist et al. 2000). In general, the electronic tongue has been widely used year by year, and it has played a significant role in many aspects (del Valle 2012; Escuder-Gilabert and Peris 2010; Laura Escuder-Gilaberta 2010; Riul et al. 2010).

In the classification of tea by electronic tongues, different devices have been constructed for this purpose. A comparison of different applied waveforms on discriminating green and black teas by the pulse voltammetric taste sensor was also studied (Ivarsson et al. 2001). A disposable all-solid-state potentiometric taste sensor coupled with PCA and Soft Independent Modeling of Class Analogy (SIMCA) was investigated in the identification of Korean green tea (Lvova et al. 2003). A combination of the amperometric electronic tongue and linear discriminant analysis (LDA) was utilized to evaluate five classes of tea (Scampicchio et al. 2006). In recent years, several varieties of the Longjing tea were discriminated by the electronic tongue based on multifrequency large amplitude pulse voltammetry (Tian et al. 2007), and a single batch production of Kangra orthodox black tea was classified successfully by a novel impedance tongue (Bhondekar et al. 2011). These studies mentioned above reveal that the taste sensor technique was able to differentiate tea samples, but few studies have focused on the cyclic voltammetric (CV) signals of tea infusion, in spite of cyclic voltammetry’s significant place in electrochemical analysis.

Moreover, considering that data analysis is a fundamental part of the electronic tongue system, as we have mentioned above, several pattern recognition techniques such as PCA, LDA, SIMCA, ANN (Lvova et al. 2003; Scampicchio et al. 2006; Vlasov et al. 1997; Winquist et al. 2000) have been explored for the data analysis of the sensors. In the last two decades, a classification algorithm called support vector machine (SVM) (Burges 1998; Shawe-Taylor and Cristianini 2000), which is on the basis of statistical learning theory (Smola and Schölkopf 1998; Vapnik 1998), has been proposed in the broad learning field (Brereton and Lloyd 2010; Li et al. 2009). Despite this success story, few literature dealing with the applications on SVM in the electronic tongue area have been reported (Bhondekar et al. 2011; Ciosek et al. 2006; Kumar et al. 2012; Wei and Wang 2011), furthermore, few of them have combined the cyclic voltammetric method with the utilization of SVM.

In this work, we report an electronic tongue for the classification of green and black teas with multivariate data analysis of cyclic voltammogram of a homemade metallic oxide (SnO2, ZnO, TiO2, Bi2O3) modified nickel foam electrode. Several different combinations of sensors were compared in their classification ability in this system. Using the electrochemical responses obtained from cyclic voltammetry as input information, supervised and unsupervised chemometric analysis methods, principal component analysis (PCA) was performed as the exploratory method and SVM as the classification method.

Materials and Methods

Materials

A total of 43 tea samples investigated in this work, from ten provinces in China, were all purchased in the local markets in the city of Changsha, Hunan province. These teas can be divided into two groups, 16 black teas and 27 green teas.

Sample Preparation

In the experiment, 5.00 g of tea leaves was prepared as a sample. Before data acquisition of the taste sensors, a sequence of sample preprocessing (He et al. 2009; Ivarsson et al. 2001) was carried out as follows: first, each sample was infused with 100 mL distilled boiling water for 20 min; second, the tea leaves were removed by a sieve; third, the sample liquid was cooled in a 25 °C water bath until the temperature reached 25 °C (approximately 15 min). Voltammetric measurements of the tea samples were implemented after an addition of an adequate amount of 2 mol L−1 NaOH solution to yield a final concentration of 1 mol L−1 (Novakowski et al. 2011).

Electrodes and Measurements

The four electronic electrodes were based on nickel foam (from Liyuan New Material Co., Ltd., Changsha, China), which was used as substrate (Hu Hai et al. 2007). The SEM images (obtained from JSM-6360LV scanning electron microscope) of the nickel foam is shown in Fig. 1; it is obvious that the foam has an open cell structure, in which reactants are able to diffuse inside the bulk and the sensitivity of the electrode would be improved to a large extent. The substrate was modified by adding components as nanoparticles of SnO2, ZnO, TiO2, Bi2O3 (provided by the Sinopharm Chemical Reagent Co., Ltd., Shanghai, China)—one component per sensor. More specifically, every electrode was made by mixing the metal oxide with acetylene black (from Alfa Aesar) and polytetrafluoroethylene (from Dupont de Nemours) liquid (5 wt. %) in the following weight ratios, 80:15:5, and then rolling the mix into a homogenous paste. The paste was spread onto the nickel foam collectors and finally pressed under 10 MPa.

Fig. 1
figure 1

SEM images of nickel foam

Electrochemical experiments were performed in a CHI600A electrochemical workstation (form ChenHua Instrument Co., Ltd., Shanghai, China) using a conventional three-electrode cell, which was formed by the modified nickel foam working electrode plus the saturated calomel electrode (SCE) reference electrode and a homemade platinum counter electrode. Cyclic voltammograms were registered from −0.7 up to 0.6 V at a scan rate of 0.01 V/s. The electrochemical experiments were carried out at a constant temperature (25 °C).

Electrodes were conditioned by running cyclic voltammograms in 1 mol L−1 NaOH solution to attain a stable voltammetric response before performing the measurements with tea samples. In random order, each sample was measured three times and then averaged into one plot.

Statistic Analysis

A non-supervised multivariate method, principal component analysis (PCA) (Wold et al. 1987) was used to analyze signals obtained from the electronic tongue. The normalized anodic waves constituted the input data matrix for the multivariate analysis. A PCA model was constructed using all samples. The score plot of PC1 versus PC2 was examined for clustering results relating to the two groups, green tea and black tea.

As PCA can be used to reduce the dimension of cyclic voltammogram signals of different kinds of teas, PCs which occupied 98 % of the total information were analyzed by SVM to establish the classification model for green and black tea. Support vector machine algorithm, a powerful multivariate technique based on a machine learning approach, was first introduced by Vapnik and Burges (Burges 1998; Vapnik 1998). In very simple terms, the basic concept of SVM is mapping input vectors to a higher dimensional feature space with kernel tricks, then a maximal margin separating hyperplane is constructed for classification. More details about the theory of SVM can be found in the literature (Webb 2003). Basically, for a given data set X i R N (I = 1,…, N) with corresponding labels y i (y i  = + 1 or −1, representing the two classes to be classified), SVM gives a decision function:

$$ f(x)={\displaystyle \sum_{i=1}^N{y}_i{a}_iK\left({x}_i,{x}_j\right)+b,} $$

where K(x i ,x j ) is the kernel function, x i is the input vector, parameter a i is the Lagrange multiplier called support value, b is the bias term. There are three most commonly used kernel functions, which are the linear kernel of K(x i ,x j ) = x T i x j , the polynomial kernel of K(x i ,x j ) = (γx T i x j  + r)d γ > 0 (d = 3 in this work), and the Gaussian radial basis function (RBF) kernel of K(x i ,x j ) = exp(−γx i  − x j 2), γ > 0. All of the three kernels were conducted and compared in the present investigation. In addition, when considering the trade-off between the empirical error and the regularized term, penalty parameter c should be taken into consideration. Increasing c means giving more significance to the errors on the training set in determining the optimal hyperplane and leads to the more complex network structures with a smaller separation margin. Therefore, optimization of the modeling parameters is of considerable importance for the development of SVM models. For actual implementation, we used the LIBSVM (version 2.81) package (Chang and Lin 2011), and the penalty parameter c and kernel parameter were tuned based on the training set using the grid search strategy in LIBSVM.

All computations and chemometric analysis were performed in MATLAB version 7.5.0 (MathWorks, Natick, USA).

Result and Discussion

Electrochemical Characterization of Tea Samples

Tea has many different varieties because of different processing steps. Green tea, which is brewed from unfermented leaves, contains a rich source of monomer flavanols, while full fermentation to black tea produces a series of theaflavins and thearubigins (Harbowy et al. 1997). These phenolic compounds have shown antioxidant abilities in quenching reactive oxygen species and chelating metal ions in diverse tests (Dufresne and Farnworth 2001; McKay and Blumberg 2002). On the other hand, electrochemical analysis can provide valuable information on the physicochemical characteristics of compounds and has been utilized to estimate polyphenols (Ferreira et al. 2006; Freire et al. 2003; Gay Martín et al. 2012; Ghasemi-Varnamkhasti et al. 2012; Gutés et al. 2005; Gutes et al. 2005; Gutiérrez et al. 2008; Novak et al. 2009; Roginsky et al. 2003; Wang et al. 2012). Thus, the first step in this work was the analysis of electrochemical behavior of the teas.

Several cyclic voltammograms obtained by four metal oxide-modified electrodes on two tea samples, as an example of the total 43 samples examined, are shown in Fig. 2. To a certain degree, the four electrodes have similar responses towards these two teas. For instance, most of the CV diagrams exhibit two redox processes at ca. 0.05 and 0.52 V (versus SCE) in their anodic part, and an evident evolution of the signal, increasing in peak current, is detectable over the different teas at 0.52 V. More specifically, a blank NaOH solution (1 mol L−1) gets the lowest, while green tea’s peak current is higher than that of black tea. Nevertheless, according to previously published studies (Roginsky et al. 2003), these peaks are in relation with the polyphenolic content of the samples. To be specific, different teas contain different phenolics in different contents. The response of green tea is dominated by epigallocatechin gallate (EGCG), while theaflavins make up the bulk of the phenolics in black tea; previous studies (Kilmartin and Hsu 2003; Roginsky et al. 2003) have shown that EGCG has higher intensities than theaflavins in cyclic voltammetry. Besides that, the overall response would be the sum of the various substances, and the total flavonoids’ content of green tea is higher than that of black tea. In the corresponding reverse scan, the cathodic wave of redox processes appears at 0.32 and 0.40 V (versus SCE). Though different electrodes under study show similar redox processes, their responses differ in the shapes of the diagrams and the relative intensities of the peaks due to the interactions occurring between the electrode and the compounds solved in the solution studied. Obviously, the numerical differences of peak currents in Fig. 2a and c are larger than those in Fig. 2b and d.

Fig. 2
figure 2

CV scans recorded on different kinds of teas. a SnO2-modified nickel foam electrode performed as the working electrode. b TiO2-modified nickel foam electrode performed as the working electrode. c Bi2O3-modified nickel foam electrode performed as the working electrode. d ZnO-modified nickel foam electrode performed as the working electrode

Principal Component Analysis on CV Signals

In this work, we examined the discriminative ability of different electrodes in distinguishing both green and black teas. Considering that tea infusion has a much complex composition, which may result in the overlapped information, a multivariate data analysis was brought to solve this problem. Principal component analysis (PCA) makes it possible to extract maximum variation information from the high-dimension data matrix with several principal components. PCA is one of the most common data reconstruction and dimensional reduction method, therefore, PCA was performed on the electronic data (Quansheng Chenciq* and Saritporn 2008). In the case of CV, since the anodic wave is more reproducible, the decision taken was to use the anodic wave (which presents an enhanced performance) as the input data.

The PCA score plot corresponding to the nickel foam electrode modified with SnO2 (Fig. 3a) or TiO2 (Fig. 3b) is shown. By analyzing the data exhibited in Fig. 3a, it is possible to see a slight overlap between the green and black teas, with the first two principal components using 79.2 % of the total information. A similar result can be obtained from Fig. 3b, with 87.1 % of its total information. In addition, it has to be noticed that Lapsang, one kind of black tea, appears somehow in the overlapping parts of both Fig. 3a and b.

Fig. 3
figure 3

PCA score plots (two dimensions) of teas. a SnO2-modified nickel foam electrode performed as the working electrode. b TiO2-modified nickel foam electrode performed as the working electrode. (c) Bi2O3-modified nickel foam electrode performed as the working electrode. d ZnO-modified nickel foam electrode performed as the working electrode

The PCA score plots derived from the Bi2O3 (Fig. 3c) or ZnO (Fig. 3d) modified nickel foam electrode show a similar distribution of clusters to that displayed in Fig. 3a and b, even if the results were visibly improved, with almost total separation of clusters, including the Lapsang.

Moreover, throughout these four figures, there has been an obvious tendency that the distances between black tea samples are much closer than those of green tea samples in the PCA plots. The reason accounting for this phenomenon was that the main producing areas of black tea were concentrated in three provinces while green tea samples were distributed in eight provinces in this study. Therefore, it was clear that the geographical condition had an influence on the characteristics of the teas, and similar results were reported in previous works (He et al. 2009; Ye et al. 2012).

In the above-presented results, a non-supervised multivariate technique as PCA has demonstrated that this class of electrodes possesses a valuable power for the discrimination of different teas and even portrays the samples’ regional distinctions. In particular, such electrodes have a certain degree of similarity, for example, Fig. 3a and b both identify Lapsang improperly, in contrast, Fig. 3c’s and d’s better classifications reveal the differences among these electrodes. However, considering that the PCA only provides a visualization of the grouping of samples, a mathematic classifier was needed to perform the pattern recognition. In particular, a support vector machine was built to attain a qualitative classification of the teas.

Support Vector Machine Analysis

Besides the principal component analysis, a supervised method, the support vector machine, was applied to develop the classifier model for the analyzed teas. In SVM, the holdout method of cross validation was employed to randomly divide the dataset into two parts: a training set of 60 % and a testing set of 40 %. To achieve a good discrimination performance, determination of appropriate input feature subset and optimization of SVM parameters were the crucial problems to be solved. Firstly, the input data matrices were dimension-reduced by the principal component analysis, PCs that occupied 98 % of the total information were applied as the input datasets for the SVM classifier. Secondly, a grid search technique with fivefold cross-validation was employed to obtain the optimal parameter values of the classifier. Separately, the linear kernel has no special parameters while the polynomial kernel and the Gaussian radial basis function kernel should both consider γ. As a whole, two important parameters c and γ should be taken into account. The ranges of c within (2−8–28) and γ within (2−8–28) were set. Basically, all the c or pairs of (c, γ) were conducted and the one with the best cross-validation accuracy was selected.

After the settlement of these problems, the whole training data was evaluated to generate the final classifier. To get a stable and reliable result, the whole dataset was randomly divided 50 times and calculated 50 times. Finally, the three kernel functions were all employed independently and compared to determine which kernel was the most suitable for this system.

The overall averaged result is exhibited in Fig. 4, which is a clustered bar graph wherein the identification rate is plotted with different kernel functions (no. 1 represents the linear kernel, while no. 2 and no. 3 stand for the polynomial kernel and the RBF kernel, respectively) and clustered by four metallic oxide-modified electrodes. In Fig. 4a and b, it is obvious that each of the bars attains a higher identification rate than 80 %, what’s more, the third bar which is on behalf of the bismuth oxide-modified electrode achieves the highest in all the clusters. The above results indicate that this class of cyclic voltammograms combined with advanced pattern recognition methods acquires a satisfying prediction precision and a more digital identification of different teas.

Fig. 4
figure 4

Effects of different kernel functions on identification rate for each SVM classifier. a training dataset, b testing dataset

In the mean time, the type of kernel has some impact on the structure of the obtained SVM, thus, in this work three commonly used kernels were conducted and compared, details are shown in Table 1. Besides the classification accuracies of the dataset, specificity and sensitivity values have also been calculated to provide better statistical description of the model performance (Boulesteix et al. 2008). As we have said in the above statements, Bi2O3-modified electrode gets the best performance, and from Table 1, it can be observed that the prediction performance of the SVM classifier using the polynomial kernel is slightly better than using the other two kernels, with a training accuracy of 99.70 ± 1.02 % and a testing accuracy of 98.75 ± 2.86 %.

Table 1 Results of the SVM classification with the polynomial kernel using Bi2O3-modified electrode

Data Fusion of Multi-sensors

Furthermore, to make sure whether the accumulated responses from the sensors could improve the final results, several different combinations of sensors have been conducted. As we have mentioned above, Bi2O3-modified electrode got the best performance among the four single electrodes, so it was fixed in all the array groups. The SVM classifier with polynomial kernel was employed for all the newly combined information, and the result is shown in Table 2.

Table 2 Results of the SVM classification with the polynomial kernel using different groups of electrodes

In the two electrode systems, Bi2O3 plus TiO2, SnO2, or ZnO array shows better classification accuracy than the original TiO2, SnO2, or ZnO system; however, the newly combined arrays do not get as high classification accuracies as a single Bi2O3-modified electrode gets. For the three or four electrode system, a similar result has emerged.

In total, the single Bi2O3-modified electrode got the best performance in all the groups and the combination of sensors did not show evident improvement in the results. Usually, a vast amount of data is of high complexity (Li et al. 2012), for example, there will be outliers or a large number of irrelevant variables. The existence of outliers caused by experimental errors or other uncontrolled factors would make a predictive model misleading, and a large number of irrelevant variables would even interfere with the problem under investigation. In this work, the four electrodes have gotten such a large similarity that their complementary information is limited, thus, the more redundant data may result in worse results.

The Analysis of Bi2O3-Modified Nickel Foam Electrode

The most common polyphenols in tea are catechins, which could reach concentration levels of 30 % of the dry material. In the electrochemical reaction process, the catechin is oxidized to o-quinone, the mechanism of electrochemical oxidation could be expressed as follows:

figure a

Recently, metallic oxide nanoparticle-modified electrodes have obtained much attention due to their unusual physical and chemical characteristics. Bismuth oxide, which is known as an important transition metal oxide, has been used for a range of applications (Hwang et al. 2008; Li and Yan 2009; Zidan et al. 2011, 2010) because of its special energy band gap and photoconductivity. In this system, the electrode reaction of Bi2O3 is:

$$ {\mathrm{Bi}}_2{\mathrm{O}}_3+3{\mathrm{H}}_2\mathrm{O}+6{\mathrm{e}}^{-}\iff 2\mathrm{Bi}+6{\mathrm{O}\mathrm{H}}^{-} $$

In addition, previous literature has also demonstrated that bismuth oxide-modified electrode has a special sensitivity in detecting paracetamol (Zidan et al. 2011), which is a small molecule with a skeleton of phenol. While in this work, as it has been said previously, the main difference of green and black teas is their different concentrations in phenolic compounds. Therefore, the best performance that Bi2O3-modified electrode gets among other metallic oxides may probably result from bismuth oxide-modified electrode’s good sensitivity in identifying phenol substances.

Conclusion

Cyclic voltammograms in combination with chemometric methods of PCA and SVM were applied for the discrimination of different teas. In particular, four different metallic oxide-modified electrodes were studied, among them Bi2O3-modified electrode showed the best classification capabilities. Experimental results also revealed that SVM using the polynomial kernel performed the best in obtaining a specific determination.

In conclusion, the approach presented in this work, based on the SVM network, has high potential to evaluate tea sorts. Since our sensor is easy to implement, a simple and rapid classification may be obtained at a low cost, and the method may find applications in the food industry for the identification of other products.