Scholarly article on topic 'Application of the Classification and Regression Trees for Modeling the Laser Output Power of a Copper Bromide Vapor Laser'

Application of the Classification and Regression Trees for Modeling the Laser Output Power of a Copper Bromide Vapor Laser Academic research paper on "Physical sciences"

CC BY
0
0
Share paper
Academic journal
Mathematical Problems in Engineering
OECD Field of science

Academic research paper on topic "Application of the Classification and Regression Trees for Modeling the Laser Output Power of a Copper Bromide Vapor Laser"

Hindawi Publishing Corporation Mathematical Problems in Engineering Volume 2013, Article ID 654845,10 pages http://dx.doi.org/10.1155/2013/654845

Research Article

Application of the Classification and Regression Trees for Modeling the Laser Output Power of a Copper Bromide Vapor Laser

Iliycho Petkov Iliev,1 Desislava Stoyanova Voynikova,2 and Snezhana Georgieva Gocheva-Ilieva2

1 Department of Physics, Technical University of Sofia, Branch Plovdiv, 25 Tzanko Djusstabanov Street, 4000 Plovdiv, Bulgaria

2 Department of Applied Mathematics and Modeling, University of Plovdiv, 24 Tzar Assen Street, 4000 Plovdiv, Bulgaria

Correspondence should be addressed to Iliycho Petkov Iliev; iliev55@abv.bg Received 11 February 2013; Accepted 20 April 2013 Academic Editor: Bin Liu

Copyright © 2013 Iliycho Petkov Iliev et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

This study examines the available experiment data for a copper bromide vapor laser (CuBr laser), emitting in the visible spectrum at 2 wavelengths—510.6 and 578.2 nm. Laser output power is estimated based on 10 independent input parameters. The CART method is used to build a binary regression tree of solutions with respect to output power. In the case of a linear model, an approximation of 98% has been achieved and 99% for the model of interactions between predictors up to the the second order with an relative error under 5%. The resulting CART tree takes into account which input quantities influence the formation of classification groups and in what manner. This makes it possible to estimate which ones are significant from an engineering point of view for the development and operation of the considered type of lasers, thus assisting in the design and improvement of laser technology.

1. Introduction

Metal vapor lasers, including copper and copper halide lasers, have long been recognized to possess unique properties and capabilities with wide area of applications [1, 2]. They are known as the most powerful sources in the visible range (516.6 nm-578.2 nm) with coherent radiation and high beam convergence, generating at high repetition rates and high average output power. This type of devices continues to be subject of laser technology research for improved performance and for innovation. The main aspect of their development is the enhancement of the average output laser power.

This paper examines a copper bromide vapor laser which is from the class of copper halide vapor lasers. There is continuing interest in further improving the output characteristics of this laser and its applications [1, 2]. Alongside engineering design, the mathematical modeling (analytical, numerical, statistical, simulating, or other types) of laser devices is also widely applied in practice. Standard mathematical modeling includes systems of differential and integral

equations, optimization, and other mathematical methods, describing the system and allowing the calculation of solutions for the processes, occurring within the system under investigation, as well as the performance of simulations. Here, the most widely used types of models are kinetic models. These describe the particles and processes occurring in the operating laser medium. There exist a large number of such publications for metal vapor lasers, including copper bromide vapor lasers [3-5]. Although kinetic models describe the major processes within the laser medium and the interactions between particles using hundreds of equations, a general drawback of theirs is that they cannot provide a complex direct estimate of output characteristics such as the average output power, laser efficiency, and service life. Moreover, the results from the kinetic models are in the form of calculated numerical data, which need additional computer processing.

As a set off to that, during the last few years, statistical models were developed and applied on the basis of accumulated experiment data. The models are in the form of explicit statistical relationships, dependencies, and classifications of the basis laser parameters. These give the opportunity to

Figure 1: Structural diagram of the laser tube of a copper bromide vapor laser; 1—copper bromide reservoirs, 2—heat insulation of the active volume, 3—copper electrodes, 4—inner rings, and 5— mirrors.

Table 1: Technical parameters of a copper bromide vapor laser [1,6].

Characteristic Description

Emission wavelength Operating mode Pulse frequency Average volume power density Measured temperature of the wall

Pulse length Average output power Coefficient of efficiency (laser efficiency) Total service life Pulse energy Temperature of the active medium Start time

Structural elements

estimate the strength and the form of the relationship between the laser parameters. All this make it possible to direct the experiment towards increased output laser parameters and to make a preliminary estimate of experiment results using the models. Traditional parametric models of metal vapor lasers have been developed and analyzed in [6-10]. Multivariate regression with principal components analysis, hierarchical cluster analysis, factor analysis, and other statistical techniques has been used. A nonlinear model of output power has been built in [11]. Nonparametric models were obtained using the Multivariate Adaptive Regression Splines (MARS) method in [6,11]. In the recent paper [12], the models describe over 98% of experiment data with a relative accuracy comparable to that of measurements, making it possible to predict the output power of future lasers.

In this paper, another powerful nonparametric modeling method—CART (classification and regression trees)—is applied to available data for a copper bromide vapor laser. This method allows for the separation of all observations from the considered independent variables (predictors) in noninteracting groups in the form of a binary tree according to the degree of influence on the dependent variable, in this case, laser output power.

The objective of this study is to determine the influence 10 input laser characteristics (supplied electric power, geometric design of the tube, neon pressure, reservoir temperature, etc.)

on the average output power based on available experiment data. For the first time, the powerful nonparametric technique CART, described in [13,14], is applied for metal vapor lasers. The following basic problems are solved: (i) building an optimal solution regression tree; (ii) determining the adequate linear models on the basis of this tree; (iii) building a tree of second degree independent variables; (iv) using the models to estimate known experiments; (v) applying the models for experiment prediction; (vi) validation of models; (vii) comparison of results to previous parametric and nonparametric models of the same type of laser. The obtained models describe more than 98% of the data and demonstrate excellent predictive qualities. They are used to direct the construction and design of new copper bromide vapor lasers with increased output power.

The results have been obtained using the CART software package [15].

2. Subject of Investigation

The copper bromide vapor laser is an improved version of a pure copper vapor laser. It is the most powerful and effective laser in the visible spectrum demonstrating high coherence and convergence of the laser beam. We are investigating variations of this laser invented and developed at the Laboratory of Metal Vapor Lasers at the Georgi Nadjakov Institute of Solid State Physics of the Bulgarian Academy of Sciences, Sofia. The first patents related to this type oflaser are [16,17]. The copper bromide vapor laser is one of the 12 laser sources which have a wide range of applications and are commercially viable [1, 2]. The development and improvement of CuBr lasers is seen as a fundamental step in the study of copper lasers as a whole.

Copper bromide vapor lasers are sources of pulse radiation in the visible spectrum (400-720 nm) emitting at two wavelengths: green, 510.6 nm, and yellow, 578.2 nm. They are considered to be high-pulse lasers. Neon is used as a buffer gas. In order to improve efficiency, small quantities of hydrogen are added. Unlike the high-temperature pure copper vapor laser, the copper bromide vapor laser is a low-temperature one, with an active zone temperature of about 500°C. The laser tube is made out of quartz glass without high-temperature ceramics as a result of which it is significantly cheaper and easier to manufacture. The discharge is heated by electric current (self-heating laser). It produces light impulses tens of nanoseconds long. Its main advantages are short initial heating period, stable laser generation, relatively long service life, high values of output power, and laser efficiency. A simple scheme of the laser is given in Figure 1.

The specific technical parameters of the investigated copper bromide vapor lasers are given in Table 1.

3. Description of the Data

This paper takes into account the following 10 independent input variables (predictors) and one dependent variable (response)—laser output power Pout (W). The independent

510.6 and 578.2 nm Pulse-periodic, self-heating 10-125 kHz 1.4-2 W/cm3

500°C

20-50 ns 1-125W

>1000 hours 6.9 mJ

500-550°C

10-15 min

Quartz tube, outer electrodes, copper bromide reservoirs

Table 2: Descriptive statistics of CuBr laser experimental data.

Minimum Maximum Mean Std. deviation Skewness Kurtosis

Statistic Statistic Statistic Statistic Statistic Std. error Statistic Std. error

D (mm) 15.00 58.00 46.59 10.072 -0.809 0.12 1.451 0.25

DR (mm) 4.50 58.00 34.83 18.31 0.265 0.12 -1.602 0.25

PIN (kW) 1.00 5.00 2.10 1.27 1.065 0.12 -0.321 0.25

L (cm) 30.00 200.00 106.59 70.70 0.478 0.12 -1.670 0.25

PL (kW/cm) 5.00 16.67 10.92 2.51 -0.467 0.12 0.183 0.25

PH2 (torr) 0.00 0.80 0.36 0.25 -0.416 0.12 -1.430 0.25

PRF (kHz) 3.20 125.50 23.24 25.69 3.589 0.12 11.530 0.25

PNE (torr) 8.00 250.00 22.56 24.17 6.389 0.12 46.454 0.25

C(nF) 0.33 4.00 1.33 0.61 2.313 0.12 6.233 0.25

TR (°C) 350.00 590.00 478.22 23.25 -1.673 0.12 7.332 0.25

Pout (W) 0.25 120.00 34.024 35.57 0.808 0.12 -0.862 0.25

Valid N 387

variables are D (mm)—inner diameter of the laser tube, DR (mm)—inner diameter of the ring (without rings, D = D£), L (cm)—length of the active zone (distance between the electrodes), PIN (kW)—electric power supplied to the discharge, PL = P/N/L/2 (kW/cm)—electric power per unit length with 50% losses, PRF (kHz)—electric pulse repetition frequency, PNE (torr)—buffer gas pressure (neon), PH2 (torr)—pressure of the added gas (hydrogen), C (nF)— equivalent capacity of the condensation battery, and TR (°C)—temperature of copper bromide reservoirs.

The study uses the values of these variables taken from n = 387 experiments, published in [18-25]. It needs to be noted that the maximum output power achieved is Pout = 120 W in an experiment where the following values were measured for the input parameters as given previously: (58, 58, 200, 5, 12.5, 0.6,17.5, 20,1.3, and 490) [24].

The statistical summary for the whole set is given in Table 2.

It should be noted that the variables are not normally distributed, which is observed from the values of asymmetry and excess. The same is valid for the multivariate distribution of the data. For this reason, nonparametric methods which have no requirements towards the type of data distribution, both as a whole and for subsets, are more suitable.

4. Short Description of the CART Method

The CART method algorithm, as indicated by the name, solves the classification and regression problem. It was developed between 1974-1984 by Breiman et al. [13].

CART is a nonparametric solution tree technique which builds classification or regression trees depending on whether the dependent variable is categorical or numerical. In our case, this is a regression tree.

The algorithm is intended for the building of a binary solutions tree. The initial set of observations is divided into groups at the terminal nodes (leaves) of the tree. The goal is to find a tree which allows for a good distribution of the

data with the lowest possible relative error of prediction. Each branch of the tree ends with one or two terminal nodes and each observation falls into exactly one terminal node, defined by a unique set of rules.

More specifically, the objective of the regression tree approach is to distribute the data in relatively homogeneous (with minimum least squares or minimum standard deviation) terminal nodes and to obtain a mean observed value at each node in the form of a predicted value. The building of a tree starts from a root node, containing all observations. At each step (at each running node) a rule is applied to divide the set of observations within the node into two subsets (two children) according to some condition for an independent variable (predictor) Xk of the type

Xfc <dj or Xfc >0;, (1)

where 0j is the threshold value. If a given observation from the current node meets the left inequality in (1), it is classified to a group in the left child node split, and, if not it goes to the right child node split. In this way, the separation by nodes is repeated multiple times until a terminal node is reached. The general criterion for the selection of a predictor variable at each node and its threshold value is the minimum of the least squares or the minimum standard deviation from all possible predictors and all possible threshold values beginning from the current node and subset data. Defining a given node as a terminal one depends on the minimum error achieved as per a preset criterion for the minimum number of observations or some other type of restriction [26, 27]. The observations which find their way to a given tree node are defined by a series of rules of the type (1), starting at the root of the tree.

Validation is usually applied when building regression trees, since they maybe sensitive to random errors in the data. This helps diminish by "pruning" the initial tree, maintaining its regression characteristics and accuracy. In the case of fewer observations and variables, the use of the statistical method of cross-validation with V-fold is recommended. This validation technique in CART allows for the construction of

2 0.1 sV >

0 25 50 75 100 125 150 175 200 225 250 275 Number of nodes

ATM.2 ATM.50

- ATM.5 ATM_100

ATM 10 ATM.200

ATM.25

Figure 2: Diagram of the relative errors in linear models for different minimum numbers of cases (ATM) in terminal nodes.

Number of nodes

Figure 3: Curve of the relative errors of linear CART models with 10 predictors.

very reliable models superior to standard regression models. In general case, CART applies the least squares splitting rule to build the maximal tree and a cross-validation procedure to select the optimal tree.

In this study, we have used the standard 10-fold cross-validation, recommended for small samples. The data have been randomly divided into 10 equal nonintersecting subgroups, each containing approximately 10% of the dataset. The tree has been built using 9/10 of the data (learn sample) and the remaining 1/10 (test sample) have been used for prediction and to determine the level of the error. The tree construction process is repeated 10 times and the average error of the 10 series is taken as a general estimate. This procedure ensures accurate estimation of the dependent variable and allows for the tree tobe used for the classification or regression of another dataset.

The estimate y[T] for the value of the prediction in a terminal node with the number r is the mean value of all measurements for the dependent variable y, which fall within the following node:

= fk' yk 6 T- (2)

5. Linear CART Model of Output Power

First we will build and analyze a linear model, that is, where the predictors are the independent variables participating only with their first degree, as described in Section 3.

A CART model has been built in order to determine the relationship between laser output power and the 10 basis input laser variables. The minimum number of observations has been set at 10 for parent nodes and 5 for terminal nodes. It was established using a special feature Battery ATOM of the software CART [15, 28]. The comparative diagram of the relative error of the models with a given number of the terminal nodes is shown in Figure 2. It can be seen that minimum of 2 and 5 cases in the terminal node give almost equal relative error less than 2.5%.

Onemorespecific objectiveofour investigationistobuild a tree which classifies and predicts well experiments with high values of output power. For this reason, further on we will concentrate on the node which contains the highest values of output power Pout.

In order to specify the tree and its reverse prune so as to find a tree with an optimal small relative error for the data, we apply the 10-fold cross-validation procedure described in Section 4.

By setting the minimum number of the cases in the terminal nodes equal to 5 and 10 for the parent node, and setting the classification/regression criterion to least squares, an optimal regression tree is found. In practice, there exists a subset of trees that exhibit an accuracy performance statistically indistinguishable from the optimal tree. All of these models are candidates for optimal models too. This is called a "1 standard error" or 1 SE rule to identify these trees [28]. In this study we will choose the 1 SE tree that has the same performance with the optimal tree in the subtree with the maximum output power and has the simplest structure with the minimum terminal nodes.

The curve of relative errors of generated models, including the optimal model with the smallest error is shown in Figure 3. It can be seen that the optimal tree is this with 49 terminal nodes and 3.0% of relative error. After examining all other models following the 1 SE rule (visualized in green), we find a tree with 27 terminal nodes with minimum terminal nodes and the same performance in the hot spot nodes with the maximum output power Pout.

The selected regression CART model with 27 terminal nodes accounts for R2 = 98.1% of the sample following 10fold cross-validation procedure. It has a relative error 3.1%.

A detailed specific information about the hot spot node t = 22 is shown in Figure 4. This node contains the highest power values with a standard deviation STD = 9.64 and a local root mean square error RMS = 4.876. The value predicted by the regression using formula (2) is the average value of the response

PÓut[22] = 113.333 W. (3)

This approximation is within 6% relative error with respect to the maximum of experiment, and STD is relatively high, which is not sufficiently satisfactory, since it is comparable but still high with respect to the unavoidable experiment error, which is considered to be within 5-10%.

Figure 5 shows all splitters used to build the selected tree with 27 terminal nodes. For all terminal nodes, the corresponding local splitting classification rules are given in

120 -100 -80 -

3 60 -

40 -20 0 -

Node 23 C < 1.45 STD = 10.011 Avg = 104.667

W = 21.00 N = 21

Node 24 PIN < 4.75 STD = 8.46.0 Avg = 110.417 W = 12.00 N = 12

PIN > 4.75 _I_

Terminal Node 22 STD = 9.638 Avg = 113.333 W = 6.00 N = 6

Terminal

Node 23 STD = 5.963 Avg = 97.000 W = 9.00 N = 9

O yV/W

<0> mcmcs/o

co^ ofm/o

coywm.

R2 linear = 0.981

60 Pout (W)

PIN < 4.75 _I_

Terminal Node 21 STD = 5.766 Avg = 107.50 W = 6.00 N = 6

Figure 4: Specific characteristics of the nodes with maximum values of output power Pout and a hot spot terminal node 22 in a linear CART model with 10 predictors and 27 terminal nodes.

Figure 5: Distribution of splitters for each node in the regression CART model with 10 predictors and 27 terminal nodes.

Figure 6: Experiment values of Pout against predicted PredPout using the linear regression CART model with 10 predictors and 27 terminal nodes with a 5% confidence interval.

Table 3. For node 22, which is of special interest, through the cross-section of local rules, we find three variables PIN, C, and PR, limited as follows:

Node 22 : P/N > 4.75 kW,

C < 1.45 nF, 14.5 kHz < PPF < 20.5 kHz.

The overall quality of approximation with the regression tree is given in Figure 6, showing the experiment values of output power Pout against those predicted by the linear model. It can be added that the residuals of the selected model are normally distributed and no heavy tails were detected.

6. CART Model of Output Power Using up to Second Degree of Predictors

In order to build a CART tree including up to second-degree polynomials, from the 10 independent variables we form 65 predictors of the following type:

XtXj, i > j, i,j= 1,2,..., 10, (5)

where the variables, for ease of use, denote the input laser parameters given in Section 3. Analogically to the linear case, we construct the binary tree of solutions under restrictions: minimum 10 observations per parent node and a minimum of 5 for terminal nodes. The graph of distribution of the relative error for all obtained trees is given in Figure 7. It can be seen that the optimal tree with 3.8% relative error is with 62 terminal nodes.

To bring into comparison with the linear model we chose again a tree with 27 terminal nodes. It satisfies the selection criteria as in the linear case. More exactly, this model has 4.1% relative error (see Figure 7). The statistics and rules of

Mathematical Problems in Engineering Table 3: Statistics for a linear CART model of Pout with 27 terminal nodes and 10 predictors.

Terminal node Minimum Pout Maximum Pout Mean Observations Splitting classification rules

1 0.4 2.8 159 17 P7N < 1.95 DR < 30 PH2 < 0.0465 TR < 432

2 0.5 6.2 4.29 82 P7N < 1.95 DR < 30 PH2 < 0.0465 TR > 432

3 12.8 19 15.43 14 P7N < 1.95 DR < 30 PH2 > 0.0465 TR < 482.5 D < 43 PMF < 16

4 5.8 11.5 9.08 6 P7N < 1.95 PNF < 17.5 DR < 30 PH2 > 0.0465 TR < 482.5 D < 43 PMF > 16

5 1.6 10.8 6.92 5 P7N < 1.95 DR < 30 PH2 > 0.0465 PMF < 17.5 TR < 482.5 D > 43

6 5 10.9 8.33 62 P7N < 1.95 DR < 30 PH2 > 0.0465 PMF < 17.5 TR > 482.5

7 0.25 8.27 3.95 22 P7N < 1.95 DR < 30 PH2 > 0.0465 PMF > 17.5

8 16.8 32 22.94 35 P7N < 1.95 DR > 30

9 36 51.8 46 11 P7N > 1.95 P7M < 2.45

10 53 73 63.91 20 C < 1.75 P7N > 2.45 P7N < 2.85

11 70 90 80.33 6 P7N > 2.85 P7M < 3.15 C < 1.45 PRF < 16.25

12 88 92 89.2 5 P7N > 2.85 P7M < 3.15 C < 1.45 PRF > 16.25 PRF < 18

13 60 90 74.2 5 P7N > 2.85 P7M < 3.15 C < 1.45 PRF > 18

14 55 70 63.88 8 P7N > 2.85 P7M < 3.15 C > 1.45 C < 1.75

15 35 50 44.29 7 P7N > 2.45 P7M < 3.15 C > 1.75

16 64 96 80.56 9 P7N > 3.15 C < 1.75 PRF < 14.5

17 90 102 97.83 6 PRF > 14.5 PRF < 20.5 P7M > 3.15 P7M< 3.75 C< 1.15

18 80 94 88.6 5 PRF > 14.5 PRF < 20.5 P7M > 3.15 P7M< 3.75 C > 1.15 C < 1.45

19 90 104 100 6 PRF > 14.5 PRF < 20.5 C < 1.45 P7M > 3.75 P7M < 4.25

20 76 96 87.33 6 PRF > 14.5 PRF < 20.5 P7M > 3.15 P7M < 4.25 C > 1.45 C < 1.75

21 98 112 107.5 6 PRF > 14.5 PRF < 20.5 C < 1.45 P7M > 4.25 P7M < 4.75

22 94 120 113.33 6 PRF > 14.5 PRF < 20.5 C < 1.45 PIN > 4.75

23 85 102 97 9 PRF > 14.5 PRF < 20.5 P7M > 4.25 C > 1.45 C < 1.75

24 58 82 68.5 8 P7M > 3.15 C < 1.75 PRF > 20.5

25 45 76 63.38 8 C > 1.75 P7M > 3.15 P7M < 4.25

26 57 82 73.83 6 C > 1.75 P7M > 4.25 P7M < 4.75

27 63 90 80.43 7 C > 1.75 P7M > 4.75

Number of nodes

Figure 7: Relative error curve for all generated CART trees using 65 predictors from (5).

the selected tree are given in Table 4. Significant predictor variables in the model are the following 30 predictors of first and second degrees: D, DR, L, PIN, PRF, PNE, C, D • PIN, D • PH2, D • PRF, D • TR, • PIN, • PH2, • P£F, DR C, D£T£, !• TR, PIN^Pm, PINP^F, PINC, PINT^, PL P^F, PL C, PIT£, PH2 P^F, PH2 C, PH2TR, P^F C, P£F • TR, and PNF • C.

A detailed view of the hot spot nodes with the maximum values of Pout is presented in Figure 8.

The node with the highest values is number 20 (see Figure 8). The following approximation and accuracy values are achieved: the average value predicted for the leaf is

Pout[20] = 114.86 W, (6)

a standard deviation is 6.22, and RMS = 4.035 within the leaf. The model describes R2 = 98.710% of the sample. The approximation (6) is within 4% relative error with respect to the maximum of experiment and also the STD is admissible. So, the indices of this model are satisfactory with the experiment error, considered to be within 5-10%. The splitting rules for node 20 are as follows:

Node 20 : D • PIN > 86,

PIN > 3.15, P£F • C < 26.5,

P£F < 19.25,

PIN • P£F > 76.875.

Table 4: Statistics for a CART model of Pout with 27 terminal nodes and 65 predictors.

Terminal node Minimum Pout Maximum Pout Mean Observations Splitting classification rules

1 0.4 7 3.87 116 D • P7N <86 DR < 30 DR • PH2 < 1.6875

2 12.8 19 15.43 14 D • P7N < 86 DR < 30 DR • RH2 > 1.6875 D • TR < 19214 PNF < 16

3 5.8 11.5 9.08 6 D • P7N < 86 DR < 30 DR • RH2 > 1.6875 D • TR < 19214 PNF > 16

4 0.25 10.9 7.89 72 D • P7N < 86 DR < 30 DR • RH2 > 1.6875 D • TR > 19214

5 16.8 30.9 23.68 32 D • P7N <86 DR > 30 P7N • TR < 787.5

6 23 41 31.4 5 D • P7N <86 DR > 30 P7N • TR > 787.5

7 40 51 47.15 8 C < 1.75 D • P7N > 86 D • P7N < 135 P7N < 2 • 35

8 51.8 58 55.4 5 C < 1.75 D • P7N > 86 D • P7N < 135 P7N > 2.3 P7N < 3.15

9 55 72 62.6 5 P7N<3.15 C < 1.75 D • P7N > 135 P7N • PRF < 47.75 PRF < 14.75

10 63 81 71.67 12 P7N < 3.15 C < 1.75 D • P7N > 135 P7N • PRF < 47.75 PRF > 14.75 PRF < 20.25

11 88 92 89.14 7 P7M < 3.15 D • P7M > 135 P7M • PRF > 47.75 C < 1.45 PRF < 18

12 64 90 76.4 5 P7M < 3.15 D • P7M > 135 P7M • PRF > 47.75 C < 1.45 PRF > 18 PRF < 20.25

13 60 70 67.6 5 P7M < 3.15 D • P7M > 135 PRF < 20.25 P7M • PRF > 47.75 C > 1.45 C < 1.75

14 53 62 57.67 6 P7M < 3.15 C < 1.75 D.P7M > 135 PRF > 20.25

15 35 50 44.29 7 D • P7M > 86 P7M < 3.15 C > 1.75

16 64 91 78.29 8 D • P7M > 86 P7M > 3.15 PRF • C < 26.5 P7M • PRF < 56.875

17 90 104 98.23 13 D • P7M > 86 P7M > 3.15 PRF • C < 26.5 PRF < 19.25 P7M • C < 6.85

P7M • PRF > 56.875 P7M • PRF < 68.75

18 96 112 105.67 6 D • P7M > 86 P7M > 3.15 PRF • C < 26.5 PRF < 19.25 P7M • C < 6.85

P7M • PRF > 68.75 P7M • PRF < 76.875

19 80 100 ftft ft 5 D • P7M > 86 P7M > 3.15 PRF • C < 26.5 PRF < 19.25

88.8 P7M • PRF > 56.875 P7M • PRF < 76.875 P7M • C > 6.85

20 102 120 114.86 7 D PIN > 86 PIN > 3.15 PRF • C < 26.5 PRF < 19.25 PIN PRF > 76.

21 68 98 86.67 6 D • P7M > 86 P7M > 3.15 PRF • C < 26.5 P7M • PRF > 56.875 PRF > 1

22 45 80 58.5 6 D • P7M > 86 PRF • C > 26.5 P7M > 3.15 P7M < 3.75

23 96 102 99.33 6 D • P7M > 86 PRF • C > 26.5 P7M > 3.75 PRF <19 C< 1.75

24 60 76 70.67 6 D • P7M > 86 PRF • C > 26.5 P7M > 3.75 PRF <19 C> 1.75 P7M • PRF < 71

25 76 90 84.13 8 D • P7M > 86 PRF • C > 26.5 P7M > 3.75 PRF <19 C> 1.75 P7M • PRF > 71

26 70 85 77.6 5 D • P7M > 86 P7M > 3.75 PRF > 19 PRF • C > 26.5 PRF • C < 32.8

27 57 77 66.33 6 D • P7M > 86 P7M > 3.75 PRF > 19 PRF • C > 26.5 PRF • C < 32.8

The general distribution diagram of the tree splitters according to variables is shown in Figure 9.

Figure 10 compares the values of Pout to the ones predicted by the regression tree in variable PredP out „quadratic. It can be added that as in the linear case the residuals of the selected model are normally distributed and no heavy tails were detected.

7. Discussion of Results and Model Comparison

In the obtained linear model of the 10 independent physical parameters, only 6 participate in the constructed regression tree. These defining parameters are

F/N, C, FH2, F£F, and FNF. (8)

As shown in Figure 5, when the cases (experiments) are separated, three main third-level branches form, corresponding to a large degree to the three types of physical classification of copper lasers—small, medium, and large bore lasers [1]. Of the parameters (8), PIN is the most important quantity. It is the root of the tree and subsequently participates in 4 more nodes related to the classification of medium and high laser power values Pout. For lower power values (along the left end branch in Figure 5), the defining parameters are PIN, DR, PH2, and PNE. For medium power values—PIN and C. For high power, these are PIN, C, and PRF, respectively.

The analysis of the second-degree tree model (Figure 9) shows that on level three there are 4 groups, 3 of which are large, as is the case in Figure 3, but the second group of these has no continuation and practically the basic groups are 3. In this case, the defining parameters besides (8) also include D,

Node 19 P/N-PRF < 76.88 STD = 10.243 Avg = 101.903 W = 31.00 N = 31

PJN-FRF < 76.88

PJN-FRF > 76.88

Node 20 PJN-C < 6.85 STD = 7.812 Avg = 98.125 W = 24.00 N=24

PJN-C < 6.85

Node 21 PJN-FRF < 68.75 STD = 5.870 Avg = 100.579 W = 19.00 N=19

Terminal

Node 20

STD = 6.220 Avg = 114.857 W = 7.00 N=7

FJN-C > 6.85

Terminal Node 19 STD = 7.222 Avg = 88.800 W = 5.00 N = 5

PJN-FRF < 68.75 PJN-FRF > 68.75

Terminal Terminal

Node 17 Node 18

STD = 4.509 STD = 5.217

Avg = 98.231 Avg = 105.667

W = 13.00 W = 6.00

N=13 N = 6

Figure 8: Specific characteristics of the hot spot nodes with maximum values of output power Pout in a CART model with 65 predictors and 27 terminal nodes.

Figure 9: Splitters for the CART tree with 27 terminal nodes for 65 predictors.

L, and TR. Of these, D is more significant as it participates together with PIN in the root of the tree as well as in another node but not on its own. In view of the weaker prediction offered by the second-degree model, it can be concluded that

these 3 parameters are ancillary and therefore secondary in significance with regard to the classification of the sample.

After reviewing the predictive capabilities of both models, we can conclude that the linear and second-degree models

Pout (W)

Figure 10: Experiment values for Pout against PredPoutquadratic with values predicted using the second-degree 27 terminal nodes CART model with a 5% confidence interval.

are almost equivalent. They describe quite well the various groups of classified cases and predict the values for the nodes with maximum output power within a relative error less than 5%. Since the second-degree model is the same in structure and better at predicting the group of higher output power values, it is recommended for engineering applications which aim at increasing output power. However, the results of both models can be combined for experiment planning. Another important comparison can be made with the models obtained using another powerful nonparametric technique—MARS. For the same data, second-degree MARS models also concur with 98-99% of the data, but are more precise in prediction of the output laser power than the CART models (see [12]). The advantage of CART models is that they provide more accurate criteria for the classification of individual experiment groups which are of special practical use.

8. Physical Interpretation and Application of the Models

We will also discuss the influence within the models of the main parameters which define high Pout values, namely, PIN, C, and PRF.

Influence of PIN: when the supplied electric power PIN is increased, the energy of the electrons rises. This leads to a higher probability of the upper laser level being populated. Laser generation Pout increases.

Influence of C: when C goes up, the electric power supplied to the discharge increases according to the formula E = 0.5U2C, where U is the voltage between electrodes. This leads to an increase of the supplied electric power PIN in the tube and subsequently of laser generation.

Influence of PRF: when the frequency of the supply increases, the emission frequency of laser generation also goes up. The number of per unit time (1 second) laser pulses is higher which facilitates the increase of the average laser generation power.

Ensuring the combined action of these basic processes under the set conditions (8) find practical application in planning and conducting new experiments aimed at increasing the output power of a CuBr laser.

9. Conclusion

Regression models based on a CART tree, which classifies groups of similar experiments, have been built for a copper bromide vapor laser. The variables which play the main role in increasing laser output power have been identified for classified groups, as well as the intervals these should be within when conducting future studies and developing laser sources of the same type for improving laser technology.

Acknowledgment

This paper is published in cooperation with project of the Bulgarian Ministry of Education, Youth and Science, BG051PO001/3.3-05-0001 "Science and business" and financed under Operational program "Human Resources Development" by the European Social Fund.

References

[1] N. V. Sabotinov, "Metal vapor lasers," in Gas Lasers, M. Endo and R. F. Walter, Eds., pp. 449-494, CRC Press, Boca Raton, Fla, USA, 2006.

[2] P. G. Foster, Industrial applications of copper bromide laser technology [Ph.D. dissertation], University of Adelaide, School of Chemistry and Physics, Department of Physics and Mathematical Physics, Adelaide, Australia, 2005.

[3] M. J. Kushner and B. E. Warner, "Large-bore copper-vapor lasers: kinetics and scaling issues," Journal of Applied Physics, vol. 54, no. 6, pp. 2970-2982, 1983.

[4] "Numerical modeling of low-temperature plasmas," in Encyclopedia of Low-Temperature Plasma, Series B, M. Ianus, Ed., vol. 7, Moscow, Russia, 2004.

[5] A. M. Boichenko, G. S. Evtushenko, and S. N. Torgaev, "Simulation of a CuBr laser," Laser Physics, vol. 18, no. 12, pp. 1522-1525, 2008.

[6] S. G. Gocheva-Ilieva and I. P. Iliev, Statistical Models of Characteristics of Metal Vapor Lasers, Nova Science Publishers, New York, NY, USA, 2011.

[7] I. P. Iliev, S. G. Gocheva-Ilieva, D. N. Astadjov, N. P. Denev, and N. V. Sabotinov, "Statistical analysis of the CuBr laser efficiency improvement," Optics and Laser Technology, vol. 40, no. 4, pp. 641-646, 2008.

[8] I. P. Iliev, S. G. Gocheva-Ilieva, D. N. Astadjov, N. P. Denev, and N. V. Sabotinov, "Statistical approach in planning experiments with a copper bromide vapor laser," Quantum Electronics, vol. 38, no. 5, pp. 436-440, 2008.

[9] I. P. Iliev, S. G. Gocheva-Ilieva, and N. V. Sabotinov, "Classification analysis of CuBr laser parameters," Quantum Electronics, vol. 39, no. 2, pp. 143-146, 2009.

[10] S. G. Gocheva-Ilieva and I. P. Iliev, "Parametric and nonpara-metric empirical regression models: case study of copper bromide laser generation," Mathematical Problems in Engineering, vol. 2010, Article ID 697687,15 pages, 2010.

[11] S. G. Gocheva-Ilieva and I. P. Iliev, "Nonlinear regression model of copper bromide laser generation," in Proceedings of 19th International Conference on Computational Statistics (COMPSTAT '10), Y. Lechevallier and G. Saporta, Eds., pp. 1063-1070, Physica/Springer ebook, Paris, France, August 2010, http://www-roc.inria.fr/axis/COMPSTAT2010/images/ contents_ebook.pdf.

[12] I. P. Iliev, D. S. Voynikova, and S. G. Gocheva-Ilieva, "Simulation of the output power of copper bromide lasers by the MARS method," Quantum Electronics, vol. 42, no. 4, pp. 298-303,2012.

[13] L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone, Classification and Regression Trees, Wadsworth Advanced Books and Software, Belmont, Calif, USA, 1984.

[14] D. Steinberg and P. Colla, CART: Tree-Structured Non-Parametric Data Analysis, Salford Systems, San Diego, Calif, USA, 1995.

[15] CART Classification and Regression Trees. October 2012, http://www.salford-systems.com/en/products/cart.

[16] N. V. Sabotinov, P. K. Telbizov, and S. D. Kalchev, Copper bromide vapour laser. Bulgarian patent No. 28674, 1975.

[17] N. V. Sabotinov, N. K. Vuchkov, and D. N. Astadjov, "Gas laser discharge tube with copper halide vapors," United States patent 4635271, 1987.

[18] D. N. Astadjov, N. V. Sabotinov, and N. K. Vuchkov, "Effect of hydrogen on CuBr laser power and efficiency," Optics Communications, vol. 56, no. 4, pp. 279-282,1985.

[19] D. N. Astadjov, K. D. Dimitrov, C. E. Little, N. V. Sabotinov, and N. K. Vuchkov, "A CuBr laser with 1.4 W/cm3 average output power," IEEE Journal of Quantum Electronics, vol. 30, no. 6, pp. 1358-1360, 1994.

[20] V. M. Stoilov, D. N. Astaddov, N. K. Vuchkov, and N. V. Saboti-nov, "High spatial intensity 10 W-CuBr laser with hydrogen additives," Optical and Quantum Electronics, vol. 32, no. 11, pp. 1209-1217, 2000.

[21] NATO contract SfP, 97 2685,50W Copper Bromide laser, 2000.

[22] D. N. Astadjov, K. D. Dimitrov, D. R. Jones et al., "Influence on operating characteristics of scaling sealed-off CuBr lasers in active length," Optics Communications, vol. 135, no. 4-6, pp. 289-294,1997.

[23] K. D. Dimitrov and N. V. Sabotinov, "High-power and high-efficiency copper bromide vapor laser," in 9th International School on Quantum Electronics: Lasers—Physics and Applications, vol. 3052 of Proceedings ofSPIE, pp. 126-130,1996.

[24] D. N. Astadjov, K. D. Dimitrov, D. R. Jones et al., "Copper bromide laser of 120-W average output power," IEEE Journal of Quantum Electronics, vol. 33, no. 5, pp. 705-709,1997.

[25] N. P. Denev, D. N. Astadjov, and N. V. Sabotinov, "Analysis of the copper bromide laser efficiency," in Proceedings of the 4th International Symposium on Laser Technologies and Lasers, pp. 153-156, Plovdiv, Bulgaria, 2006.

[26] A. J. Izenman, Modern Multivariate Statistical Techniques Regression, Classification, and Manifold Learning, Springer, New York, NY, USA, 2008.

[27] R. Nisbet, J. Elder, and G. Miner, Handbook of Statistical Analysis and Data Mining Applications, Elsevier/Academic Press, Burlington, Mass, USA, 2009.

[28] D. Steinberg and M. Golovnya, CART6.0 USer's Guide, Salford Systems, San Diego, Calif, USA, 2006.

Copyright of Mathematical Problems in Engineering is the property of Hindawi Publishing Corporation and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use.