Skip to main content

Advertisement

Transforming the canonical piecewise-linear model into a smooth-piecewise representation

Article metrics

Abstract

A smoothed representation (based on natural exponential and logarithmic functions) for the canonical piecewise-linear model, is presented. The result is a completely differentiable formulation that exhibits interesting properties, like preserving the parameters of the original piecewise-linear model in such a way that they can be directly inherited to the smooth model in order to determine their parameters, the capability of controlling not only the smoothness grade, but also the approximation accuracy at specific breakpoint locations, a lower or equal overshooting for high order derivatives in comparison with other approaches, and the additional advantage of being expressed in a reduced mathematical form with only two types of inverse functions (logarithmic and exponential). By numerical simulation examples, this proposal is verified and well-illustrated.

Background

Piecewise-linear models are widely used in diverse fields, such as circuit theory, image processing, system identification, economics and financial analysis, etc (Chua and Ying 1983; Chua and Deng 1985; Hasler and Schnetzler 1989; Yamamura and Ochiai 1992; Russo 2006; Feo and Storace 2004, 2007; Brooks 2008). The factor that prevalently motivates the use of this type of models is the simplicity of their structure which let them be efficiently implemented in both algorithms and hardware. In general, piecewise-linear models looks very appealing for graphical tasks, like curve fitting, interpolation or extrapolation, where a function is constructed to fit or determine new values within or outside the range of a discrete set of known data points (Bian and Menz 1998; Dai et al. 2007; Magnani and Boyd 2009; Misener and Floudas 2010; Jimenez-Fernandez et al. 2014). However, a notorious shortcoming can be distinguished in this type of models when function derivatives are of interest. This is because the first derivatives of piecewise-linear functions are not continuous at breakpoints and the second derivatives do not exist or are vanished inside each linear partition. This fact limits their application in that cases where derivatives are imperatively required, such as device modeling, nonlinear systems simulation, and analysis of experimental data, among others. In that regard, although there are many reported piecewise-linear models (Chua and Kang 1977; Kang and Chua 1978; Chua and Deng 1988; Kahlert and Chua 1990; Guzelis and Goknar 1991; Pospisil 1991; Kevenaar et al. 1994; Leenaerts and Van-Bokhoven 1998; Julian et al. 1999; Li et al. 2001), due to its compact formulation, the most popular is the so-called canonical piecewise-linear representation (Chua and Kang 1977) which is given by the following theorem:

Theorem 1

Any single-valued piecewise-linear function with at most \(\sigma\) breakpoints \(\beta_{1} < \beta_{2} < \ldots < \beta_{\sigma }\) , can be represented by the expression

$$y(x) = a + bx + \sum\limits_{i = 1}^{\sigma } {c_{i} \left| {x - \beta_{i} } \right|}$$
(1)

with \(b = \left( {\frac{{J^{(1)} + J^{(\sigma + 1)} }}{2}} \right),\) \(c_{i} = \left( {\frac{{J^{(i + 1)} - J^{(i)} }}{2}} \right),\) \(a = y\left( 0 \right) - \sum\nolimits_{i = 1}^{\sigma } {c_{i} |\beta_{i} |}\) for \(i = 1,2, \ldots ,\sigma\) , and \(J^{\left( i \right)}\) denoting the slope of the i-th constitutive linear segment in the piecewise-linear function.

and more generally, for n-dimensional functions, (1) takes the form

$$y\left( {\mathbf{x}} \right) = a + {\mathbf{{\rm B}x}} + \sum\limits_{i = 1}^{\sigma } {c_{i} \left| {\left\langle {{\user2{\Lambda}}^{\left( i \right)} ,{\mathbf{x}}} \right\rangle - \beta_{i} } \right|}$$
(2)

where \({\mathbf{x}}\), \({\mathbf{\rm B}}\), and \({\user2{\Lambda}}^{\left( i \right)}\) are n-dimensional vectors, \(a\), \(c_{i}\) and \(\beta_{i}\) are scalars, and “\(\left\langle , \right\rangle\)” denotes the inner product of two vectors.

As can be seen, this model is expressed by a closed formula with a minimal number of parameters. Nevertheless, due to a sum of absolute-value terms is included in (1) and (2), it is not completely differentiable.

Motivated by the fact of merging in a unique piecewise model these two fundamental characteristics: simplicity and differentiability, in this paper an algebraic transformation to smooth the canonical piecewise-linear model, is proposed. Such transformation let it obtain a new formulation which is based on natural exponential and logarithmic functions. It results in a new model that, besides of being smooth and preserving a minimum number of parameters, it makes the native piecewise-linear model completely differentiable. In this concern, it is important to mention that, in accordance with literature such lack of differentiability has been overcome by substituting the basis-function of the piecewise-linear model (in this case, the absolute-value) for its smooth approximation. Illustrative examples of this strategy can be found in (Bacon and Watts 1971; Seber and Wild 1989; Lazaro et al. 2001; Griffiths and Miller 1973), where the functions \(sign(x)\), \(\tanh (x)\), \(lch(x)\), and \(hyp(x)\) are used, respectively. Similarly, our smoothing transformation is based on the same principle but compared to those reported approaches, it reveals significant improvements, for example: (1) a better curve fitting accuracy can be achieved due to the deviation between the piecewise-linear function of reference, and the resulting smooth description, is restrictively focused at the breakpoints, (2) no additional parameters needs to be computed because it uses the same parameters of the original canonical model, and (3) the resulting smooth model exhibits a lower or equal overshooting for their derivatives. The paper is organized as follows. In section 2, the deduction of the smoothing transformation formula is explained in detail. Section 3 describes the transformation strategy by two illustrative examples (for one- and two-dimensional domains). In section 4 a comparative analysis and discussion about the curve fitting accuracy that can be achieved through the proposed transformation as well as the overshooting in derivatives, is exposed. This comparative is done among the most popular smoothing proposals reported in literature. Finally, section 5 presents the concluding remarks of this work.

Deduction of transformation formula

It follows from (Schmidt et al. 2007) that a smooth approximation for the absolute-value function can be expressed in terms of natural logarithms as

$$\left| x \right| = \frac{1}{{\alpha \ln \left( {10} \right)}}\left[ {\ln \left( {1 + e^{ - \alpha x} } \right) + \ln \left( {1 + e^{{\left( {\alpha x} \right)}} } \right)} \right]$$
(3)

Using the property \(\ln \left( {uv} \right) = \ln \left( u \right) + \ln \left( v \right)\) in (3), and simplifying the resulting algebraic expression we have

$$\left| x \right| = \frac{2}{{\alpha \ln \left( {10} \right)}}\ln \left( {e^{{\left( {\frac{\alpha }{2}x} \right)}} + e^{{\left( {\frac{ - \alpha }{2}x} \right)}} } \right)$$
(4)

After numerical simulations on (4), a slight deviation from the unity slope of the absolute-value function can be observed. In order to achieve more approximation accuracy, a constant µ is included as

$$\left| x \right| = \frac{2}{{\alpha \ln \left( {10} \right)}}\mu \ln \left( {e^{{\left( {\frac{\alpha }{2}x} \right)}} + e^{{\left( {\frac{ - \alpha }{2}x} \right)}} } \right)$$
(5)

being \(\mu = \ln \left( {10} \right)\) an appropriate fitting value.

This simplify (5) as

$$\left| x \right| = \frac{2}{\alpha }\ln \left( {e^{{\left( {\frac{\alpha }{2}x} \right)}} + e^{{\left( {\frac{ - \alpha }{2}x} \right)}} } \right)$$
(6)

Proof See Appendix A

After substituting the absolute-value function of (1) by its equivalent smooth approximation (6), it let us recast the canonical model as

$$y\left( x \right) = a + bx + \sum\limits_{i = 1}^{\sigma } {c_{i} \left( {\frac{2}{\alpha }} \right)\ln \left( {e^{{\frac{\alpha }{2}\left( {x - \beta_{i} } \right)}} + e^{{\frac{ - \alpha }{2}\left( {x - \beta_{i} } \right)}} } \right)}$$
(7)

Hence, performing an algebraic reduction of (7) yields:

$$y\left( x \right) = a + bx + \sum\limits_{i = 1}^{\sigma } {c_{i} \left( {x - \beta_{i} } \right) + \frac{2}{\alpha }\sum\limits_{i = 1}^{\sigma } {c_{i} \ln \left( {1 + e^{{ - \alpha \left( {x - \beta_{i} } \right)}} } \right)} }$$
(8)

that hereafter is denoted as the smooth-piecewise model whose parameters (\(a\),\(b\), and \(c_{i}\)) are the same as the canonical piecewise-linear model, and the parameter α is incorporated to controls the smoothness. A more formal definition for (8) is expressed by the following theorem:

Theorem 2

Any one-dimensional canonical piecewise-linear function that is characterized by L segments and σ breakpoints \(\beta_{1} < \beta_{2} < \ldots < \beta_{\sigma } ,\) can be transformed into smooth-piecewise function expressed as

$$y\left( x \right) = A + Bx + \sum\limits_{i = 1}^{\sigma } {C_{i} \ln \left( {1 + e^{{ - \alpha \left( {x - \beta_{i} } \right)}} } \right)} \quad with\,\sigma = (L - 1)$$
(9)

where the set of \(\left( {\sigma + 2} \right)\) parameters: \(\left\{ {A,B,C_{i} } \right\}\) can be determined as follows:

$$A = a - \sum\limits_{i = 1}^{\sigma } {c_{i} \beta_{i} }$$
(10)
$$B = b + \sum\limits_{i = 1}^{\sigma } {c_{i} }$$
(11)
$$C_{i} = \frac{{2c_{i} }}{\alpha }\quad for\,i = 1 \ldots \sigma$$
(12)

and the parameter α can be used to preserve a constant smoothness in all the function domain, or to define a specific smoothness grade \(\alpha_{i}\) at any i-th breakpoint location as

$$\alpha_{i} = \frac{{2c_{i} \ln \left( 2 \right)}}{\delta }$$
(13)

with δ being the deviation between the piecewise-linear and the smooth-piecewise functions at \(x = \beta_{i} .\)

Proof See Appendix B

Without loss of generality, for an n-dimensional representation of (9), a smooth transformation is derived from (2) and expressed as

$$ y\left( {\mathbf{x}} \right) = A + {{\hat{\mathbf{\rm B}}x}} \, + \sum\limits_{i = 1}^{\sigma } {C_{i} \ln \left( {1 + e^{{ - \alpha \left( {\left\langle {{\user2{\Lambda}}^{\left( i \right)} ,{\mathbf{x}}} \right\rangle - \beta_{i} } \right)}} } \right)}$$
(14)

where both parameters, \(A\) and \(C_{i}\), are calculated by using the same equations as for the one-dimensional case (Eqs. (10), (12), respectively), and \({\hat{\rm B}}\) is determined as follows:

$${{\hat{\rm B}}} = {\mathbf{\rm B}} + \sum\limits_{i = 1}^{\sigma } {c_{i} {\user2{\Lambda}}^{(i)} }$$
(15)

Proof See Appendix C

Illustrative examples

With the purpose of exploring (9), we present two application examples; the first shows how to obtain the smooth-piecewise representation of any one-dimensional function, and the second exposes a more practical case where the smoothing transformation is applied into a two-dimensional characterization curve for a n-channel MOS transistor.

Example 1

Consider any one-dimensional piecewise-linear function \(y_{PWL} \left( x \right)\) characterized by the following linear segments:

$$y_{PWL} \left( x \right) = \left\{ {\begin{array}{*{20}c} {\frac{1}{2}x} &\quad { - \infty < x < 1} \\ { - x + \frac{3}{2}} &\quad {1 < x < 2} \\ {\frac{3}{2}x - \frac{7}{2}} &\quad {2 < x < 3} \\ { - x + 4} &\quad {3 < x < 4} \\ {x - 4} &\quad {4 < x < + \infty } \\ \end{array} } \right.$$
(16)

from (16), it can be directly obtained: \(L = 5,\) \(\sigma = 4,\) \(\beta = \left\{ {1,2,3,4} \right\},\) and \(J = \left\{ { + \frac{1}{2}, - 1, + \frac{3}{2}, - 1, + 1} \right\}.\)

After substituting the slopes (\(J^{\left( i \right)}\) for \(i = 1,2, \ldots ,5\)) and breakpoints (\(\beta_{i}\) for \(i = 1,2, \ldots ,4\)) values, into the parameters of (1) and substituting in (10), (11), and (12), the smooth description (17) can be obtained.

$$\begin{aligned} y\left( x \right) &= - 4 + x + \frac{2}{\alpha }\left( { - \frac{3}{4}\ln \left( {1 + e^{{ - \alpha \left( {x - 1} \right)}} } \right)} \right) + \frac{2}{\alpha }\left( {\frac{5}{4}\ln \left( {1 + e^{{ - \alpha \left( {x - 2} \right)}} } \right)} \right) \\ & \quad - \frac{2}{\alpha }\left( {\frac{5}{4}\ln \left( {1 + e^{{ - \alpha \left( {x - 3} \right)}} } \right)} \right) + \frac{2}{\alpha }\left( {\ln \left( {1 + e^{{ - \alpha \left( {x - 4} \right)}} } \right)} \right) \end{aligned}$$
(17)

In order to exemplify how the smoothness can be controlled by fixing the value of the parameter \(\alpha\), a summary of curves (black) for \(\alpha = \left\{ {6,8,10,15} \right\}\) is reported in Fig. 1. As reference, the original piecewise-linear curve \(y_{CPWL} \left( x \right)\), derived from Theorem 1, is also included in this figure (red).

Fig. 1
figure1

Smooth-piecewise approximations of \(y_{CPWL} (x)\) for \(\alpha = \left\{ {6,8,10,15} \right\}\)

$$y_{CPWL} \left( x \right) = - 2 + \frac{3}{4}x - \frac{3}{4}\left| {x - 1} \right| + \frac{5}{4}\left| {x - 2} \right| - \frac{5}{4}\left| {x - 3} \right| + \left| {x - 4} \right|$$
(18)

From this figure it can be observed that, when the parameter α is small the smoothness of (17) is increased, in contrast, when α is greater it is decreased. From a geometrical interpretation, this means a trade-off between the deviation from the breakpoint coordinate, and the desired smoothness. In Fig. 2 the first and second derivatives of \(y(x),\) for \(\alpha = 10,\) are contrasted with the corresponding derivatives of \(y_{CPWL} (x).\) As it was expected, the first derivative for \(y_{CPWL} (x)\) yields a discontinuous step curve, and the second and higher order derivatives are always zero. In contrast, it must be highlighted the existence of the first and higher order derivatives for the smooth function.

Fig. 2
figure2

First and second derivatives for \(y_{CPWL} (x)\) and \(y(x),\) (\(\alpha = 10\))

Example 2

In order to illustrate the smoothing transformation for a two-dimensional function, the characteristic curves and equilibrium equations of a metal–oxide–semiconductor (MOS) field-effect transistor are considered. This is a four-terminal device: source (S), gate (G), drain (D), and body (B) which is used for amplifying or switching electronic signals.

Let us start considering a n-channel MOS transistor connected in the common source configuration with \(v_{1} = v_{GS} ,\) \(v_{2} = v_{DS} ,\) and \(i_{2} = i_{D} ,\) where \(v_{1}\), \(v_{2}\) are in volts, and \(i_{2}\) are in microamperes. We assume that the piecewise-linear description \(i_{2}\) follows the Shichman-Hodges model for \(k = 50\,\mu A/V^{2} ,\) \(V_{t} = 1\,V,\) \(\lambda = 0.02\,V^{ - 1} ,\) and it is expressed in the canonical form of the Chua model (Chua and Deng 1986) as follows:

$$\begin{aligned}i_{2,cpwl} &= - 12.405 + 3.286v_{1} + 71.493v_{2} + 0.438\left| {37.738v_{1} - v_{2} + 42.459} \right| \\ & \quad - 54.407\left| {0.6705v_{1} - v_{2} - 1.5385} \right| - 15.715\left| {1.043v_{1} - v_{2} - 1.3058} \right| \\ & \quad + 1.809\left| { - 21.904v_{1} - v_{2} + 54.166} \right| \end{aligned}$$
(19)

The resulting piecewise-linear characteristic \(y_{CPWL} \left( {v_{1} ,v_{2} } \right) = i_{2}\) is shown in Fig. 3.

Fig. 3
figure3

Canonical piecewise-linear approximation for the drain current \(i_{2}\)

From (19) in reference to (2) is obtained:

\(A = - 12.405,\) \({\mathbf{\rm B}} = \left[ {\begin{array}{*{20}c} {3.286} \\ {71.493} \\ \end{array} } \right],\) \({\user2{\Lambda}}^{\left( 1 \right)} = \left[ {\begin{array}{*{20}c} {37.738} \\ { - 1} \\ \end{array} } \right],\) \({\user2{\Lambda}}^{\left( 2 \right)} = \left[ {\begin{array}{*{20}c} {0.6705} \\ { - 1} \\ \end{array} } \right],\) \({\user2{\Lambda}}^{\left( 3 \right)} = \left[ {\begin{array}{*{20}c} {1.403} \\ { - 1} \\ \end{array} } \right],\) \({\user2{\Lambda}}^{\left( 4 \right)} = \left[ {\begin{array}{*{20}c} { - 21.904} \\ { - 1} \\ \end{array} } \right],\) \(c_{1} = 0.438,\) \(c_{2} = - 54.407,\) \(c_{3} = - 15.715,\) \(c_{4} = 1.809,\) \(\beta_{1} = - 42.459,\) \(\beta_{2} = 1.5385,\) \(\beta_{3} = 1.3058,\) \(\beta_{4} = - 54.166\)

After applying (10), (12) and (15) for \(\alpha = 5\), it results

$$\begin{aligned}i_{2s} &= - 233.2142 + 79.2517v_{1} + 3.618v_{2} \\ & \quad + 0.1752\ln \left( {1 + e^{{188.69v_{1} - 5v_{2} + 212.295}} } \right) - 21.7628\ln \left( {1 + e^{{3.3525v_{1} - 5v_{2} - 7.6925}} } \right) \\ & \quad - 6.286\ln \left( {1 + e^{{5.215v_{1} - 5v_{2} - 6.529}} } \right) + 0.7236\ln \left( {1 + e^{{ - 109.52v_{1} - 5v_{2} + 270.83}} } \right) \end{aligned}$$
(20)

Figure 4 shows the characteristic curve of \(i_{2s} .\)

Fig. 4
figure4

Smooth curve \(i_{2s}\) that result from the transformation of \(i_{2}\)

With the aim of estimating the difference between the piecewise-linear function \(i_{2}\) and the smooth-piecewise function \(i_{2s}\), the deviation between their characteristic curves is depicted in Fig. 5 (shadow regions). Similarly, as it happens in the one-dimensional example, the most precise curve fitting is achieved within each linear segment (in the two-dimensional case, each plane) but the deviation (controlled by the smoothing parameter α) only appears near the breakpoints.

Fig. 5
figure5

Deviation between the piecewise-linear function \(i_{2}\) (red) and the smooth-piecewise function \(i_{2s}\) (blue)

Comparative analysis and discussion

In this section, an analysis and discussion about the curve fitting accuracy and the overshooting in function derivatives due to the smooth-piecewise model (9), is outlined. In order to have a comparative reference, besides of our proposal, other smoothing alternatives are considered and illustrated.

Smooth approximation for the absolute value function

As it was exposed in section 2, the proposed smoothing strategy uses an approximation for the absolute-value function based on a natural logarithmic with a Euler’s exponential argument, for simplicity, such approximation hereafter will be denoted as \(lne\) and it is obtained by recasting (6) as follows

$$\frac{2}{\alpha }\ln \left( {e^{{\frac{\alpha }{2}x}} + e^{{\frac{ - \alpha }{2}x}} } \right) = \frac{2}{\alpha }\ln \left( {\frac{{e^{\alpha x} + 1}}{{e^{{\frac{\alpha }{2}x}} }}} \right) = \frac{2}{\alpha }\ln \left( {e^{\alpha x + 1} } \right) - \frac{2}{\alpha }\ln \left( {e^{{\frac{\alpha }{2}x}} } \right)$$

After simplifying, it results

$$lne = k_{1} \ln \left( {e^{{\frac{2x}{{k_{1} }}}} + 1} \right) - x$$
(21)

with \(k_{1} = \frac{2}{\alpha }\)

Other reported approximations are, the here denoted, \(lch\) (Lazaro et al. 2001) and \(th\) (Seber and Wild 1989). The first one based on the natural logarithm of a hyperbolic cosine argument, and the second one directly expressed in terms of a hyperbolic tangent. Both approximations are expressed as

$$lch = k_{2} \ln \left( {\cosh \left( {\frac{x}{{k_{2} }}} \right)} \right)$$
(22)
$$th = x\tanh \left( {k_{3} x} \right)$$
(23)

Figure 6 shows the absolute-value function \(abs = \left| x \right|\) and their approximations \(lne,\) \(lch,\) and \(th.\) In all cases, the same curvature (smoothness) is considered (\(k_{1} = 1\), \(k_{2} = 1,\) and \(k_{3} = 0.4\)). This fact can be graphically observed by the circles that are circumscribed at the breakpoint.

Fig. 6
figure6

Smooth approximations for \({\text{abs}} = \left| x \right|\): \(lne\) for \(k_{1} = 1\), \(lch\) for \(k_{2} = 1\), and \(th\) for \(k_{3} = 0.4\)

For this example, curve fitting deviations of \(lne\), \(lch\), and \(th\), with respect to the absolute-value function, are shown in Fig. 7 where clearly it can be seen that, in \(lch\) and \(th\), a considerable variation is presented, especially far away or in the two quadrants near the breakpoint at \(x = 0\). However, in the close proximity of this point the curve deviation is progressively minimized. In contrast, by \(lne\), the reciprocal behavior can be observed. That is to say, the main deviation is focused at \(x = 0\), and it drops to zero as the curve moves away this point.

Fig. 7
figure7

Curve fitting deviations in \(lne\), \(lch\) and \(th\), with respect to the absolute-value function. \(k_{1} = 1\), \(k_{2} = 1\), and \(k_{3} = 0.4\)

Overshooting in derivatives of the absolute value function approximations

In Figs. 8 and 9 the first and second derivatives of functions: \(lne\) for \(k_{1} = 1\), \(lch\) for \(k_{2} = 1\), and \(th\) for \(k_{3} = 0.4\) are contrasted with the corresponding derivative of absolute-value function. As it was expected, the first derivative for the absolute-value function yields a step function while their second and higher order derivatives are always zero. We can also see that, in the both cases shown in Figs. 8 and 9, derivatives of \(th\) exhibit more overshooting than \(lne\) and \(lch\). Moreover, it can be noted the same overshooting for \(lne\) and \(lch\).

Fig. 8
figure8

First derivatives: \(lne^{\prime}\) of \(lne\) for \(k_{1} = 1\), \(lch^{\prime}\) of \(lch\) for \(k_{2} = 1\), and \(th^{\prime}\) of \(th\) for \(k_{3} = 0.4\)

Fig. 9
figure9

Second derivatives: \(lne^{\prime\prime}\) of \(lne\) for \(k_{1} = 1\), \(lch^{\prime\prime}\) of \(lch\) for \(k_{2} = 1\), and \(th^{\prime\prime}\) of \(th\) for \(k_{3} = 0.4\)

Comparative example

By this example, the two previously discussed characteristics: curve fitting of breakpoints and overshooting for function derivatives, are explored. Hence, consider a piecewise-linear curve defined by two breakpoints: \(\beta = \left\{ {1,2} \right\}\), and three slopes: \(J = \left\{ {2, - 3,1} \right\}\). In accordance with (1), from these input data the canonical piecewise-linear model description is given by

$$y_{pwl} \left( x \right) = - \frac{3}{2} + \frac{3}{2}x - \frac{5}{2}\left| {x - 1} \right| + 2\left| {x - 2} \right|$$
(24)

Smoothing transformations of (24) can be now intuitively achieved by replacing the absolute-value function with any of their approximations (\(lne\), \(lch\), and \(th\)). After applying the corresponding substitutions, we obtain

$$y_{\ln e} (x) = 2x - \frac{1}{4}\ln (e^{20x - 20} + 1) + \frac{1}{5}\ln \left( {e^{20x - 40} + 1} \right)$$
(25)
$$y_{lch} (x) = - \frac{3}{2} + \frac{3}{2}x - \frac{1}{4}\ln \left( {\cosh \left( {10x - 10} \right)} \right) + \frac{1}{5}\ln \left( {\cosh \left( {10x - 20} \right)} \right)$$
(26)
$$y_{th} (x) = - \frac{3}{2} + \frac{3}{2}x - \frac{5}{2}\left( {x - 1} \right)\tanh \left( {4x - 4} \right) + 2\left( {x - 2} \right)\tanh \left( {4x - 8} \right)$$
(27)

where (25), (26), and (27), are the smooth functions that correspond to \(lne\), \(lch\), and \(th\), respectively. It is important to point that, in order to evaluate these functions under equally conditions, the same smoothness is fixed by the parameters \(k_{1} = 0.1\), \(k_{2} = 10\), and \(k_{3} = 4\). Plots for these functions are depicted in Fig. 10.

Fig. 10
figure10

Smooth-piecewise transformation for \(y_{pwl} \left( x \right)\). Functions: \(y_{lne} \left( x \right)\), \(y_{lch} \left( x \right)\) and \(y_{th} \left( x \right)\) for \(k_{1} = 0.1\), \(k_{2} = 10\), and \(k_{3} = 4\), respectively

Curve fitting deviations of (25), (26), and (27), with respect to the reference function (24) can be appreciated in Fig. 11. From this figure it can be seen that a minimum deviation corresponds to (25) and the worst case is given by (27).

Fig. 11
figure11

Curve fitting deviations of \(lne\), \(lch\) and \(th\), with respect to the absolute-value function. \(k_{1} = 0.1\), \(k_{2} = 10\), and \(k_{3} = 4\)

Figure 12 shows that similarly as it was observed in Figs. 8 and 9, the most pronounced overshooting is due to the \(th\) approximation while an equally overshooting corresponds to \(lne\) and \(lch\).

Fig. 12
figure12

Overshooting for the first derivatives of \(y_{lne} \left( x \right)\), \(y_{lch} \left( x \right)\) and \(y_{th} \left( x \right)\)

Conclusion

The proposed transformation was successfully applied to one-dimensional and two-dimensional piecewise-linear functions. By numerical simulations, it was verified that in comparison with other reported strategies, our smooth-piecewise model has important advantages, like preserving the original parameters of its native canonical piecewise-linear representation, the capability of controlling the smoothness by an artificial parameter (α), a lower or equal overshooting for derivatives, and the additional advantage of being expressed in a more reduced mathematical form with only two types of inverse functions (logarithmic and exponential).

References

  1. Bacon DW, Watts DG (1971) Estimating the transition between two intersecting straight lines. Biometrika 58(3):525–534

  2. Bian S, Menz J (1998) Using piecewise linear interpolations in spherical harmonic analysis. J Geodesy 72(7):473–481

  3. Brooks C (2008) Introductory econometrics for finance. Cambridge University Press, Cambridge, pp 462–464

  4. Chua LO, Deng AC (1985) Canonical piecewise-linear analysis-II: tracing driving-point and transfer characteristics. IEEE Trans Circuits Syst 32(5):417–444

  5. Chua LO, Deng A (1986) Canonical piecewise-linear modeling. IEEE Trans Circuits Syst 33(5):511–525

  6. Chua LO, Deng A (1988) Canonical piecewise-linear representation. IEEE T Circuits Syst 35(1):101–111

  7. Chua LO, Kang SM (1977) Section-wise piecewise-linear functions: canonical representation, properties and applications. Proc IEEE 65(6):915–929

  8. Chua LO, Ying R (1983) Canonical piecewise-linear analysis. IEEE Trans Circuits Syst 30(3):125–140

  9. Dai TS, Wang JY, Wei HS (2007) An ingenious, piecewise linear interpolation algorithm for pricing arithmetic average options. Lect Notes Comput SC 4508:262–272

  10. Feo OD, Storace M (2004) Pwl identification of dynamical systems: some examples. IEEE Int Sym Circuits Syst 4:665–668

  11. Feo OD, Storace M (2007) Piecewise-linear identification of nonlinear dynamical systems in view of their circuit implementations. IEEE Trans Circuits Syst I Regular Papers 54(7):1542–1554

  12. Griffiths DA, Miller AJ (1973) Hyperbolic regression model based on two phase piecewise-linear regression with a smooth transition between regimes. Commun Stat 2(6):561–569

  13. Guzelis G, Goknar I (1991) A canonical representation for piecewise affine maps and its application to circuit analysis. IEEE Trans Circuits Syst 38(11):1342–1354

  14. Hasler MJ, Schnetzler C (1989) On the number of solutions of piecewise-linear resistive circuits. IEEE Trans Circuits Syst 36(3):393–402

  15. Jimenez-Fernandez VM, Cerecedo-Nunez HH, Vazquez-Leal H, Beltran-Parrazal L, Filobello-Nino UA (2014) A parametric piecewise-linear approach to laser projection. Comput Appl Math 33(3):841–858

  16. Julian P, Desages A, Agamennoni O (1999) High-level canonical piecewise linear representation using a simplicial partition. IEEE Trans Circuits I Fundam Theory Appl 46(4):463–480

  17. Kahlert C, Chua LO (1990) A generalized canonical piecewise-linear representation. IEEE Trans Circuits Syst 37(3):373–383

  18. Kang SM, Chua LO (1978) A global representation of multidimensional piecewise-linear functions with linear partitions. IEEE Trans Circuits Syst 25(11):938–940

  19. Kevenaar TAM, Leenaerts DMW, van Bokhoven WMG (1994) Extensions to chua’s explicit piecewise-linear function descriptions. IEEE Trans Circuits I Fundam Theory Appl 41(4):308–314

  20. Lazaro M, Santamaria I, Pantaleon C, Sanchez AM, Puente AT, Fernandez T (2001) Smoothing the canonical piecewise-linear model: an efficient and derivable large-signal model for mesfet/hemt transistors. IEEE Trans Circuits I Fundam Theory Appl 48(2):184–192

  21. Leenaerts DMW, Van-Bokhoven WMG (1998) Piecewise linear modeling and analysis. Springer, Berlin

  22. Li X, Wang S, Yin W (2001) A canonical representation of high-dimensional continuous piecewise-linear functions. IEEE Trans Circuits I Fundam Theory Appl 48(11):1347–1351

  23. Magnani A, Boyd SP (2009) Convex piecewise-linear fitting. Optim Eng 10(1):1–17

  24. Misener R, Floudas CA (2010) Piecewise-linear approximations of multidimensional functions. J Optim Theory Appl 145(1):120–147

  25. Pospisil J (1991) Universal network realising basic piecewise-linear I/V characteristics. Electron Lett 27(5):402–404

  26. Russo F (2006) A method based on piecewise linear models for accurate restoration of images corrupted by gaussian noise. IEEE Trans Instrum Meas 55(6):1935–1943

  27. Schmidt M, Fung G, Rosales R (2007) Fast optimization methods for l1 regularization: a comparative study and two new approaches. In: Machine learning, vol 4701. Springer, Berlin, p 286–297

  28. Seber GAF, Wild CJ (1989) Nonlinear regression. Willey, New York

  29. Yamamura K, Ochiai M (1992) An efficient algorithm for finding all solutions of piecewise-linear resistive circuits. IEEE Trans Circuits-I Fundam Theory Appl 39(3):213–221

Download references

Authors’ contributions

All authors contributed extensively in the development and completion of this article. All authors read and approved the final manuscript.

Acknowledgements

The authors would like to express their sincere thanks to the anonymous referees for their valuable review of this manuscript.

Competing interests

The authors declare that they have no competing interests.

Author information

Correspondence to Victor M. Jimenez-Fernandez.

Appendices

Appendix A

Let Eq. (3) be rewritten as

$$\left| x \right| \cong h\left( { - x,\alpha } \right) + h\left( {x,\alpha } \right)$$
(28)

where

$$h\left( {x,\alpha } \right) = \frac{1}{{\alpha \ln \left( {10} \right)}}\ln \left( {1 + e^{\alpha x} } \right)$$
(29)

In Fig. 13, the graphs of \(h(x,\alpha )\) and \(h( - x,\alpha )\) for low and high values of the smoothing parameter \(\alpha\) (i.e. α = 1 and α = 100) are depicted in dashed and solid line styles, respectively. Their linear approximations are also traced in dot style.

Fig. 13
figure13

Graphs of (29) for \(\alpha = 1\) (dashed), \(\alpha = 100\) (solid) and their linear approximations (dot). a For \(x\) negative, b for \(x\) positive

From the above graphs, the following linear approximations are proposed.

$$h( - x,\alpha ) = \left\{ {\begin{array}{*{20}c} { - mx} & {\begin{array}{*{20}c} {\text{if}} & {x \le 0} \\ \end{array} } \\ 0 & {\begin{array}{*{20}c} {\text{if}} & {x > 0} \\ \end{array} } \\ \end{array} } \right. \quad h( x,\alpha ) = \left\{ {\begin{array}{*{20}c} { + mx} & {\begin{array}{*{20}c} {\text{if}} & {x \ge 0} \\ \end{array} } \\ 0 & {\begin{array}{*{20}c} {\text{if}} & {x < 0} \\ \end{array} .} \\ \end{array} } \right.$$
(30)

where the slope m is unknown.

In accordance with (30), it can be inferred that if \(h\left( {x,\alpha } \right) \cong mx\), then \(\frac{{h\left( {x,\alpha } \right)}}{mx} \cong 1\). As the same occurs with \(h\left( { - x,\alpha } \right)\) (that is, \(h\left( { - x,\alpha } \right) \cong - mx\) and \(\frac{{h\left( { - x,\alpha } \right)}}{ - mx} \cong 1\)), we consider in this proof only \(h\left( {x,\alpha } \right)\) since the final result will remain also valid for \(h\left( { - x,\alpha } \right)\).

Consequently,

$$\frac{{h\left( {x,\alpha } \right)}}{mx} \cong \frac{{\ln \left( {1 + e^{\alpha x} } \right)}}{{\alpha \ln \left( {10} \right)mx}} \cong \frac{1}{{m\ln \left( {10} \right)}}\frac{{\ln \left( {1 + e^{{\left( {\alpha x} \right)}} } \right)}}{{\left( {\alpha x} \right)}} \cong 1$$
(31)

After a change of variable \(X = \alpha x\), from (31) we have

$$\frac{h(x,\alpha )}{mx} \cong \frac{1}{m\ln (10)}\frac{{\ln \left( {1 + e^{(X)} } \right)}}{(X)} \cong 1$$
(32)

Hence, taking into account that the approximations of (30) are completely valid for huge values of α and \(x\), then an estimated value of \(\frac{{\ln (1 + e^{(X)} )}}{(X)}\) can be obtained by considering the following limit evaluation:

$$\begin{array}{*{20}c} {\lim } \\ {X \to + \infty } \\ \end{array} \frac{{\ln \left( {1 + e^{(X)} } \right)}}{(X)} \cong 1$$
(33)

From this result, Eq. (32) can be simplified as

$$\frac{h(x,\alpha )}{mx} \cong \frac{1}{m\ln (10)} \cong 1$$
(34)

where \(m\) is given by

$$m \cong \frac{1}{\ln (10)}$$
(35)

whose value deviates from the unity slope presented in the absolute-value function of reference (\(\left| x \right|\)). From here, in order to force an unity slope in both, \(h(x,\alpha )\) (for \(x \ge 0\)) and \(h( - x,\alpha )\) (for \(x \le 0\)), a compensation constant \(\mu = \ln (10)\) is included in (29) as

$$\hat{h}(x,\alpha ) = \mu h(x,\alpha ) = \frac{\mu }{\alpha \ln (10)}\ln \left( {1 + e^{\alpha x} } \right) = \frac{1}{\alpha }\ln \left( {1 + e^{\alpha x} } \right)$$
(36)

which allows recast the approximation (3) in terms of \(\hat{h}(x,\alpha )\) and \(\hat{h}( - x,\alpha )\) as follows:

$$\left| x \right| \cong \hat{h}(x,\alpha ) + \hat{h}( - x,\alpha ) \cong \frac{1}{\alpha }\left[ {\ln \left( {1 + e^{ - \alpha x} } \right) + \ln \left( {1 + e^{{ {\alpha x} }} } \right)} \right]$$
(37)

Finally, after algebraic simplifications of (37), it yields

$$\left| x \right| = \frac{2}{\alpha }\ln \left( {e^{{\left( {\frac{\alpha }{2}x} \right)}} + e^{{\left( {\frac{ - \alpha }{2}x} \right)}} } \right)$$
(38)

Appendix B

Consider that the canonical piecewise-linear function \(y_{cpwl} (x)\) and its smooth-piecewise version \(y_{s} (x)\) have the form

$$y_{cpwl} (x) = a + bx + \sum\limits_{i = 1}^{\sigma } {c_{i} \left| {x - \beta_{i} } \right|}$$
(39)

and

$$y_{s} (x) = \left( {a - \sum\limits_{i = 1}^{\sigma } {c_{i} \beta_{i} } } \right) + \left( {b + \sum\limits_{i = 1}^{\sigma } {c_{i} } } \right)x + \sum\limits_{i = 1}^{\sigma } {\left( {\frac{{2c_{i} }}{\alpha }} \right)\ln \left( {1 + e^{{ - \alpha (x - \beta_{i} )}} } \right)}$$
(40)

Suppose that we do not like that the approximation \(y{}_{s}(x)\) differs from the original function \(y_{cpwl} (x)\) too much around to any specific breakpoint \(\beta_{i}\).

That is

$$\left| {y_{cpwl} (\beta_{i} ) - y_{s} (\beta_{i} )} \right| < \delta$$
(41)

After substituting \(x = \beta_{i}\) in (39) and (40), the evaluation of (41) yields

$$\left| { - \frac{{2c_{i} \ln (2)}}{{\alpha_{i} }}} \right| < \delta \quad {\text{or}}\; \frac{{2c_{i} \ln (2)}}{{\alpha_{i} }} < \delta$$
(42)

where a value of the smoothing parameter \(\alpha_{i}\) that ensures fulfilling the condition (41) is given by

$$\alpha_{i} > \frac{{2c_{i} \ln (2)}}{\delta }$$
(43)

with \(\alpha_{i}\) being a positive number, and more generally, for a specific deviation \(\delta\), it yields

$$\alpha_{i} = \frac{{2c_{i} \ln (2)}}{\delta }\quad {\text{for}}\;i = 1 \ldots \sigma$$
(44)

Appendix C

Firstly, consider the n-dimensional form of (6) given by

$$\left| {\mathbf{x}} \right| \cong \left( {\frac{2}{\alpha }} \right)\ln \left( {e^{{\frac{\alpha }{2}\left( {\left\langle {{\user2{\Lambda}}^{\left( i \right)} ,{\mathbf{x}}} \right\rangle - \beta_{i} } \right)}} + e^{{\frac{ - \alpha }{2}\left( {\left\langle {{\user2{\Lambda}}^{\left( i \right)} ,{\mathbf{x}}} \right\rangle - \beta_{i} } \right)}} } \right)$$
(45)

then, after replacing the form of each absolute-value term of (2) by its smooth approximation (45) results

$$y\left( {\mathbf{x}} \right) = a + {\mathbf{{\rm B}x}} + \sum\limits_{i = 1}^{\sigma } {\left( {\frac{{2c_{i} }}{\alpha }} \right)\ln \left( {e^{{\frac{\alpha }{2}\left( {\left\langle {{\user2{\Lambda}}^{(i)} ,{\mathbf{x}}} \right\rangle - \beta_{i} } \right)}} + e^{{\frac{ - \alpha }{2}\left( {\left\langle {{\user2{\Lambda}}^{(i)} ,{\mathbf{x}}} \right\rangle - \beta_{i} } \right)}} } \right)}$$
(46)

for \(i = 1, \ldots ,\sigma\)

with \({\mathbf{x}} = \left[ {x_{1} , \ldots ,x_{n} } \right]\) expressed as \((1 \times n)\) matrix, and both, \({\mathbf{\rm B}} = \left[ {b_{1} , \ldots ,b_{n} } \right]^{T}\) and \({\user2{\Lambda}}^{(i)} = \left[ {\lambda_{1}^{(i)} , \ldots ,\lambda_{n}^{(i)} } \right]^{T}\) expressed as \((n \times 1)\) matrices.

After that, an algebraic expansion of the logarithm argument in (46) yields

$$y({\mathbf{x}}) = a + {\mathbf{{\rm B}x}} + \sum\limits_{i = 1}^{\sigma } {\left( {\frac{{2c_{i} }}{\alpha }} \right)\ln \left( {\frac{{1 + e^{{ - \alpha \left( {\left\langle {{\user2{\Lambda}}^{(i)} ,{\mathbf{x}}} \right\rangle - \beta_{i} } \right)}} }}{{e^{{\frac{ - \alpha }{2}\left( {\left\langle {{\user2{\Lambda}}^{(i)} ,{\mathbf{x}}} \right\rangle - \beta_{i} } \right)}} }}} \right)}$$
(47)

using the property \(\ln \left( {\frac{u}{v}} \right) = \left( {\ln (u) - \ln (v)} \right)\) and simplifying we have

$$y({\mathbf{x}}) = a + {\mathbf{{\rm B}x}} + \sum\limits_{i = 1}^{\sigma } {c_{i} \left( {\left\langle {{\user2{\Lambda}}^{(i)} ,{\mathbf{x}}} \right\rangle - \beta_{i} } \right) + \sum\limits_{i = 1}^{\sigma } {\left( {\frac{{2c_{i} }}{\alpha }} \right)\ln \left( {1 + e^{{ - \alpha \left( {\left\langle {{\user2{\Lambda}}^{(i)} ,{\mathbf{x}}} \right\rangle - \beta_{i} } \right)}} } \right)} }$$
(48)

In order to simplify (48) we collect like terms

$$y({\mathbf{x}}) = \left( {a - \sum\limits_{{i = 1}}^{\sigma } {c_{i} \beta _{i} } } \right) + \left( {{\mathbf{B + }}\sum\limits_{{i = 1}}^{\sigma } {c_{i} {\mathbf{\Lambda }}^{{(i)}} } } \right){\text{x + }}\sum\limits_{{i = 1}}^{\sigma } {\left( {\frac{{2c_{i} }}{\alpha }} \right)\ln \left( {1 + e^{{ - \alpha \left( {\left\langle {\Lambda ^{{(i)}} ,{\text{x}}} \right\rangle - \beta _{i} } \right)}} } \right)}$$
(49)

finally, in accordance to the form of (14), we obtain

\(A = a - \sum\limits_{i = 1}^{\sigma } {c_{i} \beta_{i} }\) \({{\hat{\rm B}}} = {{\rm B}} + \sum\limits_{i = 1}^{\sigma } {c_{i} {\user2{\Lambda}}^{(i)} }\) \(C_{i} = \frac{{2c_{i} }}{\alpha }\) for \(i = 1 \ldots \sigma\)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Jimenez-Fernandez, V.M., Jimenez-Fernandez, M., Vazquez-Leal, H. et al. Transforming the canonical piecewise-linear model into a smooth-piecewise representation. SpringerPlus 5, 1612 (2016) doi:10.1186/s40064-016-3278-y

Download citation

Keywords

  • Smooth
  • Canonical
  • Piecewise-linear
  • Model