Approximation of curve-based sleeve functions in high dimensions

Beinert, Robert

doi:10.1007/s10444-023-10088-2

Approximation of curve-based sleeve functions in high dimensions

Open access
Published: 30 November 2023

Volume 49, article number 91, (2023)
Cite this article

Download PDF

You have full access to this open access article

Advances in Computational Mathematics Aims and scope Submit manuscript

Approximation of curve-based sleeve functions in high dimensions

Download PDF

Robert Beinert ORCID: orcid.org/0000-0002-7813-2762¹

231 Accesses
Explore all metrics

Abstract

Sleeve functions are generalizations of the well-established ridge functions that play a major role in the theory of partial differential equation, medical imaging, statistics, and neural networks. Where ridge functions are non-linear, univariate functions of the distance to hyperplanes, sleeve functions are based on the squared distance to lower-dimensional manifolds. The present work is a first step to study general sleeve functions by starting with sleeve functions based on finite-length curves. To capture these curve-based sleeve functions, we propose and study a two-step method, where first the outer univariate function—the profile—is recovered, and second, the underlying curve is represented by a polygonal chain. Introducing a concept of well-separation, we ensure that the proposed method always terminates and approximates the true sleeve function with a certain quality. Investigating the local geometry, we study an inexact version of our method and show its success under certain conditions.

Article PDF

Data Fitting on Manifolds with Composite Bézier-Like Curves and Blended Cubic Splines

Article 04 December 2018

Entropy and Sampling Numbers of Classes of Ridge Functions

Article 27 September 2014

Multipoint Estimates for Radial and Whole-Plane SLE

Article 16 March 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Data Availibility

All implementations and simulations underlying this article are publicly available at https://github.com/robertbeinert/curve-based-sleeve-functions.

References

Hinrichs, A., Novak, E., Woźniakowski, H.: The curse of dimensionality for the class of monotone functions and for the class of convex functions. J. Approx. Theory 163(8), 955–965 (2011). https://doi.org/10.1016/j.jat.2011.02.009
Article MathSciNet Google Scholar
Novak, E., Woźniakowski, H.: Approximation of infinitely differentiable multivariate functions is intractable. J. Complexity 25(4), 398–404 (2009). https://doi.org/10.1016/j.jco.2008.11.002
Article MathSciNet Google Scholar
Bellman, R.: Adaptive control processes: a guided tour. Princeton University Press, Princeton, N.J. (1961)
Book Google Scholar
John, F.: Plane waves and spherical means applied to partial differential equations. Springer, New York (1981). Reprint of the 1955 original
Logan, B.F., Shepp, L.A.: Optimal reconstruction of a function from its projections. Duke Math. J. 42(4), 645–659 (1975). https://doi.org/10.1215/S0012-7094-75-04256-8
Article MathSciNet Google Scholar
Donoho, D.L., Johnstone, I.M.: Projection-based approximation and a duality with kernel methods. Ann. Statist. 17(1), 58–106 (1989). https://doi.org/10.1214/aos/1176347004
Article MathSciNet Google Scholar
Friedman, J.H., Stuetzle, W.: Projection pursuit regression. J. Amer. Statist. Assoc. 76(376), 817–823 (1981). https://doi.org/10.2307/2287576
Article MathSciNet Google Scholar
Pinkus, A.: Approximation theory of the MLP model in neural networks. Acta Numer. 8, 143–195 (1999). https://doi.org/10.1017/S0962492900002919
Article MathSciNet Google Scholar
Ismailov, V.E.: Approximation by ridge functions and neural networks with a bounded number of neurons. Appl. Anal. 94(11), 2245–2260 (2015). https://doi.org/10.1080/00036811.2014.979809
Article MathSciNet Google Scholar
Candés, E.J.: Harmonic analysis of neural networks. Appl. Comput. Harmon. Anal. 6(2), 197–218 (1999). https://doi.org/10.1006/acha.1998.0248
Article MathSciNet Google Scholar
Jorgensen, P., Stewart, D.E.: Approximation properties of ridge functions and extreme learning machines. SIAM J. Math. Data Sci. 3 (2021). https://doi.org/10.1137/20M1356348
Petrushev, P.P.: Approximation by ridge functions and neural networks. SIAM J. Math. Anal. 30(1), 155–189 (1999). https://doi.org/10.1137/S0036141097322959
Article MathSciNet Google Scholar
Xie, T.F., Cao, F.L.: The ridge function representation of polynomials and an application to neural networks. Acta Math. Sin. (Engl. Ser.) 27(11), 2169–2176 (2011). https://doi.org/10.1007/s10114-011-9407-1
Aliev, R.A., Asgarova, A.A., Ismailov, V.E.: A note on continuous sums of ridge functions. J. Approx. Theory 237, 210–221 (2019). https://doi.org/10.1016/j.jat.2018.09.006
Article MathSciNet Google Scholar
Konovalov, V.N., Kopotun, K.A., Maiorov, V.E.: Convex polynomial and ridge approximation of Lipschitz functions in Rd. Rocky Mountain J. Math. 40(3), 957–976 (2010). https://doi.org/10.1216/RMJ-2010-40-3-957
Article MathSciNet Google Scholar
Kroó, A.: On approximation by ridge functions. Constr. Approx. 13(4), 447–460 (1997). https://doi.org/10.1007/s003659900053
Article MathSciNet Google Scholar
Maiorov, V.E.: On best approximation by ridge functions. J. Approx. Theory 99(1), 68–94 (1999). https://doi.org/10.1006/jath.1998.3304
Article MathSciNet Google Scholar
Maiorov, V.: Geometric properties of the ridge function manifold. Adv. Comput. Math. 32(2), 239–253 (2010). https://doi.org/10.1007/s10444-008-9106-3
Article MathSciNet Google Scholar
Lin, V.Y., Pinkus, A.: Fundamentality of ridge functions. J. Approx. Theory 75(3), 295–311 (1993). https://doi.org/10.1006/jath.1993.1104
Article MathSciNet Google Scholar
DeVore, R., Petrova, G., Wojtaszczyk, P.: Approximation of functions of few variables in high dimensions. Constr. Approx. 33(1), 125–143 (2011). https://doi.org/10.1007/s00365-010-9105-8
Article MathSciNet Google Scholar
Cohen, A., Daubechies, I., DeVore, R., Kerkyacharian, G., Picard, D.: Capturing ridge functions in high dimensions from point queries. Constr. Approx. 35(2), 225–243 (2012). https://doi.org/10.1007/s00365-011-9147-6
Article MathSciNet Google Scholar
Fornasier, M., Schnass, K., Vybiral, J.: Learning functions of few arbitrary linear parameters in high dimensions. Found. Comput. Math. 12(2), 229–262 (2012). https://doi.org/10.1007/s10208-012-9115-y
Article MathSciNet Google Scholar
Kolleck, A., Vybiral, J.: On some aspects of approximation of ridge functions. J. Approx. Theory 194, 35–61 (2015). https://doi.org/10.1016/j.jat.2015.01.003
Article MathSciNet Google Scholar
Tyagi, H., Cevher, V.: Learning ridge functions with randomized sampling in high dimensions. In: Proceedings of the ICASSP (25-30 March 2012, Kyoto, Japan), pp. 2025–2028 (2012). https://doi.org/10.1109/ICASSP.2012.6288306. IEEE
Mayer, S., Ullrich, T., Vybiral, J.: Entropy and sampling numbers of classes of ridge functions. Constr. Approx. 42(2), 231–264 (2015). https://doi.org/10.1007/s00365-014-9267-x
Article MathSciNet Google Scholar
Keiper, S.: Approximation of generalized ridge functions in high dimensions. J. Approx. Theory 245, 101–129 (2019). https://doi.org/10.1016/j.jat.2019.04.006
Article MathSciNet Google Scholar
Rockafellar, R., Wets, R.J.-B.: Variational analysis. Grundlehren der mathematischen Wissenschaften. A Series of Comprehensive Studies in Mathematics, vol. 317. Springer, Dortrecht (2009). https://doi.org/10.1007/978-3-642-02431-3
Dudek, E., Holly, K.: Nonlinear orthogonal projection. Ann. Polon. Math. 59(1), 1–31 (1994). https://doi.org/10.4064/ap-59-1-1-31
Article MathSciNet Google Scholar
Hastie, T.: Principal curves and surfaces. Technical Report 11 (AD-A148 833), Laboratory for Computational Statistics, Department of Statistics and Computational Group, Stanford Liniear Accelerator Center, Stanford University, Stanford (November 1984)
Hastie, T., Stuetzle, W.: Principal curves. J. Amer. Statist. Assoc. 84(406), 502–516 (1989). https://doi.org/10.2307/2289936
Article MathSciNet Google Scholar
Binev, P., Dahmen, W., DeVore, R., Dyn, N.: Adaptive approximation of curves. Preprint Series of the Interdisciplinary Mathematics Institute, University of South Carolina– http://imi.cas.sc.edu/papers/86/ (2004)
Mollweide, K.B.: Zusätze zur ebenen und sphärischen Trigonometrie. Mon. Corresp. Befoerd. Erd Himmelskunde 18, 394–400 (1808)
Google Scholar
Hämmerlin, G., Hoffmann, K.-H.: Numerical mathematics. Undergraduate Texts in Mathematics. Springer, New York (1991). https://doi.org/10.1007/978-1-4612-4442-4
Song, H.-C., Xu, X., Shi, K.-L., Yong, J.-H.: Projecting points onto planar parametric curves by local biarc approximation. Comput. Graphics 38, 183–190 (2014). https://doi.org/10.1016/j.cag.2013.10.033
Article Google Scholar
Hu, S.-M., Wallner, J.: A second order algorithm for orthogonal projection onto curves and surfaces. Comput. Aided Geom. Design 22(3), 251–260 (2005). https://doi.org/10.1016/j.cagd.2004.12.001
Article MathSciNet Google Scholar
Limaiem, A., Trochu, F.: Geometric algorithms for the intersection of curves and surfaces. Comput. & Graphics 19(3), 391–403 (1995). https://doi.org/10.1016/0097-8493(95)00009-2
Article Google Scholar
Liang, J., Hou, L., Li, X., Pan, F., Cheng, T., Wang, L.: Hybrid second order method for orthogonal projection onto parametric curve in n-dimensional Euclidean space. Mathematics 6(12) (2018). https://doi.org/10.3390/math6120306
Allard, W.K., Chen, G., Maggioni, M.: Multi-scale geometric methods for data sets II: geometric multi-resolution analysis. Appl. Comput. Harmon. Anal. 32(3), 435–462 (2012). https://doi.org/10.1016/j.acha.2011.08.001
Article MathSciNet Google Scholar
Dontchev, A.L., Rockafellar, R.T.: Implicit functions and solution mappings. Springer Monographs in Mathematics. Springer, Dordrecht (2009). https://doi.org/10.1007/978-0-387-87821-8
Sard, A.: The measure of the critical values of differentiable maps. Bull. Amer. Math. Soc. 48(12), 883–890 (1942). https://doi.org/10.1090/S0002-9904-1942-07811-6
Article MathSciNet Google Scholar

Download references

Acknowledgements

The author is especially grateful to Sandra Keiper, the author of [26], for many fruitful discussions and for drawing my attention to the topic of generalized ridge and sleeve functions.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Institut für Mathematik, Technische Universität Berlin, Straße des 17. Juni 136, Berlin, 10623, Germany
Robert Beinert

Authors

Robert Beinert
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Robert Beinert.

Ethics declarations

Conflict of interest

The author declares no competing interests.

Additional information

Communicated by: Holger Rauhut

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: The set of ambiguity points

In this appendix, we prove Theorem 5 for merely twice differentiable curves. Note that the proof of the original statement in [30, Prop 6] requires that $\gamma $ is infinitely often differentiable. To study the set of ambiguity points, we use that the projection onto a Jordan $C^2$-curve is differentiable for most unambiguity points. These results can be found in [28], and we state it for our specific setting with $C^2$-curves.

Theorem 20

(Dudek–Holly [28, Thm 4.1]) Let $\gamma $ be a Jordan $C^2$-arc, and let $x \in \mathbb {R}^d$ be a point within an open neighbourhood where the projection is single-valued. If ${{\,\textrm{proj}\,}}_\gamma (x)$ is an inner point, then ${{\,\textrm{proj}\,}}_\gamma $ is differentiable at x.

The restriction to a point that is projected to an inner point is crucial since the projection becomes undifferentiable at the end points.

Counterexample 21

(End points) Consider the curve or line segment $\gamma (t):= (t,0)^\text {T}$ with $t \in [0,1]$. For $x:= (0,1)^\text {T}$, the derivative in direction $(1,0)^\text {T}$ is $(1,0)^\text {T}$ but $(0,0)^\text {T}$ in direction $(-1,0)^\text {T}$. Thus, the projection is not differentiable at points $x:= \gamma (0) + v$ with $v \perp \dot{\gamma }(0+)$ and ${{\,\textrm{proj}\,}}_\gamma (x) = \gamma (0)$.

The ambiguity points with respect to a Jordan $C^2$-curve have a benign structure. The restriction $A_2:= \{ x \in \mathbb {R}^d: \#[ {{\,\textrm{proj}\,}}_\gamma (x)] = 2 \}$ of the ambiguity set $A:= \{ x \in \mathbb {R}^d: \#[ {{\,\textrm{proj}\,}}_\gamma (x)] > 1 \}$ consisting of all the points with exactly two projections has Lebesgue measure zero.

Lemma 22

(Ambiguity points) Let $\gamma $ be a finite-length Jordan $C^2$-arc. Then, the subset $A_2:= \{ x \in \mathbb {R}^d: \#[ {{\,\textrm{proj}\,}}_\gamma (x)] = 2 \}$ has Lebesgue measure zero.

Proof

Let x be an ambiguity point in $A_2$ with projection $P_1$ and $P_2$. Since the distance function is continuous, we find a small open neighbourhood $U_x$ such that ${{\,\textrm{dist}\,}}(y,\gamma )$ is attained by a curve point near $P_1$ and/or $P_2$, i.e. ${{\,\textrm{dist}\,}}(y,\gamma ) = \min \{ {{\,\textrm{dist}\,}}(y,\gamma _1), {{\,\textrm{dist}\,}}(y,\gamma _2) \}$, where $\gamma _1$ and $\gamma _2$ are small arcs around $P_1$ and $P_2$. Further, $U_x$ may be chosen small enough such that the projection to a single arc $\gamma _1$ or $\gamma _2$ is single-valued in $U_x$ such that ${{\,\textrm{proj}\,}}_{\gamma _1}$ and ${{\,\textrm{proj}\,}}_{\gamma _2}$ become continuously differentiable by Theorem 1. The ambiguity points in $U_x$ are the zeros of the function

$$\begin{aligned} F :U_x \rightarrow \mathbb {R}: y \mapsto {{\,\textrm{dist}\,}}(y,\gamma _1) - {{\,\textrm{dist}\,}}(y, \gamma _2). \end{aligned}$$

Since the gradient

$$\begin{aligned} \nabla F(x) = \frac{x - P_1}{\Vert \hspace{1pt}x - P_1\hspace{1pt}\Vert } - \frac{x - P_2}{\Vert \hspace{1pt}x - P_2\hspace{1pt}\Vert } \end{aligned}$$

is non-zero, $P_1$ and $P_2$ are distinct points, Dini’s implicit function theorem [39, Thm 1B.1] states that the ambiguity set $A_2$ in an open neighbourhood $\tilde{U}_x \subset U_x$ around x is the realization of a continuously differentiable map $a_x :\mathbb {R}^{d-1} \rightarrow \mathbb {R}^d$ and is thus a Lebesgue zero set by Sard’s theorem [40]. Since the Euclidean $\mathbb {R}^d$ is second-countable, already countably many set $U_{x_n}$ cover $A_2$, whose union is again a Lebesgue zero set. $\square $

Proposition 23

(Ambiguity points) Let $\gamma $ be a Jordan $C^2$-arc. Then, the ambiguity set $A:= \{ x \in \mathbb {R}^d: \#[ {{\,\textrm{proj}\,}}_\gamma (x)] > 1 \}$ is the closure of $A_2$.

Proof

Since the distance to the curve is continuous, the points in $\overline{A}_2$ are ambiguous. To show $A \subset \overline{A}_2$, we take an ambiguity point x with $\#[{{\,\textrm{proj}\,}}_\gamma (x)] > 2$. In two dimensions, the set ${{\,\textrm{proj}\,}}_\gamma (x)$ is located on a circle. Since $\gamma $ is not closed, we can either shrink the circle and move it into a gap between to projection points or, if ${{\,\textrm{proj}\,}}_\gamma (x)$ lie on a half-sphere, we can move the circle outwards and enlarge it (see Fig. 12). In both cases, the centre y of the deformed circle is contained in $A_2$. By controlling the radius, the centre may be arbitrarily close to x. This construction generalizes to $\mathbb {R}^d$ by changing the radius and moving the sphere containing the projection points in several steps. $\square $

Figuratively, higher ambiguities with $\#[{{\,\textrm{proj}\,}}_\gamma (x)] > 2$ occur at points, where the charts constructed by the implicit function theorem are glued together. Since $A_2$ is locally a hypersurface, the Lebesgue measure of the closure remains zero, which establishes Theorem 5.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Beinert, R. Approximation of curve-based sleeve functions in high dimensions. Adv Comput Math 49, 91 (2023). https://doi.org/10.1007/s10444-023-10088-2

Download citation

Received: 09 March 2022
Accepted: 06 November 2023
Published: 30 November 2023
DOI: https://doi.org/10.1007/s10444-023-10088-2

Keywords

Mathematics Subject Classification (2010)

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Approximation of curve-based sleeve functions in high dimensions

Abstract

Article PDF

Similar content being viewed by others

Data Fitting on Manifolds with Composite Bézier-Like Curves and Blended Cubic Splines

Entropy and Sampling Numbers of Classes of Ridge Functions

Multipoint Estimates for Radial and Whole-Plane SLE

Data Availibility

References

Acknowledgements

Funding