H76b

Systems with Infinite-Dimensional State Space:
The Hilbert Space Approach
Abstract-The contrd of infmitedimensional systems has received of them. Ideally, a complete theory would have several layers.
much attention from engineers and even mathematicians Realizability There should be a general theory giving the structure common
although fmt considered m [4] has been ignored until recently. Ironi-
to many situations and then there should be a branched hier-
d y enough while state-spacesystemstheorywasdeveloping in the
e d y 1960’s a mathematical study of scattering and of wn-selfadjoint archy of increasingly special conditions and increasingly spe-
operptorsproducedaparalleltheorywhich was infmitedimensional cific theorems. Such an extensive theory will, of course, take
%om the beginning. When the dose relationship between the two sub a long time to develop. On the positive side the general theory
jects became known time invariant infiitedimensional s y s t e m s theory
of infinite-dimensional A , B , C, D-type linearsystems now
advanced quickly and at a general level it now seems reasonably com-
plete. This paper desaibes the connection between mathematical scat- seems reasonably complete and that is whatthisarticle pre-
tering and systems Itthen gives a thaough treatment of i n f i t e sents. We describe a realistic setting for many infinite-dimen-
dimensionnlrimeinvariant continuous time systems Thelast section sional problems to which the main properties of general finite-
lists recent scatteringresults whi& might be of engineering intexest. dimensional systems adapt suitably.
Although the world abounds in important infinite-
I.INTRODUCTION dimensional systems, of what benefit is a theory about them?
Certainly the abstract considerations t o be presented here are
V ERY ROUGHLY SPEAKING, a situation which re- not of immediate interest to the practicing engineer. However,
quires infinitelymany parameters to specify will be they may ultimately be influential. For example, theone
modeled by asystemwithinfinite-dimensional state problem common to mostinfinite-dimensional design situa-
space. Very early in the c o m e of a design, one restricts atten- tions is: How does one select a finitely parameterized approxi-
tion to a few parameters which he believes are the most im- mation and how does one determine its accuracy?Problems
portant. However, the resulting heavily proscribedsystems of this type are mathematically speaking infinite dimensional
will still have infinite-dimensional state space. For example, a and one would have little hope of developing a decent theory
variable coefficient transmission
line
has an infinitely of finite-dimensional approximationswithout first having a
parameterized space of capacitance, inductance, etc., distribu- good theory of infinite-dimensionalsystems.Hopefully the
tions and any system corresponding to a transmission line will approach taken here addresses that need and provides a foun-
have infinitedimensional state space. Even thoughthere are dation for building the more specialized layers of theory which
many ways t o discard all but a finite numberof design parame- will follow. Another benefit of having an infinite-dimensional
ters such as to consider only cascades of a prescribed number theory closely attuned to a classical mathematical physics ap-
of uniform lines or to prescribe u priori the tapering of the proach (as this one is) is that developments in that area can be
line, all such systemsstill have infinite-dimensional state adapted to (and encouraged by) engineering theory; some of
space. In thissame vein semiconductorsadmit an infinite Section IV andtheentirety of Section V are examples.
variety of doping intensities, of geometries, and of doping Whereas, it would be highly irresponsible to urge a practicing
geometries and even the practical devices with heavily re- engineer to learn the abstract theory described herein it would
stricted geometries and doping patterns have an infinite-dimen- be equallyfoolish for theoretical engineers to abandon such
sional space of states. This applies to an enormous number of pursuits since unified a and
thoroughunderstanding of
devices and though finite dimensional systems suffice as mod- infinite-dimensional realizability should be valuable.
els for basic circuits, many components of a modem circuit Now that we have motivated a study of infinite-dimensional
when modeled carefully would be infinite dimensional. From systems, we give some idea of what the theory entails and how
this viewpoint, infinitedimensional engineering problems pos- itconnects with other areas. The basic general principles
sibly outnumber the intrinsicallyfinite-dimensionalones. To which have emerged in finite-dimensional systems theory are
reemphasize this point we mention that besides the many in- the fact that two ‘minimal’ systems with the same frequency-
finite-dimensional systems thought of more in connection with response function (FRF) are isomorphic, every rational func-
control (cf. [ 4 3 ] ) there are transmission lines, microwave cir- tion is the FRF for some ‘minimal’ system, and the eigenvalues
cuits, thin film devices, surface acoustic wave devices and in- of the ‘state operator’ for a minimal system give the pole loca-
deed most semiconductor devices. tions of the FRF. During the early 1960’s,while this was de-
In fact, there are so many examples of infinite-dimensional veioping a mathematicalcounterpart evolved independently.
systems that any theory general enough to deal with most of The mathematical theory is infinite dimensional and by com-
them would be too general to say much specific about any one bining it with’the systems approach precise analogs of these
basic theorems are now available in infinite dimensions. Thus
Manuscript received January 15, 1975;revised June 15, 1975. there has been a recent advance in systems theory initiated by
This work was supported in part by the National Science Foundation.
The author is withtheDepartmentofMathematics, University of the discovery of existing mathematical work and greatly expe-
California at San Diego, La Jolla, CA. dited by its application. A surprising fact is that not only do
146 PROCEEDINGS OF THE
IEEE, JANUARY 1976
the mathematical works provide techniques for the study of devoted his time to design of lumped circuits using the chain
systems,theworksthemselvesarereasonablyclose to being formalism(cf. the survey [ 16]), and his colleagueArovhas
studiesofsystems,Forexample,thebookby Nagy and pursued infinitedimensional problems [ 1 1.
Foias [491, work which was done for purely mathematical rea- Amusingly enough a major problem which has troubled the
sons, could be viewed after a translation of terms almost en- field since completion of the earlier work centers on deciding
tirely as atreatiseoninfinitedimensional discrete-timesys- what one means by an infinite dimensional system. The type
tems. If the results and methods of the book are restricted t o of linear system treated in these works always has the form
finitedimensions they agreewith thestandard engineering
d
ones for the case of‘lossles:’ discrete systems. A description - x ( t ) = A x ( t ) + Bu(t)
of the relationship appears in[ 281. dt
The work of Lax and Phillips though slightly less similar in
y ( t ) = Cx(t) +Du(r)
form is moresimilar in spirit to systemstheory.This is a
theory of scattering, which includes nonsymmetric scatterers but amusingly enough it is not at all clear what type of linear
as opposed to the radially symmetric ones concentrated upon maps B and C should be. In fact this has been a major prob-
in classical physics. The original work compiled in [37] treats lem troubling the field since the connection between systems
lossless situationswhile [38] gives alossyscatteringtheory. andoperatortheory was discovered.Fuhrmann andHelton
This works puts great emphasis on the ‘state operator,’ but in- avoid the problem by treating mostly discrete time systems.
put and output operators never appear and are replaced by a In treating the continuous time case Dewilde avoids B , C en-
more general notion of “incoming and outgoing space.” De- tirely while Baras and Brockett choose them in a mathemati-
spite these differences the author feels that Lax-Phillips scat- cally convenient way whicheliminatesmoststandardexam-
teringandinfinite-dimensionalsystems theory arelinked so ples. Thus an essential task is t o finda formulationfor
closely in substanceandspiritthatfuturedevelopment of continuoustimesystems whichcontainsthemainexamples
these subjects should go hand in hand. Section 111 is devoted and to which the basic finite dimensional systems facts extend
to describing the preciserelationshipbetweenthetwosub- in a reasonable fashion. Ironically enough the techniques for
jects. An immediateconsequence is thatone can seehow proving these facts have (for the last three y e i n ) been much
systems theory looks in the setting of classical scattering, how clearer than what the systems themselves should be. Even for
for example the classical objects of scattering, wave operators, studying discrete time systems a slightly broader set up than
andscattering
operators,compare t o objects
in
systems the one presently used might be desirable since it is necessary
theory. Quite possibly these more established objects will be t o embrace some common circumstances (see Section11-B).
easier to use for infinite dimensions than controllability and The major accomplishment of this paper is that it gives a
observability
operators. The
article
[281
connects Lax- theory of continuous time systemswhich is physically realistic
Phillips scattering with systems in discrete time; while [ 3 11 as well as being complete at the general level. The theory also
does it for nonlinear situations. fits together perfectly with Lax-Phillips scattering which guar-
Now we describe some history. Among mathematicians the antees that it contains the physical situations they have stud-
first work in this direction was done by Livsic [441, 1451 who ied and is consistent with their long experience in abstracting
began a study of operators on Hilbert space which are not self- mathematics from physical situations.
adjointin the early1950’s, cf. [9]. Thework of Lax and We also mention that
the
distributional
approach of
Phillips on scattering theory began in the early1960’s. Also Balakrishnan and more recently Aubin and Bensoussan is very
about that time DeBrange and Rovnyak and Nagy and Foias general and contains many physical situations. The work does
began a study of “nonunitary” operators which though similar not effectively introduce a Hilbert space structure and thus is
t o Livsic’s in general philosophy was basically very much dif- moreabstractthan our approach. No isomorphism property
ferent. Another closely related study was done by Helson. In or spectral results exist for distributional systems although the
1964, AdamajanandArov [ 21 showed the Lax-Phillips and author does not know if these will come from additional work
Nagy-Foias theories t o be equivalent. In 1965, Livsic used his or if they require additional Banach or Hilbert space structure.
operator theory to derive a type of systems theory. The book Our treatment of systems might be thought of as a combined
received no attention in this country until connection of the distributional and Hilbert space approach. Hilbert spaces usu-
above mathematical theories with systems was discovered here ally arise from energy or cost considerations. Energy consider-
abd generated enough interest to produce a translation [46] in ations play a fundamental role in mathematical physics which
1973.Thefirstworkoninfinite-dimensionalsystems was accounts for the prominence ofHilbert space there.
done in control. In realizability the first work was Balakrish- Our notation is standard.If H is aHilbertspaceand
nan’s [6] in 1966 which described the response function for a 1 <p < 0 0, then LP(a, b , H)denotes functions h ( t ) with values
systemwithdistributionalentries.Littlehappeneduntil re- in H so that $’ 11 h(t)llP dt< O0. A function inLz [ - j m , j m , HI
centlywhen the mathematicaltheory was seen t o pertain; which is the Fourier transform of a function in L z [O,m, HI
Dewilde [ l l ] , Fuhrmann [17]-[26], andHelton [27], [28]. has an analytic continuation to the right-half plane (RHP) of
Also around that time systems theorists Brockett and his stu- complexnumbers;denotethespace of thesefunctions by
dent, Baras, became interested in the subject [ 51. There was H2(H).The corresponding space for the left-half plane (LHP)
considerable subsequent work on this [ 61, [ 121, [ 181, [ 19 1, isz2(H). The space of all infinitely differentiable H valued
[291,[311,[321andrelatedtopics[131,[201-[261,[301. functions defined on [ a , b] and vanishing near a and b is de-
In quite a different spirit there is the algebraic approach taken noted CF(a, b , H). If H and K are Hilbert spaces QH, K ) de-
by Kamen [ 341 -[36] and the distribution theoretic approach notes the space of all bounded linear operators (ones whose
ofAubin and Bensoussan [ 31. More recentRussianwork norm
known to the author is the book [45] describing new direc-
tions in systems theory. Also the mathematician Potopov has
HELTON: SYSTEMS WITH INFINITE-DIMENSIONAL STATE SPACE 147
is finite) from H to K . Note a linear operator is continuous if Thus a signal entering thetransmission line at x = 1 satisfies
and only if it is bounded. The space of all uniformly bounded
functions on [a, b ] with L(H, K ) values is Lm(u, b, H , K ) while
H"(H, K ) [resp. F ( H , K ) ] denotes those which have a uni-
formlyboundedanalyticcontinuation tothe RHP [resp. while one leaving satisfies
LHP] . A one parameter semigroup G ( t ) of linear operators on
H will always satisfyadifferential equation of the form
d C(t)/dt = A G ( t ) on a dense subspace 9 ( A ) of H on which the
linear operator A is d e f i e d . The converse also holds. A com-
plete reference on the subject is [ 3 3 ] , an introductory refer- this describes the input and output to the line. The state-space
ence is [ 561 and an intermediate reference is [ 151. A good equations for this setup are:
general reference on functional analysis is [ 5 5 1 .
'at( ' ) = A ( ; ) + B u
11. THESETUP
A . Continuous Time.
For our purposes a system [ A , B , C , D ] is defied by Y = c(:)
)*d = A x ( t ) + Bu( t ) where A is the unbounded operator onHE defined by

dt
y ( t )= C x ( t ) +Du(t)
where A , B ,C , D , are linear operators and x ( t ) , u(t), and y ( t )
are vector-valued functions. If the vector spaces involved are
not finite dimensional, then precisely where the operators B
and C mapfromand t o is adelicate matter. One naturally acting on domain
wants to specify them in a nice way but it turns out that the
simplest definitions exclude many interestingexamples.
For example, one seemingly reasonable set up would be to
have Hilbertspaces U, Y , X input, output, and state spaces,
finite i(1) - 4 1 ) = 0 and 40)= 0 ,
respectively. The operator A is densely defined on X,and is
}
the infinitesimal generator of a strongly continuous semigroup
eAt on X, the operator B maps U t o X and C maps X to Y.
Despite thenaturalappearance of this structureit is very
Ba = 6 ( x - 1) (-:) a, and C k ) = 4 1 ) + i(1).
*restrictive. As Baras and Brockett [ 51 observe (see also [31) As another example, consider this same transmission line in
'the impulse response function C 8 ' B u applied to input vector the impedanceformalism. Theinput is then i(1) while the
u is continuous in t . This property does not holdeven for loss- output is 41). The state-space equations are again (2.1), but
less transmission lines; to cite the simplest example a uniform this time
transmission line with c = I = 1, unit length, and with one end
short-circuitedhasimpulseresponse equalthe diracdelta
6 ( t - 2) when treated in the scattering formalism(reference
line c = I = 1). Thus a more elaborate definition of system is
required in order to avoid vacuousnw of the theory. Before Ba=6(x - 1) (:)a, and C(i)= v(1).
presenting our definition of "system" wegive as an example
the statespace description of a lossless transmission line. This Let us make an observation about the examples; while A is
illustrates the difficulties just described and gives one some-
an unbounded operator on X as we expected, B is a distribu-
thing concrete to hold onto during abstract discussions. The tion and C is an unbounded operator. Thus the situation is
example is like the one in[ 321.
partly operator theoretic and partly distributional. This type
Consideratransmission linewith capacitance c(x), induc- of behavior occurs frequently and to deal with it we shall use a
tance I(x), nonvanishing and resistance d x ) , conductanceg(x) "rigged Hilbert space."' Such spaces should be familiar to fol-
at the point x in [ 0, 11. The energy of a current-voltage dis- lowers of the Lions school; a general referenceis [ 2 5 ] .
tribution (i)in the line is If H and H 1 are two Hilbert spaces with H 3 H 1 , then a
third Hilbert space H' can be defined as the set of all continu-
ous linear functionals on H 1 . Since to each x in H we may as-
sign the linear function I on H 1 defined by I ( y ) = ( y , x) for
and HE will denote the Hilbertspace of all (i)
withfinite all y in H 1 , the space H is regarded as being imbedded in H'.
If M is an operator onH whoce adjoint M* maps H1 into itself,
energy. The line is short-circuited at x = 0. To study this line
we connect the x = 1 end to a 'lossless reference line having thzn it induces an operator M on H' according to the formulz
c = I = 1 and send signals down it toward x = 1. A signal mov- [MI] ( y ) = I(M*y) for each I E H' and y E H 1 . The operatorM
ing left [resp. right] is known to have a spatial distribution at
each instantof time 'An alterqativeandequivalentsettinginvolves bilinear forms on
U X H , . We could use thisformalism throuaout withtheadvantage
that bilinear forms are more fashionable in current mathematical phys-
ics (see [ 55, ch. 81) than rigged Hilbert space. On the other hand, the
rigged Hilbert space seemedless encumbered.
148 PROCEEDINGS OF THE
IEEE, JANUARY 1976
is just the continuous extension of M to H,because G l , ( y ) = line, X is a space of functions 3 (C) is asubspace of fairly
y )so it will be denoted M. If B
(M*y, x) = ( y , M x ) = l ~ ~ (and smooth functions and Bu for each u in U is a "rough func-
maps Hilbert space U into H',then its adjoint maps H 1 to U tion." Moreover, the operator ( A - z0)-' is an integral opera-
and is given by ( u , B*x) = [Bu] (x). Likewise if C: H1+ U its tor which hasa smoothing effect and ( A - zo)-'Bu will be
adjoint sends U to H' and is given by [ C * u ] (x) = ( u , Cx). Note smooth enough to lie in 9 (0. This guarantees that
B** = B and C** = C. C(A - z0)-' Bu and C(A - z0)-' e(A-zo)tBu exist and conse-
Let X, U,Y be Hilbert spaces. The closed operator A maps quently that the output y ( t ) = C x ( t ) + D u ( t ) is well d e f i e d at
a dense subspace g ( A ) of X to X and generates a uniformly least for anydifferentiable inputfunction u ( t ) . This isthe
boundedone parametersemigroup denoted e A r . The motivation behind the definition of compatible system.
expression There are further properties which systems frequently have.
These properties of systems have been isolated by Lax-Phillips
(1,Y ) A = (x, Y ) + ( A x , AY)
as common to all systems they study.
is meaningful for pairs of vectors x , y in 9 ( A ) and gives an in-
Meromorphic property:
ner producton 9 ( A ) whichmakes it aHilbert space. Let
$ ( A ) ' be the dual of 9 ( A ) and we have a rigged structure The operator valued function (zl- A)-' is mero-
%(A)' 3 X 3 % ( A ) . The same can be said for %(A*). Since A morphic in thewhole complex plane @.
generatesa nice semigroup A* does, and thisimplies that if
A > 0 the map (A - A)*-' sends X continuously into 9 (A*) A stronger propertyis
with I1 IIA* topology. Thus (A - A)-' is well defined on 9 (A*)' Compactness property:
and for w in 9 (A*)' the linear functional [(A - A)-' w ](x) =
w((A - A ) * - ' x ) is continuous in the X topology. By the Riesz There is a number T > 0 so that eTA(Zl- A I-'
is a
compact operator forRe z > 0.
representationtheoremit equals ( x , w o ) with W n in X. In
other words (A - A)-' w = w o and so (A - A)-' 9 (A*)' C x. The A , B , C, 0 from the transmission line example is a compat-
Also eAIt $ ( A * ) C 9 ( ~ *(cf.) 115, ch. VI11 1.5, lemma 7(b)l ible system having T = 0 compactness. Note, compactness is
and is uniformly [ - ] A : boundedand so eAr is defined on related to stability in thatanyasymptoticallystable system
9(A*)' and is uniformly bounded there. with the compactness property is exponentially stable.
A system [ A , B , C, D l will consist of maps, A as above, The basic objects one uses to study asystem are control-
B : U + $ ( A * ) ' , C: !??(C)+ Y , and D:U + Y where the domain lability, observability, and either
input-output, impuise-
of C satisfies % ( A )C 9(C) C D(A*)', the operator B is con- response, or frequency-response maps. Inthiscontextthey
tinuous, and C is continuous on % ( A ) in the ( , ) A topology. are defined as follows:
A compatible system satisfies ( z o l - A)-'B C 9(C) for some
zo with Re zo > 0. Most systems in the author's experience
The controllability map e: L' (U) + 9 (A*)' is
are compatible.
Let us pause for a moment to see what type of objects one
can have for solutions x ( t ) , y ( tto) the system equations. For-
eu = lm &'B u ( t ) d t
mally the state x ( t ) arising from input functionu ( t ) is

where u is the function u ( t ) in L' (U).
x ( t ) = eArxo +
I' B u(s) ds.
If x. E !??(A*)'and u ( t ) is a continuous U-valued function,

(2.2)
The observability map 9 : L' (Y)+ 9 ( A ) is
qy = [ C d i ' ] * y ( t )d t .
then #'xo is in 9 (A*)', Bu(s) is a continuous 9 (A*)' valued
function as is eA(t-S)Bu(s) and its integral is in 9 (A*)'; thus The FRF is
x ( t ) is a continuous function with values in !??(A*)'.So much
for solutions tothestateequation. To obtainoutputs y ( t ) D + C(ZI- A)-' B
from the states one needs to apply C to x ( t ) . This for systems for Re z > 0.
as we have defined them is impossible because of their extreme Thelaterobject requires muchexplanation. It is well d c
generality: C simply cannot be applied to an arbitrary element Tiedfor a compatible system. To check this note
in 9 (A*)'. The situation is not really so g r i m , for set x . = O C ( z o l - A ) - ' B is well defined and since the operator on the
and suppose that u(s) is differentiable. Thenintegration by right-hand side of the "resolvent identity"
parts in the formula forx ( t ) gives for any zo with Re zo > 0
( z l - /I)-'- ( z o l - A ) - = ( z o - z ) (zol- A)-' (zl- A)--'
(2.3)
maps D (A*) into 9 ( A ) we can apply C from the left and B
from the right to get a natural definition of C(zl- A ) - ' B . The
FRF is not necessarily defined for an arbitrary system. The
The integral is in % ( A )C 9(C) and so, the only obstruction to FRF is, however, given in an intuitive sense up to a constant
having Cx(t) defined lines in the last two terms. These vanish by
whenever support u C (0,t ) and as will be discussed later this D + (zO - Z) C(z01- A)-' (zZ- A ) - ' B . (2.4)
insures that the "Hankel" operator for the system exists. In
most practical situations,forexample, see the transmission For many purposes the indeterminacy is unimportant. This is
because the Hankel operator for a system plays a dominant exactobservability, etc. will be defined as above with 2 re-
role in realizability theory and according to some thought is placing C . A one parameter semigroup eAr of operators on X
about the only thing oneusually measures anyway; the Hankel is called asymptotically stableif eA'x + 0 for each x in X.
operator for a system is determined by its FRF but is inde- Remark 2.1. In the defmition of system we assumed that A
pendent of any constant added to the FRF. generated a uniformly bounded semigroup eAr. One could re-
Now we discuss Hankel operators. If F( j y ) takes values in place this by the assumption that thesemigroup is bounded by
oe(U, Y ) and is uniformly bounded, then for u in H2(v) the ll&'ll Q ept with p > 0, since a simple scale change A - p con-
function F( j y ) u ( j y ) is in L2(-", 00, Y ) and can be written verts this to auniformly bounded semigroup. The FRF (at
uniquely as the sum a + b of functions a in H2(Y)and b in least (2.4)) will be defined on the half-plane Re z > p rather
p(Y). Define HankelF u 2 a. Note that if F is constant, then than the RHP, and everything else works with analogous modi-
HankelF = 0 so two functions differing by a constant have the fication. In fact the main reason for taking p = 0 was to avoid
same Hankel operators. If the function F given by (2.4) is uni- carrying the subscript p through the whole paper.
formly bounded on RHP, then the Hankel operatorforthe Remark 2.2. A more general but natural definition of sys-
system is HankelF. Now let's lookatHankelFinthetime tem is possible. Namely, the operators A and D remain as is,
domain. Suppose for the moment that the Fourier transform the operator B maps U into % (An*)', while C : 9 (Am)+ Y .
8 of F which in general must be regarded as a distribution &) With this definition everything in the section after a straight-
1s in fact a well-behaved function.ThentheFourier trans- forwardmodificationstill goes through.To make sense of
formed Hankel operator x: L 2 [-", 0,U] + L 2 ( 0 ,-, Y ) is most formulas one simply requires what in the time domain
[Xu] (s) = im E(t + s) u(-'s) ds.

amountsto repeated integration by parts. For example, in
(2.5) just integrateby parts n + m - 2 more times.
B. DiscreteTime
If fi is a distribution this definition extends naturallyto give a What is the correct notion of a discrete-time system? Let us
map X: c;(-=,0,V)+ ~ ~ (m,0Y ), . The formally associ- consider a typical situation-discretizing the transmission line
ated with a system is from subsection a. The method we choose is a usual one (cf.
[ 10, Appendix C] namely, send in pulses of duration T and
read values or averages of the output at timeintervals of length
[ x u ](s) = CeA('+')B u ( t ) dt T . The state propagation equation for the discrete version of
(2.l)is
which rigorously is defined on CF[-m, 0, v) through integra- w(n + 1) = &v(n) + &(n)
tion by parts
where A" = 8" and
[ x u ](r) = C ( A - p)-2 e(A-P)('+')B -
d2
dt2
u ( t )d)t
(2.5 1
B" = I1
eAq B d q .
and does not require F to be uniformly bounded. The input, output, and statespace are the same as before. For
each a(t) put into the continuous time system, the resulting
The adjoint system to [ A , B , C, D l is [A*, C*, B*, D*] . The
state at time T is f eA(T-q)Ba(q) dn and its energy and con-
controllability [resp. observability] maps fortheadjointare
the same as the observability [resp.controllability] maps sequently 11 ;1 is no greater than .jT
la(t)12 d t the energy of
a(t). If a(t)is a pulse of height a, then the state is precisely
forthe original. Notethat 2 : 9 ( A ) + L 2 ( 0 , m, Y ) takes
x E $ ( A ) into the functionC p x and similarly for e.
Thus h;thus B" is a bounded operator. Since A is dissipative eA"
has norm Q 1. Now we turn to C. There are several reason-
able ways to define it. A common one is set 2; = C. Thp 2; is
2*Cu =
1- C&('+')B u ( t ) d t
which up to u(r) + u(-r) equals the Hankel Operator for the

an unbounded operator on HE. Another is to have E(:) give
the average value of i - u over some small interval (0, 1). If T
is small and 0 is chosen appropriately this can be approximated
in practice by taking a time average-of $1, t ) - u( 1, t ) over a
system. sampleinterval of length T . Here C is a boundedoperator.
While the controllability map sends L' ( O , m , v) to 9 (A*)' The first methodthough seemingly simple is incompatible
integration by parts shows that functions m CF(0, m, v) map with energy considerations, while the second is consistent with
into X . If the set of vectors so obtained is dense in X the sys- energy considerations.
tem is called approximately controllable or often just control- The second type of discretesystem is treated in (281 and
lable. If in addition e is continuous it is called continuoudy works of Fuhrmann.The first can becompletelyanalyzed
controllable. If maps some subspace of L 2 onto all of X , using the same techniques adapted to a setting like we use in
then it is exactly controllable. By the open mapping theorem this paper.
pseudo
a inverse of any continuouslyexactly
control-
lablesystem is a boundedoperator. If R is asubspace of
111. LAX-PHILLIPS SCATTERING AND SYSTEMS
L 2 ( 0 , m, v) for which range CIR = range e,
then for most pur-
poses we can use CIR instead of e.We call such a subspace A . The Lax-Phillips Model
controlling for e. Reachability is also used by some authors This section begins with a summary of Lax-Phillips structure
instead of the term controllability. Henceforth observability, adaptedsomewhat to our purposes. Chapter 1 of [37] is a
-
150 JANUARY O F THE IEEE,
PROCEEDINGS 1976
boonto aphysicalunderstanding of thisformalism-rather

dryly presented here.
Let U,Y , X be Hilbert spaces. We define H to be the Hilbert
INCONING I
L
HEATER
TRANSNISSION HOT ROD

space LINE
H=L2(-=,0,U)@X@L2(0,=,Y) ll -I
OUTGO ING
TPANSNISSION
and T ( t ) to be a strongly continuous uniformly bounded one LINE
parameter semigroup of operators on H which describes how TENPERATURE

HEASURENENT
information in H changes as time goes by. Let T - ( t ) [resp.
y+(r)] denote translation by t units to the right on L 2 ( - = ,
PROVERBIAL
0, v) [resp. L 2 ( 0 , w, Y ) ] . Denote by S[-,,b] the subspace BLACK BOX
L 2 ( - a , 0, U ) @ X @ L 2 ( 0 ,b , Y ) of H ; by S[X, the sub- Fig. 1.
space X @ L 2 ( 0 ,-, Y ) , by s [ ~ -J
-] , the subspace L (0,=, Y ) ,
etc. Let Pi-,, b] be the orthogonal projection onto S[-a, b l ,
B a = ah(r)
etc. The semigroup T ( t ) has the following properties.
cx = x(+)
(i) T(t)g = T+(t)gon ~'(0, =, Y )
T(r)*g = T-(t)*g on L'(-=, 0,v). D = 0.
>
(ii) For a 0 define Z ( t ) = P[-,,,] T ( t ) P [ , , - 1 . Then It is compatible and satisfies the compactness property. To
Z ( t ) and Z ( t ) * are asymptotically stable. fit the system into a Lax-Phillips model, connect an infinitely
(iii) For a > 0 longtransmission line to the heater and another one io the
measuring device. Energy is supplied to the heater by sending
an incoming electric signal down the input wire, and the mea-
surement is converted into an outgoing electric signal which
travels out the output wire (see Fig. 1). The space of possible
incoming [resp. outgoing] signals is
The number a is not critical and in practice a is chosen in an
arbitrary fashion. The choice of a effects the scattering matrix
but in a trivial fashion, namely, S b ( z ) = e(b-4zS,(z). {(-f) with f in L2 (-=, 0, R ')
1
A further restriction [38, eq. (1.2)1 which all Lax-Phillips
models satisfy is that of inertness:
T+(t)S[o,-l = T(t)SIo,-1 is orthogonal to range T(t)S[,,a
resp.{(:)withfinL2(0,-,R') 1 .
Thus the incoming and outgoing spaces are naturally identifi-
T-(t)*S[-,, 0~ = T(rF S[--, 0~ is orthogonal to range able withL2 spacesandXisL2(-=,0,R')@X@L2(0,=,R').
At aparticular time in theoperation of thiscontraption a
power signal will be coming in, something will be going on in
An inert modelwhich satisfies (ii) and (iii) for aparticular X,and a measurement signal will be going out, in other words,
a> 0 will satisfy them for any a > 0. We call the setup just one would observe avector in X. Later t timeunitsone
described a Lax-Phillips model (with reference a ) . would observe a vector h'. The map sending h to h' defined on
all X by this process is T ( t ) the time evolutionsemigroup. We
B.Example could write it down explicitly but do not toavoid redundancy
The transmission line example fits nicely in the Lax-Phillips with the next section which gives a formula for doing this in
framework and in fact is the typeof situation which motivated general. The Lax-Phillips model here is inert. Clearly any in-
their theory. By reading [37, ch. 11 one can see how to do coming signal or state when propagated for t time units has no
this example in the form of Section 111-A. However, it is more influence on L2 [ t, =, R '), thus the first condition is satisfied.
instructive to d o a typical systems example which is somewhat Intuitively the condition says that state and incoming waves
bizarre from the Lax-Phillips viewpoint since this underscores cannot have influence in the outgoing space which propagates
what is happening ingeneral. faster than theprevailing signal speed.
Consider a rod of unit length and temperature distribution
x(r, t ) which is fixed, x ( 0 , t ) = x( 1, r) = 0, at the ends. There C. Getting a Lax-Phillips Model froma System
is one heater in the rodwhich upon receipt of input a supplies We just saw how a Lax-Phillips model could be associated to
heat in a smooth distribution &(r), and one measuring instru- one particular system; now we show how to do this in general.
ment which reads off the temperature atr = 4. The associated It is done with an abstraction of the preceding construction
system is which we call the imbedding construction.
Proposition 3.1: If [ A , B , C, D ] is a system the imbedding
construction associates with it a Lax-Phillips model with ar-
bitrary reference if and only if [ A , B , C, D ]
(a) is continuously controllable and observable,
A f =-d 2 f on 9 ( A ) (b) has uniformly bounded FRF,
dx2
(c) has d" and Prasymptotically stable.
To describe the imbedding constructionwe need definitions.
Let x [ o ,tl be the function on R' which equals 1 on the intet-
MS HELTON: WITH INFINITE-DIMENSIONAL STATE SPACE 151
Val [ 0, t ] and 0 off of that interval. Define R , : L 2 ( 0 ,00, H ) following identities

+-
L 2 ( 0 , 00, H) by [ R r f ](r) = x [ ~rj(r)f(t

, - T) and denote by m y+(t)K(W) + K(t)Pw =K(f + W) (3.2a)
the reversal operator u(r) u(-r). The space X in the Lax-
--f
Phillips model is obviously LZ(-=,0, U ) @ X @ L 2 ( 0 ,=, Y) d'rp(w) + P ( t ) = P(t + w ) (3.2b)

and we must define operators T(t) on X. Since X is naturally
decomposed into three subspaces it is natural to express T ( f ) Y + ( t ) p ( W ) K(t)P(w) p(t) 3 - ( W ) = P(t W ) (3.2~)
as an operator entried 3 X 3 matrix: which we must verify. Lets check 1 ; its right side is
y+(t)R,Cd" + R,Cd'('+ w).
The first term is C&(w- r + 'I,for t Q r Q w + c and 0 for all

other r , while the second is Cd'('- w, for 0 < r < t and so
+
where P ( t ) : L' (-00, 0, U ) + !i?(A*)' is defined by bothterms sum to C d ' ( f * w - r ) for O < r < t + w . This is
R t + wCeA' as required in 1. The adjoint of 2has the same
form as 1 and so it holds. Assume for the moment thatD = 0.
The left side of ( 3 . 2 ~ applied
) t o f i n C y ( - - , 0, U ) is
and K ( t ) : $ ( A ) + L 2 ( 0 ,=, Y) is 3 + ( t ) R w C 0I r $ 1 ( r - h ) 8 f ( - A ) d X + R f C ~ r
K(t)x = R,C eArx = R , a * x
andp(t):L'(--,O, V ) + - L 2 ( 0 , = ,Y)is
Properties of the systemcorrespond topropertiesof T(t)

as follows:
plus
System Lax-Phillips model
(a) + (b) + eA 'uniformly bounded T(r) uniformly bounded
(i) no restriction
(C) (ii) plus
(a) + (4 (iii)
inertness
no restriction t-r
CJ, e ~ ( r - h ) B f ( - X- w ) dA, for r E 10, t l .
as we now demonstrate. Since IlRtll = 1 = Ilmll, we have
IIP(t)ll < IIeII and Ilu(t)ll < (I 2*11 = 119 II for all t . Conversely A change of variables makes the last integral
llRrfllLl l l f l l ~ l as t +- and so max, I I K ( f ) I I = 119 11. More-
--f +-
over Ut > range Rt is dense in L 2 (0, =, V), so max, I I P ( t ) I I =

B f(-X) dX.
IICII. Since p(t) is Rr applied totheinput-output
max, Ilp(t)ll equals the norm of the input-output map which
in turn equals maxRez > 0 IIS(z)II where S(z) is the FRF. This
map
Jwr-r+w #(r-r-k+w)
This plus the middle integral is

establishes the equivalence surrounding boundedness.
,.art
Lax-Phillips property (i) and inertness are built intothe
c.onstruction. Property (ii) is the same as (c).The second
of property (iii) says p f P x h +P(t)P[-,,olh + 0 and
3-(t)Pi--,01h 0. The last convergence is automatic and
--f
the eAr convergence follows from (c). Let us treat P(t)u for
cI1+w-* &(t+w-r-A) B f ( - X ) dX,
which added to the first integral extendstheformulato

for r E [0, t ]
fixed u in Lz [--, 0, VI. Suppose E > 0, pick N so that [ 0, t + w]. However, this is R,+ J', 8('- ' ) B f(-h) dX as
I I P [ - m ,u~l~l ~<le . For r > N required. To verify (c) when D is not 0, all we must do is
check that 3+(t)RwD +.R,D 3Jw) = R,+ wD,. The opera-
P(t)u = I" 6 + &(f-r)B u(-r) dr

tor D factors out to the left, m y-(w) = y+(w)*rn and so the
problem reduces to verifying
y+(t)Rw +Rr3+(w)*=Rt+w
and note that the second integral is dominated by IIClle. The
f i s t integral is P('-")x, where x = :$ & ( N - r ) B u ( - r ) dr and which is easy.
so goes to zero as t -. Thus continuous controllability plus
+-
The imbedding construction can be carried out with many
(i) gives the second part of property (ii):the first part follows variations. One we shall use takes
similarly from continuous observability. ;IC=LZ(-m,-Q,U)8X,dLZ(U,m,Y)
We still mustshow that T ( t ) is a semigroup. Upon multi-
p&ing T ( t )T ( w ) and equating it to T(t + w ) we obtain the with Q > 0; we call it the a-imbedding.
152 PROCEEDINGS
JANUARY OF THE IEEE, 1976
D. Getting a system from a Lax-Phillips model a/2): Observe using inertness that K(t)[P[-,, X I x + ,I x ]
To a Lax-Phillips model for each number a > 0, we associate is orthogonal to range T+(t+ 0/2), in other words, it is identi-
the compatible system: cally zero on [ t + a / 2 , - 1 . Thus K ( t ) x = ~ ( t ) p + P [ ~ , , ]on
x
[ t + a/2, =o] and if x E $ ( A ) the argument used to imply con-
A the infinitesimalgenerator of the semigroup P[-,,,] tinuity of x € 2 ( A * ) at -a implies here that p + P [ ~ ,x~ is l a
TWP[-,,- 1 , left continuousfunctionat a . Thus [K(f)X] ( T ) = [K(t)p+ *
Eu is the linear functional on 9 ( A * ) satisfying [ E u ] (x) = P [ o , a ~ ~ l ( ~ ) = R r ~ + ( r ) P [ o , ~ ~ x ) , , b ut tT+(t-r)PIo,.1x=
lim,t
( u , [P[-,,01x1 ( - a ) ) , [P[O,,]xl(a) = c x .
Cx [P[o,,l l x ( a ) , for x in ii? (A) + S I I I , u ~which
~ we call the Since p, 0, adjoint have the same basic form as K ,K, the argu-
emitted system. ment above implies that p = p,. Now we turn to p and p,.
Theirdifference 6 satisfies (3.3) with g(s) = T-(s). Inertness
The first order of business is to clarify these defiitions. In
implies that the action of T ( t ) on a function in L z (-00,- a , U )
particular we need to establish that E and C are defined for
is not felt in L z ( a , 00, Y ) for 2u time units; thus p ( t ) = 0 if
those x cited. After completing that we prove the emitting
and imbedding construction are inverse to each other. t < 2.u. This is also true of p s ( t ) . Therefore, A ( t ) = 0 for
t < 2.u and for a < t < 2u we have lim,f r A(t)(r) = 0. On the
Proposition3.2: The a-imbedding construction applied to
other hand,inertness implies that 3+(t)*A(t)= 0 which to-
the system emitted at a from a Lax-Phillips model gives the
gether with (3.3) gives A ( t ) = 3(t)*A(r + s). That is A(t + s) .
original model and conversely.
To prove that E is well defined we need to show for x in (r + s) = A ( t ) ( s ) ; so lim, t t A(t)(r)is independent of t and from
9 ( A *) the function g 4Pi-,,ol x is right continuous at -a. Let the above observation it is 0.
P O , p- be positive smooth functions on [ - a , 01 with po + p- =
At this point, we mention an additional example. The trans-
1 and p- 0 outside of [ - a , - a / 4 ] , The vector x is in ii?(d*) if mission line example effectively demonstrates the system one
and only if P[+,,I[ ( T ( t ) * - I ) / t ] x converges. Consider would use to model many typical one-dimensional scattering
situations. While the state operator A varies with the trans-
P[-,,,I A ( t ) p - g = P [ - u , n( ~A ( t ) p - ) g + P [ - , , a l ( T - ( ~ ) * PN- )t ) g mission line one is testing the operators E and C do not; they
are intrinsic to the reference line. For scattering in odd dimen-
where \(f) = [(T-(t)* - I ) / t l . The first term converges to
sions the same is true and the input-output operatorsE and C
P[-,,,]p-g. Inertness implies that P[-,,,] (IT-(t)*p-)[T(t)* -
can be written very explicitly by using directly the formulas
I/t] P[~,,l= x 0 for small t , thusthe second term equals
of [ 38, sect. 71. Instead of writing out the setup and formulas
P[-,,,I ( T - ( t ) * p - ) [ T(t)*- Z/t] x which converges since x E
% ( A * ) . We have shown that l i m r + . o P [ - a , u l A ( t ) p - g exists explicitly we just give the recipe. Here U = Y = L z ( S " - ' )
in L z [ - a , 0, Y ] . In other words p - g is differentiable in a
-
where S"-' = {(x1, * * * , X") : Ixl Iz + * + l x n f = 1). To de-
certain sense. It is well known (and easy to prove, cf. [53,
-
fine Eu plug h(x * w , w )= 6-,(x w ) u ( w ) into (7.17). To
define C one evaluates j d of (7.11) ata ; note (7.1 1) is given in
Theorem 1.151 ) that this type of differentiability implies
terms of (7.8) and (7.4).
p - g is continuous. Thus PI-,,o ~ is x continuous near - a . A
similar argument holds for x € % ( A ) and suffices to show that E. Finer Structure
C is well defined.
Proofof Proposition 3.2: The space Jc arising from the Now that we have shown how systems theory and scattering
a-variation on the imbedding constructionfromone system theory correspond it is natural to examine how basic objects
[ A , E , C ] is ~ = L ' ( - = o ,-a, U ) @ X , @ L z ( a ,00, Y ) and we such as the FRF the controllabilityand observability operators
denote by Os, K,, and ps theconstituents of the semigroup compare to basic objects in scattering theory. There are three
T J t ) in thismodel. The space in the original Lax-Phillips such objects in scattering: the scattering matrix, the forward
model can be identified with Jc in the apparent way. (X,% and the backward wave operators.Sincesystems correspond
L z (-a, 0, U ) @ X @ L 2 ( 0 , a , Y ) , etc.) and the semigroup T ( t ) to Lax-Phillips models and any Lax-Phillips model has wave
can be written as a 3 X 3 matrix with critical entries denoted and scattering operators these objects can be associated with
p, K , p. We shall show that p = Os, etc.; the main tool is any system. The bulk of this section is devoted to comput-
Lemma 3.3: Suppose that Z is a vector space and g(s) is a ing them explicitly for a system; we find that they are closely
one parameter semigroup of linear operators on Z . If for each related to C, 0, and the FRF. In [281 this business was thor-
t , s 2 0 the map 6 ( t ) : Z + L 2 (O,m, Y ) satisfies oughly worked out for thediscrete-time case.
Before beginning the computation we mention another di-
T+(t)6(s)- G(t)g(s) = 6 ( t + s ) (3.3)
rection. Certain functions called 'distorted plane waves' play
and if for eachfuted z, t the L z function 6 ( t ) z satisfies a fundamental role in classical scattering theory. Helton and
lim, t b [ 6 ( t ) z ] ( r ) = 0,then S(b)z is identically zero on [0,b]. Ralston [32] showed that for the system [ A , E , C ] naturally
Proof: For fixed t , s the function fT+(t)S(s) z is identically associated with the lossless transmission line (as in Section I!)
zero on [0, t ) and so on that interval 6 ( t + s)z = -6(t)g(s)z. the state-space valued function (z - A)-' Eu[resp. (z - A)-'
Thus 1-im, t [ 6 ( t + s)z] ( r ) exists and is 0; this is independent C*y] actuallyequals the classical physics incoming[resp.
of s so the lemma follows. outgoing] distorted plane wave. (Here u , y must be chosen in
Consider 6 = K, - K , since both K , and K satisfy (3.2a), acertain basis.) The (z - d)-'E objects are used repeatedly
their difference 6 does also; note(3.2a) is a special case of in studying systems and the author feels it is quite interesting
(3.3). For x in ii? ( A ) the l im,t b [K,(b)X] ( r ) exists and is c x . that they correspond to such important objects in scattering
To complete the proof we need only show that this limit on theory. It seems reasonable clear that this correspondence will
K equals C because the result subsequentlyfollows from carry to most situations. One should be justified in thinking
Lemma 3.3. To d o thislet po, p+ be smoothfunctions on of (z - A)-'&, etc., as asuperposition of distorted plane
IO, a ] which satisfy po + p+ = 1 and po E 0 outside of [O, waves.
HELTON:SYSTEMS WITH INFINITE-DIMENSIONALSTATE SPACE 153
Nowwe compute the wave andscattering operatorsfor a Bg(f') d [ + D. The lower left entry is
given system.The basic philosophybehindscattering is to
study a situation by comparing it to one where nothing hap-
pens,free
a situation. Given a Lax-Phillips model as in P ( t ) p l ( t ) * g =CR, B:P:('-'))C:R,g(r) dr.
Section 111-A the appropriate free model has evolution opera-
tor denoted S ( t ) which is the shift by t units to the right on Uponwriting R , explicitly, making the change of variables
L(-m, 00, U ) . Forthe sake of convenience we shall always c
t - r -+ f', and sending t + - we obtain P ( t ) p l ( t ) * g -+ /; *
take U = Y which is no real restriction since the smaller space B:eA;(*-[)C:g(f') d t which we abbreviate a:.
Here s1 is
can be enlarged to give two equidimensional spaces which are the transfer operator for [ A l , B 1 ,C 1 , O,] . Finally, the top
unitarilyequivalent. If the Lax-Phillips modelhasreference comer entry is
a, then let be a map of L Z ( - a ,a , U ) into L Z ( - a ,0, U ) @ X @
Lz(O,a , U ) . Let p denote the map of L Z ( - m ,m, U ) into JC
which is the identity on L 2 ( - w , - a , U ) and L z ( a , QO,U ) and
p" on L Z ( - a , (I, U ) . The backward wave operator is the map
W - of L z (-00,m,U ) to X defined as
This we now compute. Let [ A1 , B 1 , C1, 01 be the system Change variables twice, namely t - s -+ 5 and then t - h + q to
whose Lax-Phillips model with reference Q is the free system. obtain
Then T(t)p S ( t ) * written in matrix form is:
Pl(t)*
: ).
Pl(t)* 5--(t)* Thelimitast+ais5,Sl*g.
These expressions can be seen in a very simple way because
As t goes to infinity y+(t)*,K ( t ) , and & I f go to zero (in the theaction of :
and 5 11 = 5 are trivial. Theoperator W -
strongoperatortopology [55, ch. 6, sect. 11.Thus we see when written with respect to the basis
immediately that p has no effect on W-. Furthermore, since LZ(-m, 0, U ) @LZ(O,m, U )
the Lax-Phillips model for T ( t ) is inert we have p ( t ) y-(t)*,
P ( t ) - 3-(t)*,3 - ( t ) p l ( t ) * , and y - ( t ) p l ( t ) * equal zero. What -+L2(-=, 0, U ) $1@LZ(O,=, Y )
remains is
is
t+-
0
The main influence of Q is in the middle row of S ( t ) * and we

w-=(! i).
The classical forward wave operatormakes no sense for most
have seen that this row disappears in the limit. The influence lossy situationsbutnatural generalizations lim,
+ 0 T(t)* *
of Q on the rest of the calculation is merely in adjusting ranges S(t)* = W + oritsadjoint W z = lim, + s(t)T O ) domake
of integration. Suchbookkeeping merely confusesotherwise sense, cf. [ 291 or [38]. The same procedure as above yields
illuminating
and
straightforward computations.Therefore, that W, in the basis L z ( - m , 0, U ) @ L 2 (0, m, U )+ LZ(-=,
we shall henceforth take Q = 0. 0, U ) @ X @L'(o, m, V )is
Now we computetheentries of the matrix. Thebottom
(: 3
right one is
W+= 0 Q .
The classical scattering operator is W,*W- and from our com-

Above it we have putations is
(;*e 9).
The entry [Q*Cgl (Q= C . !I e A 6 - ' ) B g ( q ) dq while the inte-
gral expression for 5 has already been given. When these are
substituted into the matrix onefinds that the scattering opera-
tor is
[I: C eACt-')B g(g) d t

1
+ D g(f') .
That is, the scattering operator is g. This completes the com-

putation of wave and scattering operators.
154 PROCEEDINGS OF THE IEEE, JANUARY 1976
Now we give a brief scattering to systems dictionary. Lemma 4.2: Projx, 9 ( A * ) iscontained and is dense in
%A:).
ips Herein Proof: Write dl*' as a 2 X 2 operator entried matrix with
respect to thedecomposition X, @X? of X, applyit to a
Df vector (g) in X, and integrate
D!
3 ( t )or Uo(t)
3(-t)or uo(-t)
''9 - 1 Pf" and P!+'
X , TO),20) = P
translation representationwith It is straightforward to check that f ( x , s) is in D ( A * ) . Clearly
U=Y=N Projx- 9 ( A * ) C $ ( A : ) and so p ( x , s) P Projx, f ( x , s ) is in
imaginary,jw axis real axis !D(A:). Since e x is norm continuous with limit x at t = 0,
we get p ( x , s)/s -P x , moreover, if x E $(A,?), then A : p ( x , s)/
s + A,?x. Thus p ( x , S)/S + X in the 1) l l ~ , topology.
*
Iv. THE BASICPROPERTIES OF SYSTEMS
The first lemma guarantees that E , once defined on Projx,
There are three basic theorems which hold for finite dimen-
Y)(A*) is defined on adensesubspace of $ ( A *) while the
sional systems: there is a controllable and observable system
second says that E, has a continuous extension to $(A:).
with a given FRF, two such systems are equivalent in a certain
The various properties asystemmight have are preserved
sense, every 'reasonable' function is the FRF for some system.
under his cut down operation. Onecan check ina straight-
These threeproperties when suitably interpretedholdin
forward way that since eArXc C X,, the operator ( z - A,)-'
infinite dimensions as is described in the first three parts of
equals ( z - A)-' restricted to X,; moreover, (z - A,)-' E , =
this section.Thefourthpart shows thatformeromorphic
( z - A)-' E for Re z > 0. This is enough to show that the
systems the poles of the FRF correspond to eigenvalues of A
function C(z - A)-' (s - A)-' E , which determines the Hankel
as aconsequence of Lax-Phillips theory. It also describes
operatorforthe original systemequals C,(z - A,)-' (s -
eigenvalue behavior foranothernatural class of systems.
A,)-* B,; thusthetwo Hankel operators are equal. If the
Throughout this section we work only with continuously con-
original system is compatible ( z - A)-' E is contained in $(C)
trollable and observable systems.
as well as X,; thus it is contained in D(C,) = 9(C) n X, and
A . Canonical Decomposition of a System the new system is compatible. The two FRF's are clearly the
same. If an operator-valued functionismeromorphicor
We shall see that iny systemcan be 'cut down' to acon- compact valued its restrictionto an invariant subspace also has
trollable observable one having the same Hankel operator (or these properties;thusthemeromorphicandcompactness
FRF if the original system is compatible). Here is the proce- property are preserved.
c
dure. Let X, = cl range and X0 = cl range 9 for the system The same type of process works withobservabilityand
[ A , E , C,D l . Let X,-and Xb denote the orthogonal comple- projecting [ A , E , C,D l down to X,. One proof relies on the
ment of X, and X,, respectively. fact that Q is the controllability map for the adjoint system
Now we describe how to restrict the system [ A , E , C,Dl to [ A *, C*,E * , D * ] ; the preceding construction applies. Then
X, and obtain a system [ A , , E,, C,, D , ] which acts on X,. one can take adjoints to obtain [ A0 , CO,E o , Dl , an observable
Since eAf X, C X,,its restriction toX, is a semigroup and con- system. If one combines thesetwo processes, onegets(by
sequently has an infinitesimal generatordenoted A , whose first decomposing the system [ A , E , C,Dl into controllable
domain Y)( A , ) is contained in Y)( A ) . Set D , = D. The operator and uncontrollable parts and then decomposing these systems
C, is defined on Y)(C) n X, by C,x = Cx. The operator C, is into obserable and unobservable parts).
11 [la, continuous. B, is first defined as a linear functional on Theorem 4.3: There is a closed subspace X,, of the state
projx, 9 ( A *)by space X of asystem [ A , B , C,Dl so that the system [A,,,
[Bcul ( w )= [Bul ( w ) E,,, C,,, D ] gotten by restricting and projecting the original
system t o X, (as described above) is controllable and observ-
and then extended to w in ? ( A : ) which is shown to be possi- able and has the same Hankel operator as the original systerk
ble by the next twolemmas. The system is compatible if the original one is and the FRF is
Lemma 4.1: If u E U,there is a constant K(u), so that the the same for both. The
meromorphic and
compactness
linear functional Eu satisfies properties are ako preserved.
I[Bul (w)l Q K ( u IIProjx,
) w l l ~ Note: W e E , and , C mightberegardedformally as
restrictionsfollowedbyprojections A,, is definedby the
for each w in Y)( A9. relationship = Projx,, d"tx,, andthis is notthesame
Proof: For p < 0 set x&) = C ( X [ ~ ,e-pf ~ I u ) ; then (x&), thing.
'
x > = $ , (Be-"' u ( t ) , &*f x ) d t . Since A* generates a uniformly
bounded semigroup range A * - p = X , see [33, Theorem E. Two Systems with the Same ERE
2.3.21. Ifwe set x = (A* - p l y , the integral becomes [Eu] Thissection describes theextent to whicha system is
({&*-Ps - I ) w ) . Since P ) ~ W I-toI ~ we
~ get lirn,,, . determined by its Hankel operator or if the system is compati-
( x p ( s ) , x ) = - [Eu] ( w ) . Thus x p ( s ) converges weakly to a ble with itsFRF. First it is necessary t o discuss pseudo-inverses
vector x(=) which is in X, because each x p ( s ) is. We have the of an operator. Suppose that M is an unbounded operator with
estimate I[Bu] ( w ) l < Ilx(=)ll IIProjx, ( A * - p)wll from which domain 3 fromone Hilbertspace H1 to another H z , and
the lemmafollows. suppose 63 is a subspace of 3, which does not intersect thenull
HELTON:SYSTEMS WITH INFINITE-DIMENSIONAL STATE SPACE 155
space of M . We definethe 63 pseudo-inverse of M , written From (4.1), we see that

M i ' , to be the operator G with range equal to 61 so that GM M eAr = eArM
(resp. M G ) is the identity operator on63 (resp. MG). The main
theorem here is on range andadditionally
Theorem 4.4: Suppose that two.co?trollable and observable
systems [ A , B , C,D l and [ A , B , C , D l have the same Hankel MA =AM
operator. If the two systems are continuously controllable and
observable, thenthereis a possibly unbounded closeable
on S A . Furthermore, M e = e. Thus if u E U , the vectors
operator M with dense range suchthatthe
equations
'intertwining'
S
I,'d ' f Bu d t 4 b(u, s), PrBu d t 4 b(u, s)
MA=AM MB=B CM=C D = D
each holdon adense set. One suchoperator M is gotten
e
in range and satisfyMb(u, s) = d(u, s). Now ( x , M u , s)) +
[Bul ( x ) F d (x, b(u, s)) + [hul ( x ) and so [MBu] ( x ) = [ B u ] *
explicitly by ( M * x ) = [Bu] ( x ) for x .in S A * . To see that M intertwines C
~=(9;~)*2=&;l and C observe that C eAsMx = 2 *Mx = 2 *x = C eAsx for x E
S A . Set s equal to 0 to obtain the result.
defmedon range C whenever -63 and 6: are controlling and The 'note' and last line of the theorem are straightforward
observing spaces for e,
and 2 , 2 ,respectively. Note that to prove (cf. [ 28 p. 291 1.
Remark 4.5: If the two systems in Theorem 4.4 areem-
di= [ n u l l C = n u l l t I l a n d L = [ n u l l 2 = n u l l91' bedded in a Lax-Phillips model, then T ( f )and T ( t ) are related
by G T ( r ) = T ( t ) C where C in matrix form is
are one set of spaces for which the theorem applies. Clearly
if bothsystemsarecontinuouslyexactly controllable or
observable, then M is bounded and invertible.
Proof: The continuous operator 9 * ehf applied to u in
Cr[O, -, U]gives the function C d l s e A r ;1 eArB u ( t ) d t of
s. Since unlimited integration by parts is possible, there is no
problemin interchanging operatorswithintegration to get C. System Realizability
2 * eArCu(s)= (- 1)" JtC dits"") ( A - p)" B(ePr u ( r ) ) ( " ) d t . Thissubsection describes how to construct system a
When [O,gl containsthesupport of u we may substitute with preassigned FRF. An operator-valued function F ( z )
c(f) = u ( g - t ) for u then change variables and get
analytic onthe RHP is said t o converge to 0 inthe
positive direction if lim, -,+* max yllF(x +jy)ll is 0, it con-
Q* eArC C(s> = (- 1)"
I,"+s+r eA(s+r+g-r)
( A - p)-" B(eP(g-') u(t))(")d t .

verges to F , if F ( z ) - F , converges to 0. The function F is
unitary inner if F ( j y ) is a unitary operator for almost all y .
Note that if F is analytic on Re z > p and is the FRF of a
compatible system C ( z - A ) - ' B , then the modification A -+
This is just the response of the system [ A , B , C , D l to input A - p - E of the system changes the region of analyticity for
e p ( 8 - r )u for values >g. Thus by hypothesis we have FRF to the RHP or greater. Thus as far as the realizability
Q* @ r e = g*,Ar e problem is concerned we might as well assume that the (pro-
spective) FRF is well behaved on theRHP.
Theorem 4.5: If F ( z ) is a uniformly bounded 6:(U , Y)-valued
for u E Cr and by continuity it holds for all u E L 2 [ 0, 00, v] .
analytic function on the RHP with a limit F ( - ) in the positive
From this, one gets if 6? and 6: are controlling and observing
direction, then there is a compatible continuously controllable,
for e, and 9 , 9, respectively, then and observable system satisfying Il@'II < 1 with F as its FRF,
~
(9-1 ) * eA2r x -- e-5-1gr a x which
(i) is exactlycontrollableand observable if andonly if
$r x E range e. By setting K = 0, we see that HankelF has closed range. Note suchsystems are always
(C&l)* 9 * = (4.1) asymptotically stable.
(ii) has 8' and d" asymptoticallystable if F ( j y ) has
holds on range = e a , regardless of which 6: and fl are used. either the form V ( j y ) G ( j y )or G ( j y ) U ( j y ) where Uis
Define M to be this operator on range e.
The construction also a unitaryinnerfunctiononthe RHP and G is the
works for the adjoint systemandyields an operator M + de- boundary value of an analytic and uniformly bounded
f i e d on range 4); the two operatorssatisfy ( M x , y ) = ( x , M'y) function in theLHP.
for x , y in their respectivedomains. Thus M has adensely (iii)has controllabilityand observability operatorswith
d e f i e d adjoint and by 159, ch. VII, sect. 2, Th. 31 is closable. isometric adjointsif F is unitary inner.
Let SA = {oE g ( A ) : a = & for a uEC"(O,-, U) with Proof: We begin informally and give an intuitive idea of
t u i n g(A)}. Since any u in Cr(0, -, U) will produce sucha the construction. Assume that F goes to zero in the positive
vector, SA is dense. (even inthe II lh topology) in $ ( A ) n direction. Let X ( t ) denote the shift by t units to the left on
range :.likewise MSA is dense. By (4.1) the map M takes SA L2 (z-, -, Y). Its infinitesimal generator A" is dldx acting on
into g ( A ) and so M as an unbounded operator on I)(A) is $ ( A ) = { u E L 2 ( - - , = , Y): the derivative of u is in L 2 ( 0 ,
closable. Likewise M* restricted to a dense subspace SA* of m, Y)}. The adjoint of A" is - d / d x acting on the same domain.
$ ( A * ) is closable, etc. The inverse Fourier transform of Fu is not in general a func-
156 PROCEEDINGS OF THE IEEE, JANUARY 1976
tion, but it can be naturally viewed as a linear functional on c

By construction = Hank+, so the system [ A , B , C, Dl is
g(z*) = $ ( A ) ; namely, if 3 dezotestheordinaryFourier exactly controllable if and only if range HankelF is closed. As
transform on L2 functions, set [Bul (f)=.E,, F(j[) Sf([) d [ mentionednear the end of SectionItheoperator 9 * C m is
where f E g ( A * ) and B is the generalized Fourier transform of unitarily equivalent to HankelF. Thus 119*xll > e Ilxll; conse-
F. Note that the integral exists because Sf(t)/(l + [[I2) is in quently, range 9 isclosedand wehave exactobservability.
L 2 ( - m , m, Y). A simple estimate (cf. [53, Th. 7.251) shows Conversely, foranycontinuouslyexactlycontrollableand
that evey fun_ctionin $@) is continuous, thus we may de:ine c
observable system 9 * has closed range; thus, the associated
a m a p C : $ ( A ) + Y b y C h = h ( O ) f o r h i n a ( & .T h e i , B , e Hankel operator has closedrange. An argument like that in
constitute a system. If the Fourier transform of F were a con- [28, Remark4.21 demonstratesthatcontinuousexactcon-
impulseresponse
-
tinuous function, then [ Z(t)FF] (x) = 3 F ( t + x) and so the
function ?eA' a u = [SFu] ( t ) is 3F, as
trollability[resp.observabilitylforces
asymptotically stable.
'
8 [resp. &'] t o be
required. To prove (ii), suppose thatthefirstfactorizationholds.
Now we turn to aprocedurewhichworksingeneraland Then range Hankelu 3 HankelF and so if x belong to the range
rigorously.Everything is set in the frequency domain. First of the latter it can be written x = Hankeluw for some w in
**-d
Fourier transform A , B , C: L2(-=, 0, Y). Now E ( t ) * x = Hankelug(t) where g(t) =
PL'[--, 0 , y ] Z ( f ) * w . Since g ( t ) + 0 as t +m, we h a d
3-'(z- & ) = z - j [ E ( t ) * x + 0, that is T ( t ) * x + 0. Note T ( t ) x = PL'(~,-, y)
[S-' &I (g) = 1, m
SF(jC;)ug([) d [
E ( f ) x automaticallygoes to 0. If the reverse factorization
holds, then a similar argument with the adjoint system proves
stability.
If F ( j [ ) is a unitary operator then it is straightforward to
. r M show that HankelF is apartialisometry. An argumentlike
that on [28, p. 321 Finishes the theorem.
D. PolesandEigenvalues
while this is a system it is not necessarily compatible so we When X is finite dimensional the polesof the FRF fora con-
extend C to be defined on g(C) = {g : R YI 3-'g has finitely
--f
trollable and observable system are located at the eigenvalues
many poles in the RHP and [3-'g(E) goes to zero in the posi- of A , in other words, input and output information completely
tive direction}.Thissystemiscompatiblebecause 3-' ( z - determinethe eigenvalues of A . Theimportance ofpole
&-' & = ( z - jt)-' F(j0 is in 39(C) and its FRF is location and of this fact in fiiite dimensions is well known.
For infinitedimensional situations the author is more familiar
withscatteringtheoreticimplicationsthanengineeringones
(possibly because of background and possibly because efforts
in the first areaare moreadvanced).The eigenvaluesof A
This is one boundary of integration over the contour { z : lzl = correspond to modeswhicharedecayingintheinside of
M ,Re z > 0) U { z : - M < z <M}. Theintegralover the the system and thus poles of the scattering matrixgive informa-
circular boundary goes t o zero as M + m because F goes to tion on this. Indifferentcasesthepoleshavedifferent
zero in the positive direction. Thus the FRF equals the residue interpretations. Much of Section V is devoted to describing
of the contour integral at z . In other words it is F ( z ) . If F this.
goes to _F, *#*0 in the positivedirectionset D =Fm. The When X is infinite dimensional the pole to eigenvalue cor-
system A , B , C, D is compatible and hasFRF equal to F . respondence is precise for exactly controllable and observable
Thesystemmay notbecontrollableor observable in any systems as we shall soondemonstrate. However, inthe
sense. Now we remedythat;Thecontrollabilityoperator is controllable and observable case (which is the most common)
for each input function u E L ' ( O , m , v) a n d f E 1)(A*) the situation is murkier. The problem-is that two systems with
the same FRF have operatorsA and A which are merely inter-
twined by a 1-1 densely defined M having dense range. This
is a very .weak relationship and it well known that even wheri
A and A are bounded operators no correspondence between
thespectra of A and A can be inferred(cf. [49, ch. VI,
sect. 4.21). Thus if a system had spectrum in correspondence
with pole location one might be able to fiid an M so that the
equivalent system produced would have completely .different
By the Plancherel theoremtheformulaextends to all f in spectrum.
L 2 ( - = , m , Y) and u in L 2 ( - m , m, v). Thus for such u, the All of this pessimism is not justified, because as we shall see
function F ( j [ ) [SUI([) is in L 2 ( - - , m, Y). By the closed the systems which come up in practice behave well. This is a
graph theoremthesystem is continuouslycontrollable.The consequence of the work dating back t o Moeller and used for
observability map when applied to y in L 2 ( 0 , m , Y ] is 2 y = this purpose by Lax-Phillips, who prove the following.
z(t)6 ( x ) y ( t ) d t whichsays that 2 is the identity map. Theorem 4.7: A compatiblecontinuouslycontrollableand
Since the system is continuously controllable and observable, observable system with the meromorphic property has a FRF
Theorem 4.3 implies that the sptem A , B , C, D obtained by whose poles are located exactlyat the eigenvalues of A .
x,
restricting and projecting B , C, D t o the closure of the range Proof: This is a rephrasing of [38, Th. 5.51 applied t o the
of the Hankel operator associated with F is compatible and Lax-Phillips model in which thesystem is imbedded(see
has FRF equal to F. Section 111-C).
MS HELTON: WITH INFINITE-DIMENSIONAL
SPACE STATE 157
Toapplytheir[38,Th. 5.51, we require in additionto required bythetheorembutwith W replacing F . Thusit

themeromorphicproperty a hypothesison 'incoming and remains to show that the singularities of W equal those of F .
outgoing vectors'. Actually an immediate consequence of this Clearly any singularity of F is a singularity of W. Note that in
additionalhypothesis [38, Lemma5.31suffices. In systems the nonpseudomeromorphic case W is not unitary inner and
theory language, it says that if x is an eigenvector of A [resp. a refined version of the Moeller theorem gives the last state-
A * ] , then Cx # 0 [resp. B*x # 01. The statement about C is ment in the theorem.
a weaker requirement then observability, while the one on B Now we show that every singularity of W is also a singularity
follows from controllability. The conclusion of their theorem of F . Obviously it will depend heavily on the fact that W-'
says that the poles of the s-matrix are located at precisely the and G have no common (left) factor whichcould cause the
eigenvalues of A . The FRF for a system is just the s-matrixof cancelation of asingularity.Thisindivisibility property has
the Lax-Phillips model in which it is imbedded and so the geometric inteIpretation that thesum S of the subspaces
Theorem 4.7 follows. W-'?(Y) and G H 2 ( U )is dense inHZ(Y), because C I S being
A compatible meromorphic system has a meromorphic FRF. (backwards) translation invariantcan bythe Lax-Beurling
A more general class of FRF is mathematically reasonable to theorem be written cl S = 8 2 ( Y ) for 8 a conjugate an_alytic
consider,namely, the pseudomerornorphic F ( z ) whichare inner function. Since S contains both W-' H z ( Y ) and G H Z( V )
q o s e having a factorization F ( j y ) = G ( j y ) V ( j y )or equiva- thefunction 8 divides both W-' and G from-the left and,
fently V1 ( j y ) G 1 ( j ywith ) the G's analytically continuable to therefore, 0 = constant or equivalently cl S = H2( y ) . Since
a bounded function analytic in the LHP and the V's unitary S consists of vector functions of the form
inneronthe RHP. Theseoccurredin Theorem 4.6, part n = W-'c+Gd (4.2)
b. When U and Y are finite dimensional another definition of
pseudomeromorphic is possible [ 131.There is a function where c E H Z ( U ) and d € H 2 ( Y ) the space 61&
{ c + F d } is
H ( z ) meromorphic and of bounded type in the LHP so that dense in 61' 4 Wgz(Y). We shall show that this can happen
lim,,o H ( x + j y ) = F ( j y ) exists for almost all y . Thus F is only if W has no more singularities than F .
meromorphic except on the jy-axiswhere it could have bad For the sake of a clear presentation let us do the dim Y = 1
discontinuities. It is in these finitedimensional circumstances case first; then W and G are scalar valued. Let us suppose that
where pseudomeromorphic FRF's are particularly interesting W has a singularity at z o while F does not. Note that since the
since they turn out to be precisely those function for which singularity set for F is a closed subset of { z : ( z (2 l}, zo is at
there is anycorrespondencebetweenproperties of A and some positive distance from any singularity of F and so F is
properties of theFRF. Thus they are thebiggest class o f uniformly boundedin some region about z o . A classical
functions that one would probably want to use in connection factorization theorem due to Buerling (see [26]) implies that
with classical A , B , C , D type systems theory. This point is w = m * p where both m and p are inner, m has no singularity
well illustrated by the type of system described in Section IV-A in common with F and p is bounded in a neighborhood of z o .
s t t e n byrestricting the shift to acertainsubspace.Sucha Since W-'F is in Hz(@) and m and F have nocommon
restricted shift system, as Fuhrmann calls it, hasa structure singularities p-' F is in Hz@. Thus from (4.2) every rnn is in
which can be immediately analyzed by standard mathematical H z ( @ ) which is impossible since the n aredense and m is a
theory arising from Moeller's theorem I261, [371, [491. nonconstant function in Hz(@).
Theorem 4.8: The singularities of a pseudomeromorphic After setting things up properly this same type of reasoning
FRF for a restricted shift system with U and Y finite dimen- will apply to the dim Y = K > 1 case. Suppose there is a zo as
sional are equal precisely to the spectrum of A . The spectrum before. Then there will besome entry, say w of the matrix
of A with Re z < 0 consists of isolated eigenvalues. function W which is singular at zo . The functions w ' 1 factor as
If the FRF is not pseudomeromorphic, then the spectrum of before into mlpl. Now
d " = ( 2 : (21< 1).
Proof: It suffices to work on the unit disk rather than the
RHP since a standard transformation (Caley) connects the two.
If F is a unitary inner function then the theorem is essentially
Moeller's theorem(cf.[37,ch.3,sect. 31 or [49,ch. VI, is contained in
xh. 4.1 I ) and so our proof consists of reducing Theorem
$3 totheunitaryinner case. Supposethat F is pseudo-
meromorphic.Then rangeHankelF is asubspace of H 2 ( Y )
whose orthogonal complement 7 is invariant under multiplica-
tion by z . By the Lax-Beurling theoremthere is an 'inner' which implies that for an entry f i r o f F the function m;'m;'
function W such that 7 = WHz( Y ) . The pseudomeromorphic * * * mi'p;' . * p k ' f l l is in B2(Q)and consequently p;' * * *
property forces W to have V as a factor and so W is unitary p k ' f l l is in p ( Q ) .61;
Since is dense in 611, one has w11 =
inner; this is equivalent to F being pseudomeromorphic. Set n p l E clR ;. However, thisforces m l p i ' * . * p i ' to be in
G ( e i e )= W ( e i e ) * F ( e i e )Observe
. if f i s any function inH2(U), p ( Qwhich ) is impossible, since m l is singular at zo and the
then Ff E ' 7 + pz(Y)and so W*Ff is in H2(Y). This forces p;' cannot cancel this singularity. The proof is complete.
G to belong to H"(U, Y ) . We have F = WG where by construc- As an aside it might be informative to observe how Theorem
tion cl range HankelF = range Hankelw. This latter property 4.7follows from Theorem 4.8for U , Y finite dimensional
implies that there is no nonconstant inner function W1 which (and otherwisewith slightly more work).Let [ A , B , C, D l
divides both W * and G fromtheleft, i.e., W I K = W * and satisfy the hypothesis of Theorem 4.7; with FRF denoted F .
W I L = G for some analytic K and L . By construction eAr is Let A arise from the restricted shift. realization of F. By
just the shift restricted to HankelF = Hankelw. The Moeller Section IV-B, there is M with A M = M A . Since the restricted
theorem says thatthespectrum of A behaves precisely as shift system is exactly controllable the inverse of its control-
158 JANUARY OF THE IEEE,
PROCEEDINGS 1976
lability operator and therefore M is a bounded operator (by

the formula for M). If A x = k,then AMx = XMx so X isan
eigenvalue of A . Thus every pole of F is an eigenvalue of A .
The reverse directionfollowsin an elementary fashion from
( z - A)-’ meromorphic together with the (X - A ) w = 0 *
Cw f 0 and (x - A *) w ’ = 0 *B*w‘ = 0 property.
The subsectionconcludeswithsomehistoricalremarks.A
correspondence between the spectrum of A and poles of the
FRF was widely observed once the connection between sys-
tems theory and the work of Lax-Phillips, Nagy-Foias, Helson,
etc. was understood and provided many new techniques, [ 121,
[ 181, [ 271.The first thoroughpresentationforthenon- Fig. 2.
boundary singularities is [ 121 . The pseudomeromorphic func-
tions were studied first by mathematicians [ 141. An equiva- where all coefficients are smooth and nonnegative-A(x) is a
lent notion called ‘roomy’ was introduced independently by positive definite matrix-and we assume Lu = A u , for Ix I > R .
Dewilde who asserted strongly that they were the proper class The objective is to study the coefficients in terms of the scat;
for the study of systems [ 111. He also asserted that certain tering matrix. The onedimensional case includes the losslesd
‘roomy’ functions were appropriateforthe electrical design transmission line, e.g., b = 0, u = (l/c), A 1/2. Very recently
technique called Darlington synthesis. In [ 13] the equivalence rudimentary Lax-Phillips scattering has been applied to sound
of ‘roomy’ with [ 141 was established and they were shown to impinging on an obstacle with a ‘springy’ elastic surface [8].
be precisely the class for which Darlington synthesis is appro- In one dimension this corresponds to a singly infinite constant
priate; the result was independently done in Russia by Arov coefficient transmission line connected to a circuit with one
[ 1I . Meanwhile Fuhrmann in his work on systems theory de- resister, capacitor and inductor at the finite end. We reiterate
voted quite a bit of attention to pseudomeromorphic functions that many of the resultswhichfollowdeal with or extend
[22], [23]. Inparticularheconcentratedon a subclass of easily to mixtures of the first and second situation (the third
functions which are precisely those which can be the FRF of has not been studied yet). In fact for many purposes an ob-
an exactlycontrollable and observable system. It was obstacle problem can be thought of as a limit of a variable coef-
served by Clark and Helton [28, Th. 3c.l plusRemark ficient problem as appropriate coefficients get infiiitely large.
3c.21 that in the scalar discrete-time case this class has a very Also many of the results will most likely have an analog for
nice characterization and Fuhrmann extend this to the multi- lossy situations.Thepreoccupationwith losslessness derives
variable and certain real time cases [ 191 , [ 2 1I . In the author’s from the fact that in classical physics most phenomena are
opinion the next necessary step in this direction to fiidlossy energy conserving; a general ‘lossy’ scattering theory did not
exactlycontrollable and observable systems which arise emerge until 1973 [381. Another point is that although the
physically. one dimensional case will be emphasized through the expos:
tion this is entirely for concreteness. Practically all structure
v. MISCELLANEOUSRESULTSINSCATTERING THEQRY is greatlysimpler for dimension one than for higherdimen-
This section has a different spirit than the rest of the paper sions. Thus it is regarded as trivial and frequently ignored in
and is not related to systems per se. It lists some qualitative the works to be described; three-dimensional structure, how-
results obtained in scattering theory over the last five years. ever, is a good guide to behavior in all odd dimensions.
The hope is that this will benefit engineers primarily by sug- One general class of results concerns how close poles of a
gesting structure which they did not know. Most surely the scattering matrix can come to the imaginary axis.
results as they stand will not apply without modification to 1) According to Beale [ 7 ] , suppose that a sequence of ob-
someone’s particular problem. However, practically all of the stacles 0, gets closer and closer to enclosing a region R (see
results are flexible andnobody knows their ultimategenerality. Fig. 2) and that the Laplacian on R with zero boundary con-
The mathematicians involved have not bothered to do rela- ditions has eigenvalues e 1 , ez , * * * . Then each corresponding
tively straightforward generalizations because motivation was scattering matrix s,(z) has asequence of poles p : , p : , * * *
lacking. Thus if a particular fact listed herein catches a readers such that p i +j e&. This result will almost surely apply to tha
fancy he is urged to consult the source article for a precise variable coefficient case, say for example to atransmissioh
statement of the result and if the context in that article is not line withvariable capacitance having a huge well. Very crudely
quite right he should not dispair since the result will probably speaking it describes an effect that pockets of very high signal
hold in many physically reasonable contexts. We emphasize speed have on poles near the imaginary axis.
that the listing here is not intended to be complete (orprecise) 2) For which situations does the scattering matrix have all
andconcentratesonworkdone primarilybypeoplein the of its poles in Re z < -6 < 0 for some6? A stronger question
Lax-Phillips branch of scattering theory. for the corresondingsystem is when is eAr exponentially
We begin bydescribing thecommonlystudiedsituations. stable, i.e., 11 < e-“? Beforedescribing general theorems,
Thereisobsrucle scattering for the wave equation in 1 , 3 , 5 , . * we list some special cases where exponential stability holds:
dimensions. Here one has an obstacle and bounces waves off when the obstacle is starlike (anypoint inside the obstacle
of the obstacle usually in order to determine its shape. (Di- can be joined to the origin with a line lying insidethe obstacle),
mensions 2,4, e --
are highly anomalous, see [41 I , and will not when the variable coefficientsrestricted to anyray through
the origin decrease with distance from the origin. The principle
be discussed here.) Variable coefficientscattering usually
treats an equation behind this can be thought of in terms of geometrical optics.
There are high frequency solutions of (5.1) with b = 0 which
u n = L u = u ( x ). V
A(x)Vu - b(x)u (5.1) can be thought of as propagating along rays-called bichar-
acteristic curves of L . They are the rigorous version of ‘light A is constant near a52 then if ?(x) never equals 1 on a52, the
rays’ and bend according to the laws which govern light rays system is observable. Inthe one-dimensional case this says
going through a changing medium.Forthe general variable that if the transmission line in Section I1 is terminated in resis-
coefficient case one has exponential decay provided that any tance R and has coefficients constrained as above (more than
bicharacteristic curve leaves any finiteregion in a finite amount necessary), then the system associated to it in Section I1 is not
of time, [ 37, Appendix 21, [41]. This bicharacteristic condi- observable if and only if R is the characteristic impedance of
tion also implies [401 that the poles of s(z) are in the region the line.
Rez<-(a+blogIzI) ACKNOWLEDGMENT
The author would like to thank J. V. Ralston for his con-
for some a > 0. In dimension one rays can never be trapped in
siderable help with Section V.
a finite region and so one always has exponential decay. For
the obstacle case one fixesa sphere around the obstacle 52,
draws a line to the obstacle, optically reflects it, follows it to REFERENCES
the obstacleagain, optically reflects it,etc.untilthe ray D. Z. Arov, “Darlington’s method for dissipative systems,” Sov.
crosses the sphere (on its way out). Let I be the length of the PhyS., Dok&dy,vol. 16, pp. 954-956, 1972.
A. Adamajan and D. Z. Arov, “On unitary couplings of semiuni-
which youtake to be 00 if the path requiresinfinitely tary operators,” Trans. Math. Monogr. AMs,vol. 95, pp. 75-129,
many reflections. Let L(n)be the maximum of all lengths I 1970.
J. P. Aubin and A. Bensoussan, “Models of representation of
possibly obtained thisway. The conjecture (137, p. 155)l is linear time invariant systems in continuous time,” MRS Tech.
that (a) if L(52) < m, then e A t decays exponentially, (b) if Rep. 1286,Sept. 1973.
L(52) = 0 0, then I(e A r11 = 1 for all t . In other words one sus- A. V. Balakrishnan, “On the statespace theory of linear systems,”
J. Math. AM^. Appl., vol. 14,pp. 371-391, 1966.
pects the same principle is behind both the variable coefficient J. Baras and R. Brockett, “H functions and infinite dimensional
and the obstacle case. A main difficulty with :he present state realization theory,” SIAM J. Control,
of the art is that rays hitting the obstacle tangentially can pro- J. Baras, R. Brockett, and P. Fuhrmann, “State-space models for
infinite dimensional systems,” IEEE Trans. Automat..Contr., vol.
duce someincredible mathematicalpathology. Ralston [SO] AC-19, pp. 693-700, Dec. 1974.
verified (b) while (a) has been proved for a large class of ob- T. Beale, “Scatteringfrequencies of resonators,” to appear in
Commun. Pure Appl. Math.
stacles, most recently 1471, [581. I . T. Beale and S. J. Rosencrans, “Acoustic boundary conditions,”
If the operator e A t is a compact operator for t sufficiently Bull. AMS,vol. 80,no. 6, pp. 1276-1279, Nov. 1974.
large, then e A r is exponentially stable ([37, Cor. 5.11). This M. S. Brodski, “Triangular and Jordan representations of linear
operators,” Trans. Math. Monogr. A M s , vol. 32.
stronger compactness condition is frequently satisfied, for ex- C. T. Chen, Introduction to Linear Systems Theory. New York:
ample, if the obstacle is convex 1481 or if the bicharacteristic Holt, 1970.
condition holds [ 53 ] . P. Dewilde, “Roomyscattering
fornia, Berkeley, Tech. Rep. 1971.
matrix synthesis,” Univ. Cali-
Another class of results describes how close poles get to the -,“Input-output description of roomy systems.”
?rigin. Given p > 0 there existsa region N in the complex R. G. Douglas and J. W. Helton, “Innerdilations of analytic
matrix functions and Darlington synthesis,” Acta Sci. Math., vol.
plane containing the interval [ - n / p , n / p ] so that the scattering 34, pp. 61-67,1973.
matrix for any obstacle which is contained in the sphere about R. G. D o u g h , H. S. Shapiro, and A. L. Shields, “Cyclic vectors
the origin of radius p has no poles in N ; see [ 421 . and invariant subspaces for the backward shift operator,” Ann.
Inst. Fourier,vol. 20, pp. 37-76, 1970.
A third class of results concerns how the scattering matrix N. Dunford and J. Schwartz, Linear Operators, vol. 1. New
changes when the coefficients are changed. This type of work York: Interscience, 1967.
goes under the heading of ‘sensitivity analysis’ in engineering A. V. E f i i o v and V. P. Potapov, “Jexpandingfunctions and
their role in the analytic theory of electrical circuits,” R u m
literature. Math. Surveys, Uspeki,vol. 28, no. 1, 1973.
1) Suppose the obstacle is starlike or that a andA decrease P. A. Fuhrmann, “On realization of linear systems and applica-
monotonically on rays through the origin. Then as a decreases tions to some questions of stability,” to appear in Math. Syst.
Theory.
and A increases the poles of s(z) which are located on the real -, “Realization theory in Hilbert space for a class of transfer
axis slide out the real axis. Obstacle case [37]-[42] ;variable functions,” J. Math. AMI. Appl.
-, “Exactcontrollabilityand observability andrealization
coefficient case [ 501. theory in Hilbert space,” J. Math. AM^. Appl.
2) An eigenvalue of s ( j w ) equals e ’6k(w) where 6 k takes P. A. Fuhrmann, “On sums of Hankel operators,” to appear in
l3al values and is called the phase shift. If the obstaclein- Proc. AMS.
-, “On generalized Hankel operators induced by sumsand
creases in size, or if A , a, and b/a increase, then 6 k ( w ) in- products.”
creases. Also under starlikeness and
radial monotonicity -, “On Hankel operator ranges, meromorphic pseudocontinua-
tions and factorization of operator valued analytic functions,” to
assumptions as above, the derivative ( d / d w ) 6k(w) is positive. appear.
For the singly infinite transmissionline (open- or short-cir- -, “Factorization theorems for a class of bounded measurable
cuited finite end) this says that for fiied frequency input the functions,” toappear.
-, “On series and parallel coupling of a class of infinite dimen-
phase of the output is shifted by an amount which increases sional systems.”
when the capacitance or inductance of the line decreases. The I. M. Gelfand, Generalised Functions. New York: Academic
last fact is classical physics, the first two facts are in I 321 . Press, pp. 1964, 1968.
H. Helson, InvariantSubspaces. New York: Academic Press,
Quite a different topic is that of controllability and observ- 1964.
ability. It has recently come of interest to mathematicians in J. W. Helton, “The characteristic functions of operator theory
and electrical networkrealization,” IndioM J., vol. 22, no. 5,
this group. Majda [46]studied observability of the system 1972.
which arises (in the same way as the transmission line Section -, “Discrete timesystems, operator models,andscattering
11) from scattering for (5.1) with A = l/a in the extension of theory,” I . Funct. Anal., vol. 16, no. 1, pp. 15-38,May 1974.
-, “operator theory techniques for distributedsystems,” in
an obstacle 52 withboundaryconditions &/an +?(x) ut + Conf. Rec. 11th AUertonConf., pp. 131-140, 1973.
(N = 0 on the boundary 252 of the obstacle. He proves when -, “A spectral factorization approach to the distributed stable
160 PROCEEDINGS OF THE IEEE, VOL. 64, NO. 1, JANUARY 1976
regulator problem; the algebraic Riccati equation,” to appear in [45] -, “Theoryof operator nodes on Hilbert spaces,” 1971, in
SIAM J . Contr. Russian.
[31] F. J. Helton and J. W. Helton, “Scattering theory for computers 1461 A. Majda, “Disappearing solutions for the dissipative wave equa-
and other nonlinear systems,” to appear. tion, toappear.
I321 J. W. Helton and I . V. Ralston, “The f m t variation of the scat- 1471 C. Morawetz, “Decay for solutions of Dirchlet exterior problem
tering matrix,” to appear. for the wave equation.”
[ 331 E. Hille and R. S. Phillips, “Functional analysis and semigroups,” [48] C. Morawetz and D. Ludwig.
AMS Colloq., vol. 3 1. [49] B. Sz-Nagy and C. Foias, Harmonic A ~ l y s i sof Operatorson
[ 341 E. Kamen, “On an algebraic theory of systems defined by con- HilbertSpace. Amsterdam, The Netherlands: North Holland,
volution operators,” to appear in Math Syst. Theory. 1970.
[ 351 -, “Topological module structure of linear continuoustime [SO] J. V. Ralston,“Solutionsof the wave equationwith localized
systems.” energy,” Commun. Atre Appl. Math., vol. XXII, pp. 807-825,
[36] -, “Applicationsof modem algebra to infinitedimensional 1969.
systems.” [51 ] -, “Trapped rays in spherically symmetric media and poles of
[37] P.D. Lax and R. S. Phillips, Scattering Theory. New York: the scattering matrix,” Commun. Pure Appl. Math., vol. XXIV,
Academic Press, 1967. pp. 571-582,1971.
[ 381 -, “Scattering theoryfor dissipative hyperbolicsystems,” J. [S2] -, “Variationof the transmissioncoefficientandcomparison
Funct. A ~ l . , v o l .14, pp. 172-235, 1973. theorems for thepurely imaginary polar of the scattering matrix,”
(391 -, “Decaying modes for the wave equation in the exterior of Commun. Pure Appl. Math.,vol. XXV, pp. 45-61, 1972.
an obstacle,” Commun. Pure Appl. Math., vol. XXVII, pp. 737- (531 -, “Local decay of solutions of conservative f m t order hyper-
787,1969. bolic systems in odd dimensional space,” Trans. AMS, vol. 194,
[40] -, “A logrithmic bound on the location of the poles of the scat- 1974.
tering matrix,” Arch. Znst. Mech. AMI., vol. 40, no. 4 , pp. 268- [SS] -, “On the propagationof singularities in solutions ofsyrq
280,1971. metric hyperbolic partial differential equations.”
[41] -, “Scattering theory for acoustic equation in an even number [55] M. Reed and B. Simon, Methods of ModernMathematicsand
of space dimensions,”Indiana J., vol. 22, pp. 101-134, 1972. Physics: F u m t i 0 ~ Analysis.
1 New York: Academic Press, 1972.
[ 4 2 ] -, “On the scatteringfrequencies of the Laplace operator for [56] F. Riesz and B. Sz. Nagy, Functional AMJYS~S.New York:
exterior domains.” Ungar, 1955.
(431 J. L. Lions, OptimalControl ofSystemsGovernedby Partial [57] W. Rudin, FunctionalAnalysis. New York:McGrawNill, 1973.
Differential Equations. New York: Springer, 1971. [ 581 W. Strauss, “Dispersion of waves vanishing on the boundaryof an
[ 4 4 ] M. S. Livsic, “Operators oscillations, waves, open systems,” AMS exterior domain,” to appear.
Trans. Monogr., vol. 34, 1973. [ 591 Yosida, Functional Analysis. New York: Springer, 1966.
Invariant Subspace Methods in Linear

Multivariable=DistributedSystems and Lumped-
Distributed Network Synthesis
Abstmct-Linea multivariable-dhiuted systems and synthesis These factorizations play an essential role in the development and show
problems for lumpeddistributed networks are analyzed. The methods that properties of Hardy functions are of fundamental importance for
used center around the invariant subspace theory of Helson-Lax and this class of distributed systems as properties of rational functions s e
the theory of vectorial Hardy functions. State-space and transfer func- for lumped systems.
tion models are studied and their relations analyzed. We single out a
class of systems andnetworkswith nonrational transfer functions INTRODUCTION
(scattering matrices), for which several of the well-known results for
lumped systems and networks aregeneralized. In particular we develop
the relations between singuhrities of transfer functions and ‘‘natural
modes” of the systems, a degree theory for i n f ~ t e d i m e n s i o n a llinear
systems and a synthesis via lossless embedding of the scattering matrix.
FinaUy coprime factorizations for this class of systems are developed
IF REQUENCY DOMAIN methods in hmped-multivariable
systems have been developed in the last few years for the
analysis and design of control systems [ 1 1. These
methods provided a clear understanding of the interrelations
between statespace and transfer function models for systems
and networks and proved to be extremely useful in practical
Manuscript received April 1, 1975; revised August 1, 1975. The work design applications [ 821.
of J . S. Baras was supported in part by a General Research Board Award Recently, several researchers, Baras [2] -[8], Brockett [31-
from the University of Maryland and the work ofP. Dewilde was sup-
ported in part by the Belgian National Fund for Scientific Research [4], Dewilde [9]-[16],Fuhrmannf171-[23], Helton [24]
(NFWO). and [25], have been investigating a similar approach t o the
I . S. Baras is with the Electrical Engineering Department, University analysis and synthesis of distributed systemsand networks.
of Maryland, College Park, MD 20742.
P. Dewilde is withtheDepartement Elektrotechniek, Katholieke This theory applies to situations where energy considerations
Universiteit Leuven, Heverlee, Belgium. provide the setting of a Hilbert space for the state space of the
1436 SEPTEMBER IEEE,
PROCEEDINGS O F THE 1976
Comments on “Systems with Infinite-Dimensionai REFERENCES

State Space: The Hilbert
Space
Approach” [ 11 R. P. Feynman, R. B. Leighton, and M. Sands, The Feynman
Lectures on Physics, vol. 111. Reading, PA: Addison-Wesley, 1965.
JAMES A. DYER [ 2 ] J. B. Keller and D. W. McLaughlin, “The Feynman integral,”
Amer. Math. Monthly, vol. 82, pp. 451-465, May 1975.
131 H. J. White and S. Tauber, Systems Analy&, Philadelphia: W. B.
Professor Helton has written a very interesting expository paper’ on Saunders, 1969.
the applications of Hilbert space techniques to systems having infiite-
dimensional state spaces. I find it, however, to be rather surprising
that auantum mechanics. which is the best example in all mathematical Reply2 by J. W.Helfon
physik of the applications of Hilbext space t&hniques to the study
of infiitedimensional systems, is not mentioned. It ispleasing to hear that my article seemed natural enough to be
Not only does quantum mechanics provide excellent examples of termed expository, since sections II, III, and IV were new. Professor
infiitedimensional systems, the language currently used by many Dyer gives a nice discussion of how basic quantum mechanics itself can
physicists to describe a quantum system is the same as that used by a be described as an infinitedimensional system. Quantum mechanics
systems engineer to descriie an engineeringsystem from thestate was indeed not mentioned in the paper and so here are some comments
viewpoint. To illustrate this, consider a singleparticle quantum m e on the scattering theory aspects of it.
chanical system. Such a system is completely descri-bed by a probabil- The paper introduces a particular definition of infinitedimensional
ity amplitude function 6 (x, y , z,f ) . For fixed r,
(f ) = (x, y , z, I ) is
A , B, C, D-type linear systems and in Section 111 proves thatit is
equivalent to Lax-PWips scattering. Now their theory contains
required to be an element of the unit sphere of the Hilbert space
r,
L2(R3). For fmed this function is usuallycalled the state of the quantum mechanical scattering, although the examples they empha-
size are classical. This correspondence is explained in Chapter VI,
system even by physicists. The propagation in time of the state is given
by Schriidinger’s equation Section 4, of their book. Roughly, to a Schrodinger equation scatter-
ing problem with s-matrix S there is associated a scattering problem
for a wave equation with scattering matrix r. m e n S(Z) = r (&I.
Functions with branch cuts are not pseudomeromorphic in the sense
of Section IV-D of the paper and so S(z) is not pseudomeromorphic.
@ (0) = eo Also note that most results d e m i in the exposition, Section V,hold
not only for classical but for quantum mechanical scattering. We
where i = f i , h is (2n)‘l times Planck’s constant, and X known as should emphasize that when wesay quantum mechanical scattering
the Hamiltonian operator, is a self-ad‘oint unbounded linear operator we automatically mean very special input and output operators B, C
1
defined on a dense subspace of Lz(R ). It will be observed that this and so these comments do not address many systems based on
equation has the form of the usual state propagation equation of linear Schrodinger’s equation.
systems theory. It will be observed from this equation also that
quantum mechanics concerns itself only with zero-input solutions. The
’Manuscript received A r i l 7 , 1976.
quantities which a systems engineer would call outputs of this system
would be called observables by a quantum physicist. An observable y
’J. W. Helton is with tge Department of Mathematics, University of
California at San Diego, LaJolla, CA.
of a singleparticle system is related to the state of the system by the
equation
Y = (e,I:e) Comments on ‘‘Geometric progression Ladder RC Networks”
where (, ) denotes the inner product on L 2 ( R 3 ) and L is a linear self- L. GRUNER
adjoint unbounded operator defmed on a dense subspace of L2(R3). A k t m t - I t is ahown that a large class of nonrecunent RC Wer
For example, if K is the Hamiltonian operator then the observable networks including, as a special case, the geometric ladder networks
E = (e, K I$) is the energy of the system. Again, one finds here a discussed by Bha-uyya and S w m y a n be represented as an
standard type of output equation for a systemhaving an infinite- equivalent RCG ladder network.
dimensional state space and a onedimensional output. An excenent
elementary introduction to quantum mechanics from the state view- Bhattacharyya and Swamy’s [ l ] conjecture that other nonrecurrent
pointisgivenin [ l ] . ladder networks may be represented in terms of recurrent ladder net-
In the Feynman integral approach to quantum mechanics one also works is readily answered with reference to the author’s earlier publica-
makes use of the concept of state transition operator. Here one writes tions [ 21 -[ 31.
the state@ ( f )as The application of Kirchhoff‘s law to the nth node of the ladder net-
work of Fig. 1 shows that the voltage V(n) satisfies the difference
OD-- equation
K(t, a,?, 2) ~$0(a,?,2) d a d y ” d 2 ,
@ ( t ) = L L L
where J( is an Lz valued function known as the propagator. In fact
x is the i m p u k response of the quantum system and is the solution to
~chriidinger’sequation for the initial state eo e,?,
9 ) = 6 (X - 2) . (1)
-
6 (y -9) 6 (z 2). SinceSchrddinger’s equation does not have a where for a nonrecurrent RC network the series impedance Z(n) = R ( n )
x
Green’s function in the usual mathematical sense, is determined as and the shunt admittance Y(n)= sC(n).
In the special case of a recurrent ladder network, Z ( n ) and Y(n) are
the value of an abstract integral known as a Feynman integral. An
elementary introduction to this approach to quantum theory is given invariant with respect to n ; in particular, if Z ( n ) = R , and Y(n) =
in [2]. Gu + sC,, then replacing V(n) by V,(n), the solution of the difference
Thereason forthe parallelism between the usual formulations of equation (1) is given by
quantum mechanics and the present formulations of systems theory
is that both subjects have a common parent in classical Hamiltonian
mechanics. These origins are discussed in detail in [3], part 11. Since
both systems theory and quantum mechanicshave followed inde-
pendent but approximately parallel paths since their origins it is pos-
ablethat cross-fertilization between the two subjects could prove
beneficial to both.
Manuscript received January 30, 1976.

The author is with the Department of Mathematics, Iowa State Uni- Manuscript received November1 1, 197 5.
versity Ames, IA 50010. The author is with the Department of Electrical Engineering,Monash
‘J.W . Helton, Roc. B E E , vol. 64, pp. 145-160, Jan. 1976. University, Clayton, Victoria, Australia.

H76b

Uploaded by

H76b

Uploaded by

Systems with Infinite-Dimensional State Space:

The Hilbert Space Approach

)*d = A x ( t ) + Bu( t ) where A is the unbounded operator onHE defined by

mally the state x ( t ) arising from input functionu ( t ) is

If x. E !??(A*)'and u ( t ) is a continuous U-valued function,

[Xu] (s) = im E(t + s) u(-'s) ds.

which up to u(r) + u(-r) equals the Hankel Operator for the

boonto aphysicalunderstanding of thisformalism-rather

TRANSNISSION HOT ROD

parameter semigroup of operators on H which describes how TENPERATURE

Val [ 0, t ] and 0 off of that interval. Define R , : L 2 ( 0 ,00, H ) following identities

L 2 ( 0 , 00, H) by [ R r f ](r) = x [ ~rj(r)f(t

Phillips model is obviously LZ(-=,0, U ) @ X @ L 2 ( 0 ,=, Y) d'rp(w) + P ( t ) = P(t + w ) (3.2b)

The first term is C&(w- r + 'I,for t Q r Q w + c and 0 for all

Properties of the systemcorrespond topropertiesof T(t)

(a) + (b) + eA 'uniformly bounded T(r) uniformly bounded

over Ut > range Rt is dense in L 2 (0, =, V), so max, I I P ( t ) I I =

This plus the middle integral is

which added to the first integral extendstheformulato

P(t)u = I" 6 + &(f-r)B u(-r) dr

The main influence of Q is in the middle row of S ( t ) * and we

The classical scattering operator is W,*W- and from our com-

[I: C eACt-')B g(g) d t

That is, the scattering operator is g. This completes the com-

space of M . We definethe 63 pseudo-inverse of M , written From (4.1), we see that

( A - p)-" B(eP(g-') u(t))(")d t .

tion, but it can be naturally viewed as a linear functional on c

[S-' &I (g) = 1, m

Toapplytheir[38,Th. 5.51, we require in additionto required bythetheorembutwith W replacing F . Thusit

lability operator and therefore M is a bounded operator (by

Invariant Subspace Methods in Linear

Comments on “Systems with Infinite-Dimensionai REFERENCES

Manuscript received January 30, 1976.

You might also like