Subspace Identification of Structured State-Space
Models with Unknown Inputs
(¹��Ñ\�(�zG��m�.E£)
Chengpu Yu (|¤Ë)
Key Laboratory of Intelligent Control and Decision of Complex Systems
Beijing Institute of Technology, China
June 24, 2019
1 / 29
Outline
1 Background
2 Problem formulation
3 Identifiability analysis
4 Subspace identification method
5 Numerical simulations
6 Conclusions
2 / 29
Outline
1 Background
2 Problem formulation
3 Identifiability analysis
4 Subspace identification method
5 Numerical simulations
6 Conclusions
3 / 29
Background
Structured state-space model: a spatio-temporal model with structured interconnectionin the space domain and dynamic response in the time domain.
Applications of the structured state-space model identification:
1 Provide network models for general implementation of networked systems.
2 Estimate parameters involved in physical models.
3 Approximate continuous physical objects by finite-element networks.
Figure: Multi-agent network Figure: Mass-spring model
Figure: Finite-element network
4 / 29
Background
Traditional SysID methods require (noisy) input-output measurements; however, withunknown inputs, the identification problem (identifiability analysis + identificationmethod development) becomes challenging.
Examples of structured systems with unknown inputs are: local network system withunknown interconnections, compartmental temperature field with people movement, etc.
Large-scale network
2
3
4
5
1
Figure: Local network
Figure: HVAC system
5 / 29
Outline
1 Background
2 Problem formulation
3 Identifiability analysis
4 Subspace identification method
5 Numerical simulations
6 Conclusions
6 / 29
Problem formulationThe concerned state-space system model:
x(k + 1) = A(θ)x(k) + B(θ)u(k) + Hf (k)
y(k) = C(θ)x(k) + w(k)
u(k)-measurable input, f (k)-unknown input, y(k)-measurable output. The systemmatrices A(θ), B(θ) and C(θ) are affinely parameterized w.r.t. θ ∈ R
l :
A(θ) = A0+A1θ1+· · ·+Alθl , B(θ) = B0+B1θ1+· · ·+Blθl , C(θ) = C0+C1θ1+· · ·+Clθl .
Example 1
A(θ) =
[θ1 0 θ2
θ2 θ1 00 θ2 θ1
]
, B(θ) =
[θ3
00
]
H =
[010
]
, C(θ) =
[θ4
00
]T
.
1
2 3
7 / 29
The unknown input f (k) is assumed to be deterministic. No prior knowledgeof f (k) is available except the persistent excitation condition.
In the literature, only the blind SIMO system identification framework canhandle this identification problem.
y1(k) = H1(z)u(k)
y2(k) = H2(z)u(k)
.
.
.
yL(k) = HL(z)u(k)
Since the SIMO model is a decoupled network model, it cannot cope with theidentification of general structured state-space models.
8 / 29
Outline
1 Background
2 Problem formulation
3 Identifiability analysis
4 Subspace identification method
5 Numerical simulations
6 Conclusions
9 / 29
Identifiability analysis 1 - Full state observation
Given the full state observation, an equivalent state-space model can be written as
x(k + 1) = (A + HQ1)︸ ︷︷ ︸
A
x(k) + (B + HQ2)︸ ︷︷ ︸
B
u(k) + H [fk − Q1x(k) − Q2u(k)]︸ ︷︷ ︸
f (k)
y(k) = Cx(k)
where Q1 ∈ Rr×n and Q2 ∈ R
r×m are ambiguity matrices, and y(k) = y(k) − w(k)denotes the noise-free output.
Without any structural constraints, the system matrices cannot be identified even if thestates can be fully observed. For example, when (A, H) is controllable, the system polescan be arbitrarily assigned.
10 / 29
Identifiability analysis 1 - Full state observation
Theorem 1. Let P⊥
H = I − H(HT H)−1HT . Suppose that the state sequence x(k) can befully observed, the original identification problem boils down to estimating the parametervector θ from the following modified state-space model
P⊥
H x(k + 1) = P⊥
H A(θ)x(k) + P⊥
H B(θ)u(k)
y(k) = C(θ)x(k).
Example 1 (continue)
[x1(k + 1)x3(k + 1)
]
=
[θ1 0 θ2
0 θ2 θ1
] [x1(k)x2(k)x3(k)
]
+
[θ3
0
]
u(k)
y(k) = θ4x1(k).
The state-equation of the second agent is removed due to the unknown input signal;
however, θ can be estimated because of its duplicates in other state equations. When
H = [1 1 1]T , the system parameters are still identifiable.
11 / 29
Identifiability analysis 2 - No direct state observation
The past and future data equations are given as
Yp = OXp + TuUp + Tf Fp
Yf = OXf + TuUf + Tf Ff
where
O =
CCA...
CAs−1
, Tu =
0
CB. . .
.... . . 0
CAs−2B · · · CB
, Tf =
0
CH. . .
.... . . 0
CAs−2H · · · CH
,
Xp = [x(1) x(2) · · · x(h)] , Xf = [x(s) x(s + 1) · · · x(s + h − 1)] ,
Yp =
y(1) y(2) · · · y(h)y(2) y(3) · · · y(h + 1)
...... . . . ...
y(s) y(s + 1) · · · y(s + h − 1)
, Yf =
y(s) · · · y(s + h − 1)y(s + 1) · · · y(s + h)
... . . . ...
y(2s − 1) · · · y(2s + h − 2)
.
12 / 29
Lemma 1. Assume that CH has full column rank and the state-space model described bythe matrix of tuples (A, H, C , 0) is strongly observable , i.e., rank[O Tf ] = n + rank[Tf ].Then, the row subspace of the state space sequence can be estimated as
Row[Xf ] = Row
[Up
Yp
]
∩ Row
[Uf
Yf
]
.
The above lemma indicates that the system state can be estimated up to a similaritytransformation, i.e.,
x(k) = Qx(k)
where x(k) is the state estimate and Q is the similarity transformation matrix.
13 / 29
Given the state estimate x(k), the state-space model can be written as
Qx(k + 1) = AQx(k) + Bu(k) + Hf (k)
y(k) = CQx(k).
By Theorem 1, the identifiability of θ in the original state-space model is equivalent tothe identifiability of (θ, Q) of the following state-space model
P⊥
H Qx(k + 1) = P⊥
H A(θ)Qx(k) + P⊥
H B(θ)u(k)
y(k) = C(θ)Qx(k).
Theorem 2. Let the low-rank factorization of P⊥
H be given as P⊥
H = UHV TH where
UH ∈ Rn×(n−r) and VH ∈ R
n×(n−r) have full column rank. Then, the above state-spacemodel is identifiable if and only if, for any nonsingular matrices Q∗ ∈ R
n×n andΠ ∈ R
(n−r)×(n−r), the following equations
ΠVTH Q
∗ = VTH Q, ΠV
TH A(θ∗)Q∗ = V
TH A(θ)Q
ΠVTH B(θ∗) = V
TH B(θ), C(θ∗)Q∗ = C(θ)Q
yield that θ = θ∗, Q = Q∗ and Π = I.
14 / 29
Example 2 The identifiability of the state-space model with the following structuredsystem matrices is considered:
A(θ) =
[θ1 0 θ2
θ2 θ1 00 θ2 θ1
]
, B(θ) =
[100
]
, H =
[010
]
, C(θ) =
[θ3 1 00 0 θ3
]
.
Choosing the matrix VH as V TH =
[1 0 00 0 1
]
, we have the following equation group
[π11 π12
π21 π22
][q∗
1q∗
3
]
=
[q1
q3
][π11 π12
π21 π22
]
,
[θ∗
1 0 θ∗
20 θ∗
2 θ∗
1
] [q∗
1q∗
2q∗
3
]
=
[θ1 0 θ2
0 θ2 θ1
][q1
q2
q3
]
,
[π11 π12
π21 π22
][10
]
=
[10
]
,
[θ∗
3 1 00 0 θ∗
3
][q∗
1q∗
2q∗
3
]
=
[θ3 1 00 0 θ3
] [q1
q2
q3
]
.
For any non-singular matricesΠ and Q∗, it can be verifiedthat Q = Q∗ and θ = θ∗.
15 / 29
Outline
1 Background
2 Problem formulation
3 Identifiability analysis
4 Subspace identification method
5 Numerical simulations
6 Conclusions
16 / 29
Subspace identification method
Following identifiability analysis, the subspace identification method will be developed bytwo steps:
1 State estimation using the subspace intersection;
2 Parameter estimation by difference-of-convex optimization.
17 / 29
State estimationBy Lemma 1, the system state can be estimated as follows
Row[Xf ] = Row
[Up
Yp
]
∩ Row
[Uf
Yf
]
.
The subspace intersection can be computed by two steps:
1 Take the SVD as follows
Up
Yp − Wp
Uf
Yf − Wf
=
[U1 U2
][
Σ1
Σ2
] [V T
1
V T2
]
2 Partition U2 into two sub-matrices of the same size U2 =[UT
21 UT22
]T, the
intersection can be computed as
Xf = UT21
[Up
Yp − Wp
]
.
18 / 29
Lemma 2 If the matrix [O Tf ] is a tall matrix, then the variance of the measurementnoise can be estimated as
σ2w = λmin
[
limh→∞
1
hYpY
Tp − YpU
Tp (UpU
Tp )−1
UpYTp
]
(1)
where λmin(·) represents the least eigenvalue.
Then, an unbiased estimate of the subspace intersection can be computed as follows.
limh→∞
1
h
[Up
Yp
Uf
Yf
][Up
Yp
Uf
Yf
]T
−
0 0 0 0
0 σ2w I 0
[0 0
σ2w I 0
]
0 0 0 0
0
[0 σ2
w I0 0
]
0 σ2w I
=[
U1 U2
] [Σ1
Σ2
][UT
1
UT2
]
.
Partition U2 into U2 =[UT
21 UT22
]T, we have that
Xf = UT21
[Up
Yp
]
= QXf + UT21
[0
Wp
]
︸ ︷︷ ︸
estimation error
.
19 / 29
Parameter estimation
Define the following row sequences
xh = x(s+2 : s+h+1), xh = x(s+1 : s+h), uh = u(s+1 : s+h), yh = y(s+1 : s+h).
Substituting the estimated state into the original state-space model yields
P⊥
H Qxh = P⊥
H A(θ)Qxh + P⊥
H B(θ)uh + η1
yh = C(θ)Qxh + wh + η2,
where η1 and η2 are the state estimation errors that are asymptotically uncorrelated withthe inputs and states.
Step 1. The parameters can be estimated by solving the following optimization problem
minQ,θ
∥∥P⊥
H Qxh − P⊥
H A(θ)Qxh − P⊥
H B(θ)uh
∥∥
2
F+ ‖yh − C(θ)Qxh‖2
F .
20 / 29
Step 2. The bilinear estimation problem is equivalent to a rank-constrained optimizationproblem
minQi ,A,C,θ,Γ
∥∥P⊥
H Qxh − P⊥
H Axh − P⊥
H B(θ)uh
∥∥
2
F+ ‖yh − Cxh‖2
F
s.t. A = A0Q +
l∑
i=1
Ai Qi , C = C0Q +
l∑
i=1
Ci Qi
Γ =
[1 θ1 · · · θl
vec(Q) vec(Q1) · · · vec(Ql )
]
rank[Γ] = 1 (‖Γ‖∗ − ‖Γ‖2 = 0)
Step 3. By treating the nonnegative constraint ‖Γ‖∗ − ‖Γ‖2 as a penalty, we can obtainthat
minQi ,A,C,θ,Γ
∥∥P
⊥
H Qxh − P⊥
H Axh − P⊥
H B(θ)uh
∥∥
2
F+ ‖yh − Cxh‖
2F + λ (‖Γ‖∗ − ‖Γ‖2)
s.t. A = A0Q +
l∑
i=1
Ai Qi , C = C0Q +
l∑
i=1
Ci Qi
Γ =
[1 θ1 · · · θl
vec(Q) vec(Q1) · · · vec(Ql )
]
This optimization problem is solved by the sequential convex programming approach.21 / 29
Outline
1 Background
2 Problem formulation
3 Identifiability analysis
4 Subspace identification method
5 Numerical simulations
6 Conclusions
22 / 29
Blind identification of an SIMO modelConsider the following SIMO model y(k) =
∑L
i=1hiu(k − i) + w(k) or
x(k + 1) = Ax(k) + Hf (k), y(k) = Cx(k) + w(k)
where
A =
01 0
. . .. . .
1 0
, H =
10...
0
, C =[
h1 h2 · · · hL
].
The data equation has the following extended observability matrix and convolution matrix
O =
h1 · · · hL
0. . .
......
. . . h1
0 · · · 0... · · ·
...
0 · · · 0
, Tf =
0hL 0...
. . .. . .
h1 · · · hL 0...
. . .. . .
. . . 00 · · · h1 · · · hL
.
It can be seen that [O Tf ] is exactly a convolution matrix of the FIR model.
23 / 29
10 20 30 40 50 60 70 80 90 100 110
SNR (dB)
10-7
10-6
10-5
10-4
10-3
10-2
10-1
100
NE
Es
Subspace fitting method
The proposed method
Figure: Comparison between the subspacefitting method and the proposed method.
200 400 600 800 1000
Data length
0
2
4
6
8
10
12
14
16
18
NE
Es
10-3
Figure: Performance of the proposedmethod against the data length.
24 / 29
Structured state-space model
Consider Example 2 with the system matrices
A(θ) =
[θ1 0 θ2
θ2 θ1 00 θ2 θ1
]
, B(θ) =
[100
]
, H =
[010
]
, C(θ) =
[θ3 1 00 0 θ3
]
and the parameter vector θ =[
0.5 0.3 1]T
.
It has been verified that this structured state-space model is identifiable in the presence
of the unknown input signal.
25 / 29
10 20 30 40 50 60 70 80 90 100 110
SNR (dB)
10-8
10-6
10-4
10-2
100
102
104
106
NE
Es
Nuclear norm method
Gauss-Newton method
The proposed method
Figure: Performance comparison among theGauss-Newton method, nuclear-normmethod and the proposed method.
200 400 600 800 1000
Data length
0
2
4
6
8
10
12
14
16
18
NE
Es
10-3
Figure: The recovered unknown input signalat SNR=30dB.
26 / 29
1 Background
2 Problem formulation
3 Identifiability analysis
4 Subspace identification method
5 Numerical simulations
6 Conclusions
27 / 29
Conclusions
Conclusions
1 Identifiability conditions for the structured state-space models
2 Subspace-based state estimation with noise compensation
3 Difference-of-convex programming approach for the bilinear estimation
Future work
Simultaneous system structure reconstruction and the system parameter estimation using(partially) measurable input-output data.
28 / 29
THANK YOU!
29 / 29
Top Related