Académique Documents
Professionnel Documents
Culture Documents
e = [e1 , e2 , e 3 , e 4 ,..., e N ]
in reality, the set is finite and the data are inaccurate example 1: observations of the strength of gravity as a function of position on Earths surface example 2: observations of concentration of a chemical in a 2-reservoir system
model: a mathematical model of reality, represented by a set of model parameters m = [ m1 , m2 , m 3 , m 4 ,..., m M ] M can be infinite example 1: 3D density structure within Earth example 2: diffusion constant
forward modeling: given a model, compute data simulate an experiment for a known model incorporates the laws of physics, plus the experiment design e = F{m} F is any mathematical functional (it is often difficult to separate the data from the general equation example 1: Newtons Law of gravity, location of gravity observations example 2: diffusion equation, time of composition observations
G{m, e} = 0
GEOS 5004: Analytical Geosciences Hole: 1.1 Inversion: Inversion and Vector Spaces
forward modeling:
given data, directly compute a model that is consistent with it m = F 1 {e} F-1 is the mathematical inverse of F in general, the inverse function does not exist in a unique form => inverse theory
Hole: 1.2
the discrete, linear inverse problem the inverse problem is discrete if the number of unknown model parameters the inverse problem is linear if
mj
is finite:
F{m} = F{m}
and
example: fitting a straight line assumed physics: y = + x model parameters: intercept and slope
m = [ m1 , m2 ] = [, ]
data: values of
yi
xi
ei = 1m1 + x i m2 + x i 2 m 3 + x i 3 m 4
example: gravity anomaly measure the strength of Earths gravity field at different positions on a flat surface subtract the background average gravity to obtain the gravity anomaly g this gravity anomaly is due to anomalous mass, or , the density minus the average density Newtons Law of Gravitation says: (where x,y,z are measured from the ith gravity station)
gi = G
M
earth
(x
+y +z
( x , y, z)dzdydz
then
Hole: 1.4
example: straight-ray x-ray tomography (CT scan) or seismic tomography tomography = slice picture (Greek)
ei =
( raypath ) i
m( x, y, z)dl
for medical tomography: m(x,y,z) is the x-ray attenuation coefficient at each point in space ei are the natural logarithm of the observed x-ray amplitudes ln(A) for seismic tomography: m(x,y) is one divided by the seismic wave speed ei are the observed travel times
1/v
if the model is discretized into a set of grid cells, with constant physical properties in each cell mj is the attenuation or slowness in each grid cell and the integral becomes a summation
M
ei = Lij m j
j =1
where
Lij
note that both data and model in the tomography example are derived properties the equations have been rearranged to make them linear
GEOS 5004: Analytical Geosciences Hole: 1.5 Inversion: Inversion and Vector Spaces
scalar:
multiplication: C = AB
T transpose: Cij = C ji
u = Av
ui = Aik v k
k =1 K
identity matrix:
=1 i = j Iij = ij =0 i j
inverse of a matrix:
2x2 matrix:
AA 1 = I a b A = c d
GEOS 5004: Analytical Geosciences
Hole: 1.6
1 2 A = 3 4 A=
write
AT =
compute
1 compute A =
compute
AA 1 = I
?
does it equal
Hole: 1.7
ei = gij m j
j =1
i = 1,2,..., N
this is a system of
equations in
in matrix notation
e = Gm
e1 e e = 2 M e N
m1 g1 m g2 m = 2 G = M M m g M N
g i = [ gi1
gi 2 L giM ]
to find the unknown model parameters: however, the solution only exists if
vector spaces our (Euclidean) universe is a three-dimensional space ,y , and z the Cartesian coordinate system (x,y,z) is defined by three unit vectors x every vector in this 3D space can be written in terms of its components in each direction
+ vy y + vz z = (v x , v y , v z ) v = vx x
the norm that defines length in our universe is Pythagoruss Theorem
angle
2 2 2 2 v = v x + v y + vz = vi i= x ,y ,z
1 2
between two vectors in our universe is defined by the vector dot product
u v = u v cos = u x v x + u y v y + u z v z =
u v
i= x , y ,z
i i
two vectors are perpendicular (orthogonal) if their dot product equals zero
the unit vectors are orthonormal: length of 1 and perpendicular to each other
= y = z =1 x
= v cos x vx = v x
y =x z =y z =0 x
= v cos y vy = v y = v cos z vz = v z
these vectors define a basis for the vector space (a coordinate system)
the coordinates of the vector are the projections (dot products) of the vector on the unit vectors
the model vector m lies in an M-dimensional vector space the data vector e lies in an N-dimensional vector space
GEOS 5004: Analytical Geosciences Hole: 1.9 Inversion: Inversion and Vector Spaces
l1 norm:
v 1 = vi
i =1
l norm:
2 v 2 = vi i =1
N N
1 2
l norm: l norm:
p v p = vi i =1
v
2
= lim v p = max vi
p i=1
weighted l norm:
v 2w
e.g.,
2 = wi vi i =1
N
wi
v = [ 1, 3, 4, 4] u = [ 5,0,1, 4] v 1 = 12 v 2 = 42 u 1 = 10 u 2 = 42
v1> u1 v2 = u2
Hole: 1.10
v =4 u =5 v< u
Inversion: Inversion and Vector Spaces
dot product need to define angles between vectors need to define when vectors are perpendicular the dot product is defined as
N
u v = u v cos = ui vi
note that this definition of angle is associated with the l 2 norm (should not be used with others) two vectors are perpendicular (orthogonal) if their dot product equals zero compute the dot product of v = [ 1, 3, 4, 4] and what is the angle between them?
i=1
u = [ 5,0,1, 4]
gi
vectors
the data equations are the dot product of the model with the
M
ei = gij m j
j =1
ei = g i m
the gi vectors (the rows of G ) are called the data kernels each vector gi points in a direction in the models M-dimensional vector space what we know the data are the projections of the model upon the data kernels however, the data kernels do not form a very good coordinate system for the models vector space they are not orthogonal to each other; in fact, they are usually close to parallel
GEOS 5004: Analytical Geosciences Hole: 1.11 Inversion: Inversion and Vector Spaces
where the vertical bars indicate the determinant of the matrix inside e.g., for a 2x2 matrix:
b = (a )(d ) bc = 2 + (a d ) + (ad bc) = 0 d solve the quadratic equation in for the two eigenvalues B B 2 4 AC 2 for the quadratic equation Ax + Bx + C = 0 solutions are x = 2A an NxN matrix will involve the solution of an Nth order polynomial to find N eigenvalues
a c
i (i = 1,2,N) are defined r by i = 0 i = 1 r (A i I)r finding each eigenvector involves solving a system of N equations in N unknowns (the
components of the vector), subject to the additional unit-length constraint (note that some algorithms will not force the eigenvectors to have unit length)
1 2 A = 3 4
Hole: 1.13
spectral decomposition if there exist N different, real, non-zero eigenvalues for the matrix A , the eigenvectors form a new basis (coordinate system) for the N-dimensional vector space create a diagonal matrix whose values are the eigenvalues create a matrix
1 0 0 =0 O 0 0 0 N
then the original matrix can be diagonalized as
1 L r N R = r
A = RR1
assuming
N = M
m =R m e = R1e m = Rm e = Re m ,e are the coordinates of m , e in the coordinate system defined by the eigenvectors this gives a system of equations that is trivial to invert i m =e m i = e i the only problem with the method above is that the R matrix still needs to be inverted
GEOS 5004: Analytical Geosciences Hole: 1.14 Inversion: Inversion and Vector Spaces
spectral decomposition in the special case of a symmetric matrix if the eigenvalues are different (and non-zero), the eigenvectors will be orthonormal
i r j = ij r the eigenvectors then form an orthonormal basis for the N-dimensional vector space because the eigenvalues are orthonormal, the inverse of R is the transpose of R RRT = RTR = I then the original matrix can be diagonalized as A = RRT
assuming
N=M
and
is symmetric (a very special case, but one we will come back to)
RRT m = e RT m = RT e
m = RT m e = RT e m = Rm e = Re m ,e are the coordinates of m , e in the coordinate system defined by the eigenvectors that is trivial to invert gives a system of equations i m =e m i = e i
Hole: 1.15
linear dependence the data equations are the dot product of the model with the
ei = g i m each data kernel gi points in a direction within the models M-dimensional vector space each vector defines a direction in which you know the length of the model (from ei ) the N data kernels together define the N directions in which you know the length of the model
if one (or more) of the data kernels
N
gi
gi
can be expressed as a linear combination of the other rows, is satisfied for some
c g
i i=1
=0
ci 0
then the system of equations (or the matrix if not, the the system is linearly independent
G ) is linearly dependent
if a particular data kernel is linearly depended upon the previous ones, then it measures the length of the model in a direction that is already known => adds no new information the total number of directions in which you know the model is less than N g 2 = [0,1,0] g 3 = [1,2,0] e.g., in 3D space: g1 = [1,0,0] direction these three vectors are linearly dependent; none of the three vectors samples the z
GEOS 5004: Analytical Geosciences Hole: 1.16 Inversion: Inversion and Vector Spaces
over-determined and under-determined problems if the number of data exceed the number of unknowns: N > M the problem is over-determined some of the data kernels must be linear combinations of each other because they only span the M-dimensional model space; G is linearly dependent since the data include measurement errors, the system of equations is usually inconsistent => no solution exactly matches all of the data often the model parameterization is not an exact representation of reality e.g., the physics is only approximated by the equation you are trying to fit e.g., the earth is only approximated by a set of grid cells => no solution can exactly match all of the data solution: find the best fitting model but need to define best if the number of data is less than the number of unknowns: N < M the problem is under-determined some directions in the M-dimensional model space are not sampled by any of the data kernels the length of the model in directions perpendicular to the set of data kernels is unknown models that fit the data can have any length in these directions => the model is non-uniquely determined => an infinite number of models match the data solution: find the best model that fits the data but need to define best
GEOS 5004: Analytical Geosciences Hole: 1.17 Inversion: Inversion and Vector Spaces
mixed-determined problems in practice, we often have the mixed case: some of the data kernels are linear combinations of each other (linearly dependent) => cannot match all of the data but the total number of independent data is less than M => the model is non-uniquely determined this problem is mixed determined solution: find the best model that best fits the data but need to define both bests!
inverse theory is about making choices on best, and choosing tools to solve the problem
Hole: 1.18