Advanced Array Signal Processing Techniques for Beamforming and...

120
Fakultät für Elektrotechnik und Informationstechnik In Zusammenarbeit mit Advanced Array Signal Processing Techniques for Beamforming and Direction Finding Masterarbeit zur Erlangung des akademischen Grades Master of Science vorgelegt von: Jens Steinwandt Matrikelnummer: 39733 geboren: 14. März 1986 in Leinefelde Studiengang: Elektrotechnik und Informationstechnik Studienrichtung: Informationstechnik und Kommunikationstechnik Anfertigung im Fachgebiet: Nachrichtentechnik Fakultät für Elektrotechnik und Informationstechnik verantwortlicher Professor: Univ.-Prof. Dr.-Ing. Martin Haardt Erster wissenschaftlicher Betreuer: Dr. Rodrigo C. de Lamare Zweiter wissenschaftlicher Betreuer: Univ.-Prof. Dr.-Ing. Martin Haardt Datum: 9. Mai 2011

Transcript of Advanced Array Signal Processing Techniques for Beamforming and...

  • Fakultät für Elektrotechnik und Informationstechnik

    In Zusammenarbeit mit

    Advanced Array Signal Processing Techniques forBeamforming and Direction Finding

    Masterarbeitzur Erlangung des akademischen Grades Master of Science

    vorgelegt von: Jens Steinwandt

    Matrikelnummer: 39733

    geboren: 14. März 1986 in Leinefelde

    Studiengang: Elektrotechnik und Informationstechnik

    Studienrichtung: Informationstechnik und Kommunikationstechnik

    Anfertigung im Fachgebiet: Nachrichtentechnik

    Fakultät für Elektrotechnik und Informationstechnik

    verantwortlicher Professor: Univ.-Prof. Dr.-Ing. Martin Haardt

    Erster wissenschaftlicher Betreuer: Dr. Rodrigo C. de Lamare

    Zweiter wissenschaftlicher Betreuer: Univ.-Prof. Dr.-Ing. Martin Haardt

    Datum: 9. Mai 2011

  • Acknowledgements i

    ACKNOWLEDGEMENTS

    Firstly, I would like to express my sincere gratitude to Prof. Martin Haardt for giving

    me the chance to work on such an interesting topic under his supervision. His constant

    support of fresh ideas, comments and questions have encouraged me to go beyond what

    I believed would be my limits.

    Secondly, I am very grateful to Dr. Rodrigo C. de Lamare for his help, valuable

    supervision and useful advice for my research, without which most of this work would

    not have been possible.

    Further thanks go to all members of the Communications Research Group at the

    University of York and the members of the Communications Research Laboratory at

    Ilmenau University of Technology, for their support throughout my master’s research.

    I would like to thank my family, especially my mother, for constantly supporting

    me in many ways over the last year in York. I would not have made it to this point if

    you had not believed in me and encouraged me to discover my full potential.

    Finally, I want to thank Lola for being there for me and for being the source of my

    strength.

    Masterarbeit Jens Steinwandt

  • Abstract ii

    ABSTRACT

    Beamforming and direction of arrival estimation are areas of array signal processing

    devoted to extracting information of interest from signals received over the spatial aper-

    ture of an antenna array. The need for these techniques is encountered in a broad range

    of important engineering applications, including radar, sonar, wireless communications,

    radio astronomy and biomedicine, and is still a vital area of research.

    In the first part, we focus on the development of widely-linear adaptive beamforming

    algorithms designed according to the constrained minimum variance (CMV) principle

    for strictly non-circular sources. Specifically, we devise three adaptive algorithms based

    on the Krylov subspace, applying the auxiliary vector filtering (AVF), the conjugate

    gradient (CG), and the modified conjugate gradient (MCG) algorithms, which avoid

    the covariance matrix inversion required by the constraint. The proposed adaptive

    algorithms fully exploit the second-order statistics of the strictly non-circular data and

    achieve significant performance gains.

    The second part is concerned with direction finding algorithms in the beamspace

    domain. It is demonstrated that operation in beamspace substantially reduces the

    computational complexity while providing increased estimation performance compared

    to the element space. Therefore, two Krylov subspace-based algorithms, employing

    the AVF and the CG algorithm developed for direction finding, are extended to the

    beamspace processing. The development is followed by an extensive performance eval-

    uation, highlighting their superior resolution performance for closely-spaced sources at

    a low SNR and a small sample size.

    In the last part, we introduce a new strategy of incorporating prior knowledge for

    direction finding to improve the estimation performance of unknown signal sources.

    The novel approach is developed for situations with a limited data record and is based

    on an enhanced covariance matrix estimate obtained by linearly combining the sample

    covariance matrix and a prior known covariance matrix in an automatic fashion. As

    a result, knowledge-aided MUSIC-type and ESPRIT-type direction finding algorithms

    are devised and evaluated. Extensive studies to assess the performance illustrate that

    the exploitation of prior knowledge is translated into a significantly better estimation

    performance.

    Masterarbeit Jens Steinwandt

  • CONTENTS iii

    CONTENTS

    Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i

    Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ii

    Contents . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii

    List of Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi

    List of Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . viii

    1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.1 Adaptive beamforming . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.2 Direction of arrival estimation . . . . . . . . . . . . . . . . . . . . . . . 31.3 Overview and contributions . . . . . . . . . . . . . . . . . . . . . . . . 41.4 List of publications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

    2. System Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72.1 Data model for direction of arrival estimation . . . . . . . . . . . . . . 72.2 Signal subspace estimation . . . . . . . . . . . . . . . . . . . . . . . . . 9

    2.2.1 True signal subspace . . . . . . . . . . . . . . . . . . . . . . . . 92.2.2 Direct data approach . . . . . . . . . . . . . . . . . . . . . . . . 102.2.3 Covariance approach . . . . . . . . . . . . . . . . . . . . . . . . 10

    2.3 Data model for CMV beamforming . . . . . . . . . . . . . . . . . . . . 11

    3. Widely-linear adaptive beamforming using Krylov-based algorithms . . . 133.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133.2 LCMV beamforming . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153.3 Non-circularity and WL-CMV beamforming . . . . . . . . . . . . . . . 16

    3.3.1 Non-circularity of signal sources . . . . . . . . . . . . . . . . . . 173.3.2 WL-CMV beamforming assuming strict non-circularity . . . . . 17

    3.4 Widely-linear beamforming based on the AVF algorithm . . . . . . . . 193.5 Widely-linear beamforming based on the CG algorithm . . . . . . . . . 213.6 Widely-linear beamforming based on the MCG algorithm . . . . . . . . 243.7 Computational complexity . . . . . . . . . . . . . . . . . . . . . . . . . 27

    3.7.1 Complexity of LCMV algorithms . . . . . . . . . . . . . . . . . 283.7.2 Complexity of WL-CMV algorithms . . . . . . . . . . . . . . . . 29

    3.8 Simulation results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 303.8.1 Performance in stationary scenarios . . . . . . . . . . . . . . . . 303.8.2 Performance in non-stationary scenarios . . . . . . . . . . . . . 32

    Masterarbeit Jens Steinwandt

  • CONTENTS iv

    3.9 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33

    4. Beamspace direction finding using Krylov subspace-based algorithms . . 354.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 354.2 Beamspace processing . . . . . . . . . . . . . . . . . . . . . . . . . . . 374.3 Beamspace transformation matrix . . . . . . . . . . . . . . . . . . . . . 39

    4.3.1 Discrete Fourier transform (DFT) beamspace . . . . . . . . . . 394.3.2 Discrete prolate spheroidal sequences (DPSS) beamspace . . . . 404.3.3 Comparison of the DFT and DPSS spatial filters . . . . . . . . 41

    4.4 Beamspace direction finding based on the CG algorithm . . . . . . . . 414.5 Beamspace direction finding based on the AVF algorithm . . . . . . . . 434.6 Computational complexity . . . . . . . . . . . . . . . . . . . . . . . . . 46

    4.6.1 Subspace estimation via SVD . . . . . . . . . . . . . . . . . . . 464.6.2 Subspace estimation via EVD . . . . . . . . . . . . . . . . . . . 48

    4.7 Simulation results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 504.7.1 Comparison of the proposed algorithms in the DFT beamspace . 504.7.2 Comparison of the DFT and DPSS beamspace . . . . . . . . . . 54

    4.8 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56

    5. Knowledge-aided direction of arrival estimation . . . . . . . . . . . . . . . 575.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 575.2 Enhanced covariance matrix estimation . . . . . . . . . . . . . . . . . . 59

    5.2.1 Computation of the a priori covariance matrix . . . . . . . . . . 595.2.2 Problem statement . . . . . . . . . . . . . . . . . . . . . . . . . 605.2.3 Computation of the optimal weight factors . . . . . . . . . . . . 615.2.4 Estimation of the optimal weight factors . . . . . . . . . . . . . 655.2.5 Improved estimation of the optimal weight factors . . . . . . . . 65

    5.3 General knowledge-aided direction of arrival estimation scheme . . . . . 675.3.1 Structure of KA DOA estimation . . . . . . . . . . . . . . . . . 685.3.2 Computational complexity . . . . . . . . . . . . . . . . . . . . . 69

    5.4 Knowledge-aided ESPRIT-type direction of arrival estimation . . . . . 705.4.1 KA-ESPRIT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 705.4.2 Unitary KA-ESPRIT . . . . . . . . . . . . . . . . . . . . . . . . 73

    5.5 Knowledge-aided MUSIC-type direction of arrival estimation . . . . . . 785.5.1 KA-MUSIC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 785.5.2 KA-Root-MUSIC . . . . . . . . . . . . . . . . . . . . . . . . . . 80

    5.6 Simulations with accurate a priori knowledge . . . . . . . . . . . . . . 835.6.1 Comparison of LC1 and LC2 using KA-ESPRIT . . . . . . . . . 835.6.2 Comparison of the proposed KA DOA estimation algorithms . . 885.6.3 Closely-spaced signal sources . . . . . . . . . . . . . . . . . . . . 925.6.4 Widely-spaced signal sources . . . . . . . . . . . . . . . . . . . . 925.6.5 Correlated signal sources . . . . . . . . . . . . . . . . . . . . . . 93

    5.7 Simulations with inaccurate a priori knowledge . . . . . . . . . . . . . 945.8 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95

    6. Summary and future work . . . . . . . . . . . . . . . . . . . . . . . . . . . 97

    Masterarbeit Jens Steinwandt

  • CONTENTS v

    Appendix A. Derivations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99A.1 Proof of equation (3.8) . . . . . . . . . . . . . . . . . . . . . . . . . . . 99A.2 Real-valued source symbols for strict non-circularity . . . . . . . . . . . 100A.3 Proof of equation (3.23) . . . . . . . . . . . . . . . . . . . . . . . . . . 101A.4 Proof of equation (3.43) . . . . . . . . . . . . . . . . . . . . . . . . . . 101A.5 Proof of equation (5.30) . . . . . . . . . . . . . . . . . . . . . . . . . . 102

    Acronyms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103

    Symbols and Notation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105

    Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106

    Theses . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110

    Erklärung . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111

    Masterarbeit Jens Steinwandt

  • LIST OF FIGURES vi

    LIST OF FIGURES

    2.1 Generic model of a ULA . . . . . . . . . . . . . . . . . . . . . . . . . . 7

    2.2 Block diagram of a narrowband beamformer . . . . . . . . . . . . . . . 11

    3.1 Contour lines of complex Gaussian random variables with different ρ . . 17

    3.2 Structure of the WL-AVF algorithm . . . . . . . . . . . . . . . . . . . . 19

    3.3 Structure of the WL-CG algorithm . . . . . . . . . . . . . . . . . . . . 22

    3.4 Structure of the WL-MCG algorithm . . . . . . . . . . . . . . . . . . . 25

    3.5 Computational complexity of LCMV and WL-CMV algorithms . . . . . 28

    3.6 Output SINR versus the snapshot number in a stationary scenario . . . 31

    3.7 Output SINR versus the input SNR in a stationary scenario . . . . . . 31

    3.8 Output SINR versus the snapshot number in a non-stationary scenario 32

    4.1 Diagram of the beamspace environment . . . . . . . . . . . . . . . . . . 37

    4.2 Comparison of the DFT and DPSS spatial filters . . . . . . . . . . . . . 41

    4.3 Complexity in terms of arithmetic operations applying the SVD . . . . 48

    4.4 Complexity in terms of arithmetic operations applying the EVD . . . . 50

    4.5 Probability of resolution versus the SNR in the DFT beamspace . . . . 52

    4.6 Probability of resolution versus the snapshots in the DFT beamspace . 52

    4.7 RMSE versus the SNR in the DFT beamspace . . . . . . . . . . . . . . 53

    4.8 Comparison of the DFT and DPSS beamspace regarding the probability

    of resolution versus the SNR . . . . . . . . . . . . . . . . . . . . . . . . 54

    4.9 Comparison of the DFT and DPSS beamspace regarding the probability

    of resolution versus the snapshot number . . . . . . . . . . . . . . . . . 55

    4.10 Comparison of the DFT and DPSS beamspace regarding the RMSE

    versus the SNR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55

    5.1 Block diagram of the KA DOA estimation scheme . . . . . . . . . . . . 68

    5.2 Comparison of the LC1 and LC2 regarding the RMSE versus the SNR . 84

    5.3 Combination factors of the LC1 and LC2 versus the SNR . . . . . . . . 84

    5.4 Comparison of the LC1 and LC2 regarding the RMSE versus the snapshots 85

    5.5 Combination factors of the LC1 and LC2 versus the snapshot number . 85

    Masterarbeit Jens Steinwandt

  • LIST OF FIGURES vii

    5.6 RMSE versus the SNR of KA-ESPRIT and forward-backward averaging 87

    5.7 RMSE versus the snapshot number of KA-ESPRIT and forward-backward

    averaging . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87

    5.8 Comparison of KA-ESPRIT-type algorithms . . . . . . . . . . . . . . . 89

    5.9 Comparison of Unitary KA-ESPRIT-type algorithms . . . . . . . . . . 89

    5.10 Comparison of all the proposed KA algorithms . . . . . . . . . . . . . . 90

    5.11 Comparison of KA-MUSIC-type and constrained MUSIC-type algorithms 91

    5.12 Comparison of the proposed KA algorithms for closely-spaced sources . 92

    5.13 Comparison of the proposed KA algorithms for widely-spaced sources . 93

    5.14 Comparison of the proposed KA algorithms for correlated sources . . . 94

    5.15 Comparison of the proposed KA algorithms with inaccurate knowledge 95

    Masterarbeit Jens Steinwandt

  • LIST OF TABLES viii

    LIST OF TABLES

    3.1 The proposed WL-AVF algorithm . . . . . . . . . . . . . . . . . . . . . 21

    3.2 The proposed WL-CG algorithm . . . . . . . . . . . . . . . . . . . . . 24

    3.3 The proposed WL-MCG algorithm . . . . . . . . . . . . . . . . . . . . 27

    3.4 Computational complexity of LCMV beamforming algorithms . . . . . 29

    3.5 Computational complexity of WL-CMV beamforming algorithms . . . . 29

    4.1 The proposed DFT BS CG algorithm . . . . . . . . . . . . . . . . . . . 43

    4.2 The proposed DFT BS AVF algorithm . . . . . . . . . . . . . . . . . . 45

    4.3 Number of additions applying the SVD . . . . . . . . . . . . . . . . . . 47

    4.4 Number of multiplications applying the SVD . . . . . . . . . . . . . . . 47

    4.5 Number of additions applying the EVD . . . . . . . . . . . . . . . . . . 49

    4.6 Number of multiplications applying the EVD . . . . . . . . . . . . . . . 49

    5.1 Computational complexity of the proposed versions of the KA scheme . 69

    5.2 The proposed KA-ESPRIT algorithm . . . . . . . . . . . . . . . . . . . 72

    5.3 The proposed Unitary KA-ESPRIT algorithm . . . . . . . . . . . . . . 77

    5.4 The proposed KA-MUSIC algorithm . . . . . . . . . . . . . . . . . . . 80

    5.5 The proposed KA-Root-MUSIC algorithm . . . . . . . . . . . . . . . . 82

    Masterarbeit Jens Steinwandt

  • 1. Introduction 1

    1. INTRODUCTION

    Array signal processing is an active area of research in the broad field of signal process-

    ing, and focuses on the problem of estimating signal parameters from data collected

    over the spatial aperture of an antenna array, where the sensors are placed at distinct

    spatial locations. The given estimation task is usually associated with the extraction

    of desired information from impinging signals in the presence of noise and interference.

    The antenna array addresses the estimation problem by exploiting the spatial sepa-

    ration of the sensor elements to capture the propagating wavefronts, which originate

    from energy-radiating sources. Common signal parameters of interest to be estimated

    are the signal content itself, the directions of arrival of the signals, and their power. To

    obtain this information, the sensor array data is processed using statistical and adap-

    tive signal processing techniques. These schemes include parameter estimation and

    adaptive filtering applied to array signal processing. The underlying set of principles

    and techniques for sensor array signal processing is applicable in many versatile areas

    [1, 2], including wireless communications, radar, sonar, biomedicine, seismology and

    astronomy.

    Among the most relevant topics within array signal processing are beamforming

    [1] and direction of arrival estimation [1], which present inevitable challenges when

    designing wireless communications systems. As this thesis focuses on the advancement

    of these techniques for communications, the main problems to be solved are interference

    suppression, source localization, source tracking, complexity reduction and general

    performance improvements. For the sake of simplicity, the novel strategies developed

    in this work rely on the use of a uniform linear sensor array. An extension to arbitrary

    array geometries will be addressed in future work.

    1.1 Adaptive beamforming

    A ubiquitous task in array signal processing is the extraction of desired information

    from propagating signals originating from a certain direction by employing an antenna

    array equipped with multiple sensors. The array records the radiating wavefronts and

    produces an observation vector at a given time instant, which is processed to obtain the

    Masterarbeit Jens Steinwandt

  • 1. Introduction 2

    information of interest. The most straightforward strategy to achieve this objective is

    the weighted linear combination of the signals received at multiple spatially separated

    sensor elements. By selecting the appropriate weights, signals from a particular direc-

    tion can be emphasized while attenuating interfering signals from other directions. This

    weighting process is referred to as beamforming or spatial filtering [2, 3]. Generally,

    the steering of beams into specific angles by constructing the weights is equivalent to

    designing a frequency-selective finite impulse response (FIR) filter for temporal signals.

    The weighting applied to the received signal at each sensor element can either be fixed

    and data-independent, or continuously adapted to track changes in the system and

    to reject interference. The latter approach, termed adaptive beamforming, evidently

    provides a better performance and is the focus of this thesis.

    The adaptive algorithms used to adjust the weights for each sensor element are

    designed by optimizing certain criteria according to the given properties. One of the

    most relevant design criteria in practice is the constrained minimum variance (CMV)

    approach [4], which does not require the transmission of a known training sequence

    and only implies knowledge of the array geometry and the angle of the desired signal.

    The CMV optimality criterion minimizes the total beamformer output power while

    constraining the array response in the direction of the desired signal to be constant.

    Due to its simplicity and effectiveness much effort has been devoted over the past few

    decades to devise efficient adaptive algorithms in order to realize a practical beam-

    former design [1,3,5]. Among the existing adaptive algorithms, the least mean squares

    (LMS) method [5] as a representative of the low-complexity stochastic gradient tech-

    niques adopts gradient vectors for the iterative computation of the weights and yields

    an acceptable performance in many applications. However, its efficacy strongly de-

    pends on the step size and the eigenvalue spread of the covariance matrix, resulting

    in an insufficient convergence performance for certain scenarios [3]. An alternative

    method is the recursive least squares (RLS) algorithm [5], which is independent of the

    eigenvalue spread and thus attains a fast convergence speed. Nevertheless, its main

    drawbacks are numerical instability and a relatively high complexity. An attractive

    trade-off between the performance and the complexity of the LMS method and the

    RLS technique is achieved by the conjugate gradient (CG) algorithm [6, 7] originally

    derived to solve a linear system of equations. It forms successive residual vectors within

    a number of iterations in order to update the beamforming weight vector, requiring a

    lower computational cost compared to the RLS method for few iterations and provid-

    ing a faster convergence rate than the LMS approach. A modified version of the CG

    (MCG) algorithm derived to obtain further reduction of the complexity while main-

    taining the same performance level was proposed in [8]. Another adaptive algorithm

    Masterarbeit Jens Steinwandt

  • 1. Introduction 3

    that has received considerable interest is the auxiliary vector filtering (AVF) technique

    [9]. Similarly to the CG method, it utilizes an iterative procedure to update the weight

    vector by adopting auxiliary vectors to approach the optimal solution. The AVF algo-

    rithm was shown to outperform the performance of the RLS method with regard to the

    convergence speed and the steady-state level, and to be the state-of-the-art algorithm

    for adaptive beamforming [9].

    1.2 Direction of arrival estimation

    A second problem, which is extensively studied as part of the general field of array

    signal processing, is the estimation of the direction of arrival (DOA). The objective

    is to determine the angle of arrival of a given spatially propagating signal relative to

    the antenna array. To this end, the spatial separation of multiple sensor elements

    is exploited to obtain the location of the energy-radiating source. The result of the

    estimation procedure is subsequently used for the beamforming design described in the

    previous section to steer the beam towards this specific direction, in order to capture

    or radiate maximal power. With the field of applications involving DOA estimation

    constantly expanding, numerous direction finding techniques have been devised over

    the past few decades [1, 10]. The most well-known parameter estimation strategies

    discussed here can be classified into three main categories, namely conventional [11],

    subspace-based [12–17], and maximum likelihood (ML) methods [18].

    The concept of the conventional DOA estimation algorithms relies on the beam-

    forming principle. These techniques successively steer the main beam in all possible

    look directions and measure the output power [10], which is recorded in the form of a

    pseudo spectrum over the angle range. The largest peaks in the pseudo spectrum are

    associated with the DOA estimates. The most prominent approach within this class is

    Capon’s method [11] based on the CMV criterion [4]. It minimizes the power induced

    by interfering signals and noise while keeping the gain towards the look direction fixed.

    Although the implementation of the conventional techniques is simple, they suffer from

    a lack of angular resolution and require a large number of sensors to achieve a higher

    resolution.

    The class of subspace-based methods exploits a spectral decomposition of the covari-

    ance matrix to achieve high-resolution DOA estimates. Among the most relevant tech-

    niques are the multiple signal classification (MUSIC) [12], its extension Root-MUSIC

    [13], the estimation of signal parameters via rotational invariance techniques (ESPRIT)

    [14], its enhancement Unitary ESPRIT [15] and the recently proposed auxiliary vector

    filtering (AVF) [16] and conjugate gradient (CG) [17] algorithms developed for direction

    Masterarbeit Jens Steinwandt

  • 1. Introduction 4

    finding. The MUSIC-type and the ESPRIT-type algorithms exploit the eigen-structure

    of the covariance matrix, allowing a decomposition of the observation space into a sig-

    nal subspace and a complementary noise subspace. Specifically, MUSIC scans over the

    possible angle range and makes use of the orthogonality of the subspaces to obtain a

    pseudo spectrum with increased resolution. The extension Root-MUSIC for uniform

    linear arrays avoids the exhaustive search for peaks by applying a polynomial rooting

    technique. The ESPRIT-type algorithms avoid the exhaustive peak search by dividing

    the sensor array into two identical subarrays and benefit from the uniform displacement

    of the subarrays. The Unitary ESPRIT additionally takes advantage of the fact that

    the phase factors representing the displacement lie on the unit circle. The recently

    developed AVF and CG algorithms iteratively generate an extended non-eigen-based

    signal subspace containing the true signal subspace and the scanning vector itself. The

    DOA estimates are determined by the search for the collapse of the extended signal

    subspace as the scanning vector belongs to it. Whereas the AVF algorithm adopts aux-

    iliary vectors to form the extended signal subspace, the CG method applies residual

    vectors and can be considered as an extension of the AVF technique. Both approaches

    provide high-resolution estimates at a low signal-to-noise ratio, and a small sample

    size.

    ML-type methods are based on a parametric approach. They effectively exploit

    the underlying data model, resulting in sufficiently high accuracy superior to the con-

    ventional and subspace-based methods. However, the efficiency is at the expense of

    the computational intensity as a multidimensional search is required. An iterative ap-

    proach to limit the computational effort is the alternating projection technique [18],

    which transforms the optimization problem into a sequence of one-dimensional opti-

    mization problems.

    1.3 Overview and contributions

    In this work, we deal with the advancement of array signal processing algorithms for the

    applications of beamforming and direction finding, which are of significant relevance

    in wireless communications.

    Chapter 2 introduces the system model for array signal processing, focusing on

    the subspace estimation for direction finding and the CMV criterion-based design for

    adaptive beamforming.

    Chapter 3 is concerned with the first major contribution, which is the develop-

    ment of adaptive beamforming methods using Krylov-based algorithms for non-circular

    Masterarbeit Jens Steinwandt

  • 1. Introduction 5

    sources. By exploiting the widely-linear property that some of the signal sources are

    non-circular, an improved performance can be achieved. Three efficient widely-linear

    adaptive algorithms based on the AVF, the CG and the MCG algorithms, which avoid

    the computation of the inverse of the covariance matrix are derived according to the

    CMV criterion. Furthermore, their performance compared to the linear versions is

    investigated and the complexity analyzed.

    Chapter 4 deals with the extension of the Krylov subspace-based DOA estimation

    algorithms to the operation in beamspace, which is the second major contribution. The

    operation in the beamspace domain allows for a significant reduction of the complexity

    while substantially increasing the estimation performance. The AVF and CG-based

    direction finding techniques are developed for two different beamspace designs, provid-

    ing improved performance over their counterparts in element space. Complexity and

    performance assessments are also conducted.

    Chapter 5 addresses the third contribution, focusing on knowledge-aided DOA

    estimation. By exploiting a priori knowledge of some of the DOAs to be estimated,

    the estimation accuracy of the unknown signal directions can be significantly enhanced.

    A novel way of incorporating the prior knowledge for situations of a limited data

    record and a low signal-to-noise ratio is developed. The proposed scheme is applied

    to ESPRIT-type and MUSIC-type algorithms and the impact of the knowledge on the

    estimation performance is extensively studied.

    Chapter 6 concludes the key properties covered in this work, emphasizing the

    relevance for the scientific community and discussing possible topics for future work.

    1.4 List of publications

    Some of the research work outlined in this thesis has already been published. A list of

    the conference papers is provided below.

    Conference Papers

    1. J. Steinwandt, R. C. de Lamare, L. Wang, N. Song, and M. Haardt, “Widely

    linear adaptive beamforming algorithm based on the conjugate gradient method,”

    International ITG Workshop on Smart Antennas (WSA), (Aachen, Germany),

    Feb. 2011.

    Masterarbeit Jens Steinwandt

  • 1. Introduction 6

    2. J. Steinwandt, R. C. de Lamare, and M. Haardt, “Beamspace direction finding

    based on the conjugate gradient algorithm,” International ITG Workshop on

    Smart Antennas (WSA), (Aachen, Germany), Feb. 2011.

    3. N. Song, J. Steinwandt, L. Wang, R. C. de Lamare, and M. Haardt, “Non-

    data-aided adaptive beamforming algorithm based on the widely linear auxiliary

    vector filter,” IEEE International Conference on Acoustics, Speech, and Signal

    Processing (ICASSP), (Prague, Czech Republic), May 2011.

    Masterarbeit Jens Steinwandt

  • 2. System Model 7

    2. SYSTEM MODEL

    This chapter presents the system model of the antenna array output at the receiver,

    which provides the basis for the development of array signal processing algorithms.

    Although the problems of adaptive beamforming and DOA estimation are different, the

    fundamental model used for them is very similar. As stated earlier, the developments

    throughout this work are based on the use of a uniform linear array (ULA). Hence, the

    introduced system model is only valid for this class of array geometries.

    2.1 Data model for direction of arrival estimation

    Considering a spherical coordinate system, the DOA of an incident wavefront can be

    determined by estimating the azimuth angle θ and the elevation angle φ. For the

    sake of simplicity, we limit this model to the one-dimensional case, assuming that the

    wavefront impinges on the sensor array in the horizontal plane, i.e. φ = π/2, so that

    the azimuth angle θ entirely specifies the DOA. Corresponding to the schematic array

    structure in Figure 2.1, additional assumptions to describe the system model are to be

    made:

    • The propagating wavefronts are assumed to be radiated by point sources.

    • The sources are considered to be in the far field and the wavefronts are approxi-mately planar.

    x1 x2 xM

    ∆sin θ1

    θ1

    s1sd

    x3

    Fig. 2.1: Generic model of a ULA.

    Masterarbeit Jens Steinwandt

  • 2. System Model 8

    • The signals are narrowband signals, so that a time delay corresponds to a phaseshift.

    • The sensors are omni-directional, there is no coupling between them and thearray is perfectly calibrated.

    Let us suppose that an M -element sensor array receives d (d < M) propagating wave-

    fronts with the distinct DOAs θ = [θ1, . . . , θd]T relative to the broad side of the array.

    The ith of N available data snapshots of the (M × 1)-dimensional array output vectorcan be modeled as

    x(i) =[a(θ1) · · · a(θd)

    ]

    s1(i)...

    sd(i)

    + n(i) = A(θ)s(i) + n(i), i = 1, . . . , N, (2.1)

    where A(θ) ∈ CM×d is the array steering matrix, whose columns are functions of theunknown DOAs. The zero-mean vector s(i) ∈ Cd×1 represents the impinging complexwaveforms of the d signals and n(i) ∈ CM×1 is the additive vector of white circularlysymmetric complex Gaussian noise with zero mean and variance σ2n.

    Considering a ULA equipped with equally spaced sensors as shown in Figure 2.1,

    the (M × 1) steering vectors a(θn) corresponding to the nth source, n = 1, . . . , d, canbe expressed as

    a(θn) =[1 ej2π

    ∆λc

    sin θn · · · ej2π(M−1) ∆λc sin θn]T, (2.2)

    where ∆ denotes the distance of adjacent sensors of the ULA and λc is the signal

    wavelength. In order to avoid spatial aliasing, ∆ is restricted to ∆ ≤ λc/2 in whichcase the DOAs are limited to the interval −90◦ < θn < 90◦. Defining the spatialfrequencies µn ∈ [−π, π] as a result of the geometry in Figure 2.1 as

    µn = 2π∆

    λcsin θn, (2.3)

    the model stated in (2.1) can be written as

    x(i) =

    1 1 · · · 1ejµ1 ejµ2 · · · ejµd

    ......

    ...

    ej(M−1)µ1 ej(M−1)µ2 · · · ej(M−1)µd

    s(i) + n(i), (2.4)

    Masterarbeit Jens Steinwandt

  • 2. System Model 9

    where the array steering matrix A(θ) exhibits the Vandermonde structure. Given the

    entire data of the N snapshots, (2.4) can also be expressed compactly as

    X =[x1 · · · xN

    ]= A(θ)S + N ∈ CM×N . (2.5)

    2.2 Signal subspace estimation

    The DOA estimation techniques considered in this work belong to the class of subspace-

    based algorithms. Therefore, a brief review of the fundamental properties is covered in

    this section.

    2.2.1 True signal subspace

    Due to the fact that the source data s(i) and the noise vector n(i) are modeled as

    uncorrelated random variables, the covariance matrix of the noise-corrupted zero-mean

    array output x(i) is calculated by

    R = E{x(i)xH(i)

    }= A(θ)RssA

    H(θ) + σ2nIM ∈ CM×M , (2.6)

    where Rss = E{s(i)sH(i)} is the signal covariance matrix, which is diagonal if thesources are uncorrelated and nondiagonal for partially correlated sources. The noise

    covariance matrix E{n(i)nH(i)} = σ2nIM , where IM denotes the M × M identitymatrix.

    The eigenvalue decomposition (EVD) of the covariance matrix R can be expressed

    as

    R =[Us Un

    ]Λd 0

    0 0

    + σ2nIM

    U

    Hs

    UHn ,

    (2.7)

    where the diagonal matrix Λd = diag{λn}dn=1 consists of the nonzero eigenvalues,and Us and Un represent the orthogonal eigenvectors corresponding to the d largest

    eigenvalues and the M − d smallest eigenvalues, respectively. The matrix Us is oftenreferred to as signal subspace S and its orthogonal complement Un is termed noisesubspace. Without additive noise, i.e., σ2n = 0, it is evident from

    R = A(θ)RssAH(θ) = UsΛdU

    Hs (2.8)

    that A(θ) and Us span the same signal subspace S of dimension d, i.e.,

    S = span{A(θ)} = span{Us}. (2.9)

    Masterarbeit Jens Steinwandt

  • 2. System Model 10

    2.2.2 Direct data approach

    In the direct data approach, a singular value decomposition (SVD) of the array output

    matrix X is performed to estimate the signal and the noise subspace. Assuming that

    the number of sources d, i.e., the rank, is known, X can be written as

    X =[Ûs Ûn

    ]Σ̂s 0

    0 Σ̂n

    Hs

    V̂ Hn

    (2.10)

    = ÛsΣ̂sV̂Hs + ÛnΣ̂nV̂

    Hn , (2.11)

    where the diagonal matrices Σ̂s and Σ̂n contain the d largest singular values and the

    M − d smallest singular values, respectively. The estimated signal subspace Ûs ∈CM×d comprises the singular vectors corresponding to Σ̂s and the noise subspace Ûn ∈

    CM×(M−d) is associated with Σ̂n. If only the signal subspace is to be estimated a rank-d

    approximation of the SVD can be applied.

    2.2.3 Covariance approach

    In practical applications, the true covariance matrix R is unknown and needs to be

    estimated. A widely used estimator for R is a sample-averaged approach, termed

    sample covariance matrix, which is given by

    R̂ =1

    N

    N∑

    i=1

    x(i)xH(i) =1

    NXXH ∈ CM×M . (2.12)

    Again, assuming the rank d is known, the EVD of the sample covariance matrix R̂

    yields

    R̃ =[Ûs Ûn

    ]Λ̂s 0

    0 Λ̂n

    Hs

    ÛHn

    (2.13)

    = ÛsΛ̂sÛHs + ÛnΛ̂nÛ

    Hn , (2.14)

    where the diagonal matrices Λ̂s and Λ̂n contain the d largest eigenvalues and the

    M − d smallest eigenvalues, respectively. The estimated signal subspace Ûs ∈ CM×dis composed of the eigenvectors corresponding to Σ̂s and the noise subspace Ûn ∈CM×(M−d) complies with Σ̂n. If only the signal subspace is to be estimated a rank-d

    approximation of the EVD can be applied. Both the direct data approach and the

    covariance approach are theoretically equivalent.

    Masterarbeit Jens Steinwandt

  • 2. System Model 11

    2.3 Data model for CMV beamforming

    In order to describe adaptive beamforming techniques according to the constrained

    minimum variance (CMV) principle, the same assumptions on the impinging wavefronts

    as in the previous section are presumed. However, for the system model, we do not

    restrict the sensor array to be a ULA as this property is not required by the adaptive

    algorithms. Furthermore, we use a slightly different notation to distinguish between

    the problems of DOA estimation and adaptive beamforming.

    Assume that d narrowband signals from sources in the far field are impinging

    on an arbitrarily formed array of M (M ≥ d) sensor elements with the DOAs θ =[θ1, . . . , θd]

    T . The ith of N available data snapshots of the (M × 1)-dimensional arrayoutput vector can be modeled as

    r(i) = A(θ)P 1/2s(i) + n(i), i = 1, . . . , N, (2.15)

    where the zero-mean vector s(i) = [s1(i), . . . , sd(i)]T ∈ Cd×1 represents the complex

    signal waveforms, P = diag{σ21, . . . , σ2d} contains the signal powers on its diagonal,n(i) ∈ CM×1 is the vector of white circularly symmetric complex Gaussian noise withzero mean and variance σ2n, and the matrix A(θ) = [a(θ1), . . . ,a(θd)] ∈ CM×d consistsof the array steering vectors corresponding to the d sources.

    Corresponding to Figure 2.2, the output of the adaptive beamformer is generated

    by appropriately weighting the sum of the signals at each sensor element. Then, the

    output signal is sent back to the adaptive processor to update the weights subject

    to the optimization of a cost function. The weighting process can be mathematically

    θn

    x1(i)

    x2(i)

    xM (i)

    1

    2

    M

    Sources

    Sensors1

    2

    d

    ∑∑∑

    w∗1

    w∗2

    w∗M

    y(i)

    Adaptive

    Algorithm

    Fig. 2.2: Block diagram of a narrowband beamformer.

    Masterarbeit Jens Steinwandt

  • 2. System Model 12

    expressed as

    y(i) = wHr(i), (2.16)

    where w = [w1, . . . , wM ]T ∈ CM×1 is the complex beamforming weight vector to be

    designed. After estimating the DOAs in the environment and selecting a particular

    source of interest with the signal direction θ1, the beamformer output in (2.16) can be

    expanded as

    y(i) =√σ21w

    Ha(θ1)s1(i) + wH

    (K∑

    k=2

    √σ2ka(θk)sk(i) + n(i)

    )(2.17)

    =√σ21w

    Ha(θ1)s1(i) + wHu(i), (2.18)

    where u(i) is the vector containing the interferers and the additive noise. The objective

    is to keep the gain in the direction of the desired user fixed, i.e., wHa(θ1) = γ and the

    gain of the interferers small wHu(i) ≈ 0. This can be achieved by CMV beamforming,which formulates the objective as the constrained optimization problem

    wopt = argminw

    E

    {|y(i)|2

    }= E

    {|wHr(i)|2

    }= wHRw

    subject to wHa(θ1) = γ,(2.19)

    where the covariance matrix R is estimated by the sample covariance matrix R̂ defined

    in (2.12). The goal of (2.19) is to minimize the array output variance while maintaining

    the response in the direction of the SOI.

    Applying the method of Lagrange multipliers, the optimal solution for the mini-

    mization problem is

    wopt =γR−1a(θ1)

    aH(θ1)R−1a(θ1). (2.20)

    Setting the constant γ = 1 yields the so-called minimum variance distortionless re-

    sponse (MVDR). Note that the CMV criterion requires the knowledge of the DOA θ1

    of the desired user and the array geometry to form the steering vector a(θ1). Further

    details and a classification into linear CMV (LCMV) and widely-linear CMV (WL-

    CMV) will be discussed in the next chapter.

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 13

    3. WIDELY-LINEAR ADAPTIVE

    BEAMFORMING USING

    KRYLOV-BASED ALGORITHMS

    In this chapter, we develop three widely-linear constrained adaptive beamforming al-

    gorithms based on the Krylov subspace, which are designed according to the minimum

    variance principle. The proposed algorithms apply either the auxiliary vector filter-

    ing (AVF) algorithm, the conjugate gradient (CG) algorithm or the modified conju-

    gate gradient (MCG) algorithm to iteratively update the beamforming weight vector.

    By exploiting the non-circularity of impinging signals via a widely-linear processing

    technique, the output performance of these adaptive algorithms can be significantly

    improved.

    3.1 Introduction

    Adaptive beamforming is an essential approach for estimating the properties of a signal

    of interest at the sensor array output by weighting and combining the received signals

    to filter the desired user and to reject interference. The key advantage of the adaptive

    architecture is its ability to automatically adjust its weights to suit differences in the

    observed scenario. This includes tracking of moving interferers, or adaptation to leaving

    or entering signal sources in the system. Given its vast importance in multiple fields

    such as radar, sonar, biomedicine and wireless communications, adaptive beamforming

    has received considerable attention in the last few decades and numerous algorithms

    have been reported [5]. One of the most popular criteria for designing a beamformer is

    the linearly constrained minimum variance (LCMV) concept [4], which minimizes the

    variance at the array output while maintaining a desired response in the direction of

    the source of interest (SOI). This design criterion only requires the knowledge of the

    steering vector of the desired signal.

    In most existing estimation and filtering algorithms, it is often assumed that the re-

    ceived vector r(i) is second-order circular as for quadrature phase-shift keying (QPSK)-

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 14

    modulated signals, where the conventional covariance matrix

    R = E{r(i)rH(i)

    }∈ CM×M (3.1)

    fully describes the second-order statistics. However, in many modern telecommunica-

    tions and satellite systems based on binary phase-shift keying (BPSK)-modulated or

    amplitude-shift keying (ASK)-modulated signals, the observation signal r(i) features

    second-order non-circularity, i.e., the complementary covariance matrix

    C = E{r(i)rT (i)

    }6= 0 ∈ CM×M (3.2)

    and thus needs to be considered to optimally characterize the statistical properties of

    the signals [19]. It was recently shown that fully exploiting the second-order statis-

    tics, referred to as widely-linear (WL) processing, substantially improves the output

    performance compared to that in the linear processing [19–21]. Consequently, several

    widely-linear adaptive beamforming algorithms comprising the WL minimum variance

    distortionless response (WL-MVDR) algorithm [21] and the WL recursive least squares

    (WL-RLS) algorithm [22] have been devised. These approaches are either based on the

    assumption of weak non-circularity [21] or strict non-circularity [22] of the signals. A

    slight drawback associated with the strategy of widely-linear processing is the increased

    complexity due to the extended dimensions as both the originally observed data vector

    r(i) and its complex conjugate r∗(i) need to be processed. As the filtered data ordi-

    narily occupies a large sample size and also, largely equipped sensor arrays are utilized

    in practice, the widely-linear processing scheme further slows down the convergence

    speed of the adaptive algorithms.

    Among the well-established linearly constrained adaptive algorithms are the least

    mean squares (LMS) approach [5], which requires a low computational complexity but

    exhibits a slow convergence, and the recursive least squares (RLS) algorithm [5] that

    converges fast but suffers from a high computational burden. Two more recently de-

    veloped adaptive beamforming algorithms, namely the auxiliary vector filtering (AVF)

    algorithm [9] and the conjugate gradient (CG) algorithm [8] [23], rely on the Krylov

    subspace and iteratively update the filter weights in a stochastic gradient fashion. The

    AVF algorithm is known to yield a high convergence speed and to outperform the

    LMS and the RLS techniques. The iterative method based on the CG algorithm can

    provide an attractive trade-off between the convergence performance and the compu-

    tational complexity. Similarly to the AVF algorithm, it avoids the computation of

    R−1 required by the constraint, saving significant computational resources. However,

    it does not attain the performance level of the RLS algorithm. Despite the benefits

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 15

    of these Krylov-based methods, they require several iterations per data snapshot in

    order to converge. Thus, a modified version of the CG algorithm employing only one

    iteration per data update was developed in [23]. By making use of a degenerated

    scheme, meaning that its residual vectors are not requested to be orthogonal to the

    spanned Krylov subspace, the computational cost is further reduced while maintaining

    the performance.

    Motivated by the convergence speed-enhancing and complexity-reducing perfor-

    mance of the Krylov-based adaptive algorithms, this chapter is concerned with the

    development of these algorithms combined with the strategy of widely-linear process-

    ing to overcome the issues of the increased complexity and the convergence speed

    limitations. The resulting novel algorithms devised for strictly non-circular sources are

    referred to as the WL-AVF, the WL-CG and the WL-MCG beamforming methods and

    are designed according to the widely LCMV (WL-CMV) criterion. Requiring only the

    a priori knowledge of the signal direction of the desired user, the proposed techniques

    fully exploit the second-order statistics of the strictly non-circular signals in the system

    and, at the same time, ensure the benefits of the Krylov-based algorithms. In connec-

    tion with the development of the WL-AVF, the WL-CG, and the WL-MCG methods,

    an analysis of their computational complexity is performed and their convergence ca-

    pabilities are studied. For comparison purposes, their performance is contrasted with

    the previously developed WL-LMS and the WL-RLS algorithms.

    3.2 LCMV beamforming

    We start by shortly reviewing the LCMV concept for adaptive beamforming. Adopting

    the definition of the received vector r(i) stated in Section 2.3, the output of an adaptive

    narrowband beamformer is given by

    y(i) = wHr(i), (3.3)

    where

    w = [w1, . . . , wM ]T ∈ CM×1 (3.4)

    is the beamforming vector to be designed, which contains the complex weights that

    multiply the signals at each sensor element. The adaptation of the weight vector

    according to the LCMV optimality criterion is defined as the minimization of the

    total output variance while simultaneously keeping the gain of the array into a desired

    signal direction θ1 fixed. This formulation is equivalent to the reduction of the output

    variance by suppressing the interference. Mathematically, the optimal weights wopt are

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 16

    computed by solving the constrained optimization problem

    wopt = argminw

    E

    {|y(i)|2

    }= E

    {|wHr(i)|2

    }= wHRw

    subject to wHa(θ1) = γ,(3.5)

    where R is the M × M covariance matrix defined in (3.1), a(θ1) is the array steeringvector of the desired user with the DOA θ1, and γ is an arbitrary constant. Solving

    the minimization problem by using the method of Lagrange multipliers, the optimal

    solution for the calculation of the weights is obtained by

    wopt =γR−1a(θ1)

    aH(θ1)R−1a(θ1). (3.6)

    Setting the constant γ = 1 yields the so-called MVDR beamformer, meaning that

    the desired signal is undistorted due to the gain of 1 and the output signal exhibits

    minimum variance. Note that the LCMV criterion requires the knowledge of the DOA

    of the SOI and the exact array geometry to form the steering vector a(θ1).

    In order to assess the effectiveness of an adaptive beamformer, the signal-to-interference-

    plus-noise ratio (SINR) for the particular signal direction θ1 at each snapshot i is

    analyzed. The expression of the SINR is given by

    SINR(i) =σ21|wH(i)a(θ1)|2E {|wH(i)v(i)|2} =

    σ21|wH(i)a(θ1)|2wH(i)Ruw(i)

    , (3.7)

    where σ21 is the power of the SOI and Ru is the M × M covariance matrix of theinterference plus noise. The maximum output SINR according to the LCMV criterion

    can be derived as

    SINRLCMVmax =σ21a

    H(θ1)R−1a(θ1)

    1 − σ21aH(θ1)R−1a(θ1). (3.8)

    The proof of equation (3.8) was moved to the Appendix A.1.

    3.3 Non-circularity and WL-CMV beamforming

    As the objective of this chapter is to develop WL adaptive beamforming algorithms for

    non-circular sources, we first review the properties of non-circularity and then make

    use of the resulting characteristics to design the WL-CMV optimization criterion to

    generate the beamforming weight vectors.

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 17

    −2 −1 0 1 2−2

    −1

    0

    1

    2

    (a) ρ = 0

    −2 −1 0 1 2−2

    −1

    0

    1

    2

    (b) ρ = 0.6 exp(jπ/2)

    −2 −1 0 1 2−2

    −1

    0

    1

    2

    (c) ρ = 1 exp(jπ/2)

    Fig. 3.1: Contour lines of complex Gaussian random variables with different ρ.

    3.3.1 Non-circularity of signal sources

    A signal is termed non-circular if the real and the imaginary part of its source symbols

    are dependent, such that the lines of equal probability in the joint density function

    are not circles anymore [24, 25]. Consequently, a signal is referred to as circular if the

    opposite fact holds. For mathematical purposes, the non-circularity coefficient

    ρ =E {s2(i)}E {|s(i)|2} = |ρ|e

    jψ (3.9)

    is defined to provide a measure for the deviation from the circular symmetry. Thus,

    the non-circularity coefficient is ρ = 0 if the signal is circular. Assuming a signal in

    the system to be non-circular is not a distinct condition. Furthermore, it is essential to

    distinguish between weak non-circularity and strict non-circularity. The requirement

    for a weakly non-circular signal is 0 < |ρ| < 1 and a strictly non-circular signal impliesa non-circularity coefficient of |ρ| = 1. The contour lines of constant probability densityfor the three cases depending on ρ are depicted in Figure 3.1. Note that the lines for

    non-circular sources can have an arbitrary orientation ψ/2, which is determined by the

    angle of (3.9) [24]. In the following development of the WL Krylov-based beamforming

    algorithms, we assume the property of strict non-circularity of all the signals in the

    system.

    3.3.2 WL-CMV beamforming assuming strict non-circularity

    In order to exploit the strict non-circularity of signals impinging on the sensor array, the

    second-order statistics fully described by the covariance matrix R = E{r(i)rH(i)} andthe complementary covariance matrix C = E{r(i)rT (i)} need to be taken into account.Thus, to make use of the additional information contained in C, the received vector r(i)

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 18

    and its complex conjugate r∗(i) are processed by applying a bijective transformation

    T such thatr(i)

    T−→ r̃(i) : r̃(i) = [rT (i), rH(i)]T ∈ C2M×1, (3.10)

    where r̃(i) is the augmented received vector. It is utilized by the designed beamformer

    to design the augmented weight vector w̃ ∈ C2M×1 and to generate the output y(i) =w̃H r̃(i), which is termed widely-linear beamforming. In what follows, all the widely-

    linear quantities are denoted by an over tilde. Expanding the augmented vector after

    the bijective transformation, and taking into account the entire data block, yields

    X̃ =

    XX∗

    =

    A(θ)SA∗(θ)S∗

    +

    NN ∗

    ∈ C2M×N , (3.11)

    where we used X to denote the array output matrix for beamforming to avoid confu-

    sions with the covariance matrix R. As a result of the strict non-circularity assumption

    with |ρ| = 1, the source symbols in S are real-valued and (3.11) can be rewritten as

    X̃ =

    A(θ)A∗(θ)

    S +

    NN ∗

    . (3.12)

    A proof of the implication of real-valued symbols when presuming a non-circularity

    coefficient of |ρ| = 1 is shown in Appendix A.2.

    Applying the bijective transformation T to the LCMV optimization problem in(3.5), we obtain a new optimization problem referred to as WL-CMV design criterion,

    which is formulated as

    w̃opt = argminw̃

    E

    {|y(i)|2

    }= E

    {|w̃H r̃(i)|2

    }= w̃HR̃w̃

    subject to w̃Hã(θ1) = γ,(3.13)

    where ã(θ1) = T {a(θ1)} is the augmented steering vector of the SOI and R̃ =E{r̃(i)r̃H(i)} ∈ C2M×2M is the augmented covariance matrix of the structure

    R̃ =

    R C

    C∗ R∗

    , (3.14)

    where the covariance matrix R and the complementary covariance matrix C appear in

    conjugate pairs. Solving the widely-linear minimization problem using the Lagrange

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 19

    multiplier yields the optimal solution given by

    w̃opt =γR̃−1ã(θ1)

    ãH(θ1)R̃−1ã(θ1). (3.15)

    The performance of a widely-linear adaptive beamformer is evaluated through the

    WL SINR, which is expressed similarly to (3.7) as

    SĨNR(i) =σ21|w̃H(i)ã(θ1)|2w̃H(i)R̃uw̃(i)

    , (3.16)

    where R̃u is the augmented covariance matrices of the interference plus noise. The

    maximum output SINR in the WL case is given by

    SINRWL-CMVmax =σ21ã

    H(θ1)R̃−1ã(θ1)

    1 − σ21ãH(θ1)R̃−1ã(θ1). (3.17)

    3.4 Widely-linear beamforming based on the AVF algorithm

    In this part, the widely-linear adaptive beamforming algorithm adopting the AVF

    technique, termed WL-AVF algorithm, is developed. It utilizes the auxiliary vectors

    provided by the AVF algorithm to iteratively update the widely-linear weight vector

    w̃. After introducing the general structure of the WL-AVF-based beamforming design,

    the algorithm is derived in detail according to the previous work in [9].

    Structure of the WL-AVF algorithm for beamforming

    The general framework of the proposed beamformer based on the WL-AVF algorithm

    is depicted in Figure 3.2. First, the augmented received vector r̃(i) is obtained by

    applying a bijective transformation T to the received input vector r(i). The augmentedvector is then processed by the filter, whose complex weights are adjusted via the WL-

    FilterDesign

    WL-AVFAlgorithm

    T {·}r(i) y(i)r̃(i)

    wK(i)

    Fig. 3.2: Structure of the WL-AVF algorithm.

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 20

    AVF algorithm to generate the beamformer output y(i). The final weight wK(i) per

    data update is computed after K iterations of the WL-AVF algorithm. In the following

    derivation, we drop the time instant i to increase the readability.

    Proposed WL-AVF algorithm

    The development of the WL-AVF algorithm starts with the initialization of the aug-

    mented weight vector w̃, which is chosen to be equal to the conventional matched filter

    solution for a desired response γ in the direction θ1, given by

    w̃0 = γ∗ ã(θ1)

    ‖ã(θ1)‖2. (3.18)

    Then, the weight vectors are iteratively updated in a stochastic gradient manner by

    subtracting scaled auxiliary vectors g̃ that are constructed to be orthogonal to ã(θ1).

    The successively computed augmented weight vectors are obtained by

    w̃k = w̃k−1 − µkg̃k, k = 1, . . . , K, (3.19)

    where µk is the step size for the kth iteration. The augmented auxiliary vectors g̃k are

    designed to maximize the magnitude of the cross-correlation between g̃Hk r̃ and w̃Hk−1r̃,

    subject to the orthonormality constraint with respect to ã(θ1) [9], which is formulated

    by the optimization problem

    g̃k = argmaxg̃k

    ∣∣∣E{w̃Hk−1r̃[g̃

    Hk r̃]

    H} ∣∣∣

    = argmaxg̃k

    ∣∣∣E{w̃Hk−1R̃g̃k

    } ∣∣∣

    s.t. g̃Hk ã(θ1) = 0, ‖g̃k‖ = 1.

    (3.20)

    According to [9], the solution to this problem is expressed by

    g̃k =

    (I − ã(θ1)ãH(θ1)

    ‖ã(θ1)‖2

    )R̃w̃k−1

    ∥∥∥(I − ã(θ1)ãH(θ1)

    ‖ã(θ1)‖2

    )R̃w̃k−1

    ∥∥∥(3.21)

    Eventually, the value for the step size µk is determined by minimizing the variance at

    the output, i.e.,

    µk = minµk

    E

    {|w̃Hk r̃|2

    }(3.22)

    with the solution

    µk =g̃Hk R̃w̃k−1

    g̃Hk R̃g̃k. (3.23)

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 21

    Tab. 3.1: The proposed WL-AVF algorithm

    Initialization: w̃0 = γ∗ ã(θ1)

    ‖ã(θ1)‖2

    for each snapshot i = 1, . . . , N

    R̃dl(i) = λR̃dl(i − 1) + r̃(i)r̃H(i) + ξI2Mfor iteration k = 1, . . . , K

    g̃k =

    (I2M −

    ã(θ1)ãH (θ1)

    ‖ã(θ1)‖2

    )R̃dlw̃k−1∥∥∥

    (I2M −

    ã(θ1)ãH (θ1)

    ‖ã(θ1)‖2

    )R̃dlw̃k−1

    ∥∥∥If ‖g̃k − g̃k−1‖ → 0, EXIT.µk =

    g̃Hk

    R̃dlw̃k−1

    g̃Hk

    R̃g̃k

    w̃k = w̃k−1 − µkg̃kend

    end

    For the proof of (3.23), the reader is referred to Appendix A.3.

    The proposed beamforming technique based on the WL-AVF algorithm is summa-

    rized in Table 3.1, where the time instant i is included again to emphasize the adaptive

    approach of obtaining the augmented weight vector for each snapshot. As the auxil-

    iary vectors approach zero when k increases, the iterative procedure is terminated if

    ‖g̃k − g̃k−1‖ → 0.

    Applying the developed WL-AVF beamforming algorithm to non-stationary sys-

    tems, the covariance matrix R̃ is estimated by its recursive form using the exponentially

    weighted estimate:

    R̃(i) = λR̃(i− 1) + r̃(i)r̃H(i), (3.24)

    where λ is the forgetting factor, which is close to but less than 1. The first initialization

    of R̃(0) = δI2M , where a common value for δ = σ2n. Furthermore, the diagonal loading

    technique [26] is adopted to achieve more accurate estimates of R̃ for a small number

    of available snapshots, i.e.,

    R̃dl(i) = R̃(i) + ξI2M , (3.25)

    where the diagonal loading factor ξ is typically chosen to be ξ = 10·σ2n according to [27].Other approaches to improve the estimates in short data records include reduced-rank

    algorithms as in [28].

    3.5 Widely-linear beamforming based on the CG algorithm

    This part of the chapter deals with the derivation of the WL-CG algorithm for adaptive

    beamforming in the case of strictly non-circular signals. The WL-CG algorithm forms

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 22

    FilterDesign w̃(i)

    WL-CGAlgorithm

    T {·}r(i) y(i)r̃(i)

    ṽK(i)

    Fig. 3.3: Structure of the WL-CG algorithm.

    orthogonal residual vectors, which are employed to iteratively update the widely-linear

    weight vector w̃(i). Whereas the general structure is described first, the second part is

    concerned with the development of the WL-CG method as an extension of the previous

    work in [8, 23].

    Structure of the WL-CG algorithm for beamforming

    The block diagram of the WL-CG-based beamforming procedure depicted in Figure 3.3

    exhibits a similar structure as for the WL-AVF algorithm. The bijective transformation

    T applied to the original received vector r(i) yields the augmented received vectorr̃(i), which is processed by the filter to provide the output y(i). However, contrary

    to the WL-AVF procedure, the WL-CG method iteratively updates a new defined

    augmented vector ṽ(i) to avoid the computation of R̃−1 that is passed onto the filter

    after performing K iterations of the WL-CG algorithm. Thus, the final beamforming

    weight vector w̃(i) is computed in the filter design. Again, we drop the time instant i

    in the following development.

    The proposed WL-CG algorithm

    In order to derive the WL-CG beamforming algorithm for non-circular sources, the

    WL-CG method is applied to minimize the cost function [29]

    min J(ṽ) = ṽHR̃ṽ − 2Re{ãH(θ1)ṽ

    }, (3.26)

    where ṽ ∈ C2M×1 is a newly defined augmented CG weight vector, which is differentfrom the beamforming weight vector w̃ to be designed. The CG method was originally

    developed to solve the optimization problem in (3.26) [6]. However, by taking the

    gradient of (3.26) with respect to the CG weight vector ṽ and equating it to zero,

    it can be shown that minimizing the optimization problem in (3.26) is equivalent to

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 23

    solving the system of linear equations [23]

    R̃ṽ = ã(θ1) ∈ C2M×1. (3.27)

    The WL-CG algorithm solves (3.27) by approaching the optimal weight solution step

    by step via a line search along successive directions, which are sequentially determined

    at each iteration [6]. The solution for the WL-CG optimization problem (3.26) is

    obtained as

    ṽ = R̃−1ã(θ1). (3.28)

    As the expression R̃−1ã(θ1) also occurs in the optimal WL-CMV solution in (3.15),

    the WL-CMV beamformer can thus be iteratively computed without the calculation

    of R̃−1 by updating the new augmented CG weight vector ṽ as

    ṽk = ṽk−1 + αkp̃k, k = 1, . . . , K, (3.29)

    where p̃k is the direction vector, αk is the step size value to control its weight and k

    is the iteration number. The directions p̃k are orthogonal with respect to R̃, termed

    R̃-orthogonality, i.e.,

    p̃Hk R̃p̃l = 0 (3.30)

    for all k 6= l. The following direction was determined in the previous iteration and isobtained by

    p̃k+1 = g̃k + βkp̃k, (3.31)

    where g̃k is the residual vector defined in [6] that is given by

    g̃k = ã(θ1) − R̃ṽk. (3.32)

    The step size αk is chosen to minimize the cost function in the determined direction

    and βk ensures the R̃-orthogonality of the direction vectors p̃k in (3.30). According to

    [6], both parameters are calculated by

    αk =g̃Hk−1p̃k

    p̃Hk R̃p̃kand βk =

    g̃Hk g̃k

    g̃Hk−1g̃k−1. (3.33)

    After K iterations of the WL-CG algorithm the augmented complex weight vector w̃

    is formed as

    w̃ =γṽK

    ãH(θ1)ṽK. (3.34)

    A summary of the proposed WL-CG-based beamforming technique is given in Table

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 24

    Tab. 3.2: The proposed WL-CG algorithm

    Initialization: ṽ0(1) = 0, R̃dl(0) = δI2Mfor each snapshot i = 1, . . . , N

    R̃dl(i) = λR̃dl(i − 1) + x̃(i)x̃H(i) + ξI2M ,p̃1(i) = g̃0(i) = ã(θ1) − R̃dl(i)ṽ0(i)for iteration k = 1, . . . , K

    d̃k(i) = R̃dl(i)p̃k(i)

    αk(i) = [p̃Hk (i)d̃k(i)]

    −1g̃Hk−1(i)p̃k(i)ṽk(i) = ṽk−1(i) + αk(i)p̃k(i)

    g̃k(i) = g̃k−1(i) − αk(i)d̃k(i)βk(i) = [g̃

    Hk−1(i)g̃k−1(i)]

    −1g̃Hk (i)g̃k(i)p̃k+1(i) = g̃k(i) + βk(i)p̃k(i)

    end

    ṽ0(i + 1) = ṽK(i)w̃(i) = [ãH(θ1)ṽK(i)]

    −1γṽK(i)end

    3.2, where again the included time instant i indicates the adaptive way to obtain the

    weight solution for each data snapshot.

    As already outlined in the previous section, the covariance matrix R̃ is estimated by

    its recursive form using the exponentially weighted estimate and incorporating diagonal

    loading:

    R̃dl(i) = λR̃dl(i− 1) + r̃(i)r̃H(i) + ξI2M , (3.35)

    where the diagonal loading factor ξ is again chosen to be ξ = 10 · σ2n [27].

    3.6 Widely-linear beamforming based on the MCG algorithm

    In this section, a modified version of the WL-CG method, termed WL-MCG algo-

    rithm, is developed for widely-linear adaptive beamforming. Similarly to the WL-CG

    algorithm, it applies residual vectors to update the beamforming weight vector in an

    iterative fashion. However, it only requires one iteration of the CG algorithm per sam-

    ple update. Again, the first part deals with the structure of the MCG method and in

    the second part, the detailed algorithm is derived according to [8].

    Structure of the WL-MCG algorithm for beamforming

    The general structure of the WL-MCG algorithm for adaptive beamforming is shown as

    a block diagram in Figure 3.4. The augmented received vector r̃(i) obtained after the

    bijective transformation is processed by the WL-MCG algorithm to provide the weight

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 25

    FilterDesign

    WL-MCGAlgorithm

    T {·}r(i) y(i)r̃(i)

    w̃(i)

    Fig. 3.4: Structure of the WL-MCG algorithm.

    vector w̃(i) for the filter. The key advantage is the property that, unlike the WL-CG

    developed in the previous section, the MCG method requires only one iteration per

    data snapshot, saving computational resources. Furthermore it also adopts the new

    defined augmented vector ṽ(i) to avoid the costly computation of R̃−1.

    The proposed WL-MCG algorithm

    Corresponding to the WL-CG beamforming method, the WL-MCG algorithm is an-

    other way of solving the system of equations stated in (3.27) in the previous section.

    The covariance matrix is estimated by using the diagonal-loading enhanced exponen-

    tially weighted data window

    ˆ̃Rdl(i) = λ

    ˆ̃Rdl(i− 1) + r̃(i)r̃H(i) + ξI2M , (3.36)

    where λ is the forgetting factor and ξ the diagonal loading factor. According to [1],

    the estimated covariance matrix can be approximated for a large sample size as

    ˆ̃Rdl(i) ≈

    1

    1 − λ(R̃ + ξI2M). (3.37)

    Now, we define the new augmented weight vector ṽ(i) to avoid the computationally

    expensive computation of ˆ̃R−1

    dl (i) at each snapshot as

    ṽ(i) = ˆ̃R−1

    dl (i)ã(θ1). (3.38)

    Inserting equation (3.37) into (3.38), we obtain

    ṽ(i) =(

    1

    1 − λ(R̃ + ξI2M))−1

    ã(θ1) (3.39)

    for a sufficiently large number of snapshots.

    Similarly to the WL-CG algorithm, the WL-MCG method iteratively computes the

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 26

    solution of (3.38) by updating the new defined augmented vector ṽ(i) as

    ṽ(i) = ṽ(i− 1) + α(i)p̃(i), (3.40)

    where the direction vector p̃(i) was determined at the previous snapshot by

    p̃(i+ 1) = g̃(i) + β(i)p̃(i) (3.41)

    and the residual vector g̃(i) is given by

    g̃(i) = ã(θ1) − ˆ̃R(i)ṽ(i). (3.42)

    In order to realize sample-by-sample processing, a recursive expression for the residual

    vector g̃(i) can be derived by taking into account the equations (3.42), (3.36), and

    (3.40), leading to

    g̃(i) = (1 − λ)ã(θ1) + λg̃(i− 1) −(r̃(i)r̃H(i) + ξI2M

    )ṽ(i− 1) − α(i) ˆ̃R(i)p̃(i). (3.43)

    The proof of (3.43) is shown in Appendix A.4.

    The basic idea of the WL-MCG is to employ a degenerated scheme [8], meaning

    that the obtained residual vector g̃(i) is not required to be orthogonal to the subspace

    spanned by the previously determined direction vectors S = span{p̃(1), p̃(2), . . . , p̃(i)},i.e., p̃H(i)g̃(l) 6= 0 for l < i. Under this condition the step size α(i) has to satisfy theconvergence bound defined according to [8, 23] as

    0 ≤ E{p̃H(i)g̃(i)

    }≤ 0.5 E

    {p̃H(i)g̃(i− 1)

    }. (3.44)

    To this end, premultiplying equation (3.43) by p̃H(i), taking the expectation of both

    sides and considering (3.39) yields

    E

    {p̃H(i)g̃(i)

    }≈ λE

    {p̃H(i)g̃(i− 1)

    }− E {α(i)}E

    {p̃H(i) ˆ̃R(i)p̃(i)

    }, (3.45)

    where the convergence of the adaptive algorithm is assumed, such that the approxima-

    tion in (3.39) holds, and therefore

    (1 − λ)ã(θ1) −[E

    {r̃(i)r̃H(i)

    }+ ξI2M

    ]ṽ(i− 1) = 0. (3.46)

    Inserting the obtained expression (3.45) into the convergence bound in (3.44) and

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 27

    Tab. 3.3: The proposed WL-MCG algorithm

    Initialization: ṽ(0) = 0, R̃dl(0) = δI2M , p̃(1) = g̃(0) = ã(θ1)for each snapshot i = 1, . . . , N

    R̃dl(i) = λR̃dl(i − 1) + x̃(i)x̃H(i) + ξI2M ,d̃(i) = R̃dl(i)p̃(i)

    α(i) = [p̃H(i)d̃(i)]−1(λ − η)p̃H(i)g̃(i − 1) (0 ≤ η ≤ 0.5)ṽ(i) = ṽ(i − 1) + α(i)p̃(i)g̃(i) = (1 − λ)ã(θ1) + λg̃(i − 1) − [r̃(i)r̃H(i) + ξI2M ]ṽ(i − 1) − α(i)d̃(i)β(i) = [g̃H(i − 1)g̃(i − 1)]−1[g̃(i) − g̃(i − 1)]H g̃(i)p̃(i + 1) = g̃(i) + β(i)p̃(i)w̃(i) = [ãH(θ1)ṽ(i)]

    −1γṽ(i)end

    rearranging the terms, we have that

    (λ− 0.5)E

    {p̃H(i)g̃(i− 1)

    }

    E

    {p̃H(i) ˆ̃R(i)p̃(i)

    } ≤ E {α(i)} ≤ λE

    {p̃H(i)g̃(i− 1)

    }

    E

    {p̃H(i) ˆ̃R(i)p̃(i)

    } . (3.47)

    From (3.47), the inequality is satisfied if α(i) is computed by

    α(i) = (λ− η) p̃H(i)g̃(i− 1)

    p̃H(i) ˆ̃R(i)p̃(i), (3.48)

    where 0 ≤ η ≤ 0.5 to fulfill the convergence criterion. The step size β(i) is computedaccording to the Polak-Ribiere approach [30] to avoid the reset procedure [8] and is

    stated as

    β(i) =[g̃(i) − g̃(i− 1)]H g̃(i)

    g̃(i− 1)H g̃(i− 1) . (3.49)

    The proposed WL-MCG algorithm for widely-linear adaptive beamforming is sum-

    marized in Table 3.3. It only performs one iteration of the CG algorithm per snapshot,

    saving significant computational efforts. The diagonal loading factor ξ in the algorithm

    is again chosen to be ξ = 10 · σ2n [27].

    3.7 Computational complexity

    The computational effort of the WL processing is one of the most important aspects

    to take into account when developing novel WL adaptive beamforming algorithms.

    To this end, we evaluate the computational complexity of the proposed WL Krylov-

    based beamforming algorithms and compare them to their linear counterparts. For

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 28

    comparison purposes, we first focus on the complexity of the linear algorithms, and

    then consider the complexity of the WL algorithms.

    3.7.1 Complexity of LCMV algorithms

    The computational cost, shown in Table 3.4, is measured in terms of the arithmetical

    operations, whereM is the number of sensor elements andK is the number of iterations.

    The expressions in Table 3.4 are obtained from the work in [23]. It is evident that the

    complexity of the AVF and the CG algorithm depends on the iteration number K.

    Figure 3.5 illustrates the computational cost as a function of the number of sensors M .

    The iteration numbers for the AVF and the CG method are chosen to be K = 4 and

    K = 5 respectively, as these numbers are used in the following performance evaluation.

    Comparing the cost of the linear Krylov-based algorithms to the RLS method, the AVF

    technique clearly exceeds the computational requirements of the RLS method, whereas

    the CG algorithm demands a slightly higher cost. The aforementioned trade-off of

    the CG method only becomes apparent for a smaller number of iterations. However,

    the MCG algorithm provides a lower computational burden than the RLS technique,

    especially for a large number of sensors M , but still requires a higher cost than the

    LMS algorithm.

    10 20 30 40 5010

    0

    101

    102

    103

    104

    105

    Num

    ber

    of additio

    ns

    Number of sensors M10 20 30 40 50

    100

    101

    102

    103

    104

    105

    Num

    ber

    of m

    ultip

    lications

    Number of sensors M

    LMS

    WL−LMS

    RLS

    WL−RLS

    AVF, K = 4

    WL−AVF, K = 2

    CCG, K = 5

    WL−CCG, K = 3

    MCG

    WL−MCG

    Fig. 3.5: Complexity in terms of arithmetic operations versus the sensors M .

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 29

    Tab. 3.4: Computational complexity of LCMV beamforming algorithms

    Algorithm Additions Multiplications

    LMS [5] 4M − 2 4M + 3RLS [5] 4M2 − M − 1 5M2 + 5M − 1AVF [9] K(4M2 + M − 2) + 5M2 − M − 1 K(5M2 + 3M) + 8M2 + 2MCG [8] K(M2 + 4M − 2) + 2M2 − 1 K(M2 + 4M + 1) + 3M2 + 3MCG [8] 2M2 + 7M − 3 3M2 + 9M + 4

    Tab. 3.5: Computational complexity of WL-CMV beamforming algorithms

    Algorithm Additions Multiplications

    WL-LMS [21] 8M − 2 8M + 3WL-RLS [22] 16M2 − 2M − 1 20M2 + 10M − 1WL-AVF K(16M2 + 2M − 2) + 20M2 − 2 ∗ M − 1 K(20M2 + 6M) + 32M2 + 4MWL-CG K(4M2 + 8M − 2) + 8M2 − 1 K(4M2 + 8M + 1) + 12M2 + 3WL-MCG 8M2 + 14M − 3 12M2 + 18M + 4

    3.7.2 Complexity of WL-CMV algorithms

    The computational complexity for the WL beamforming algorithms is depicted in Table

    3.5. Due to the WL processing, the analyzed algorithms developed for non-circular

    sources consider the additional information contained in the augmented steering vector,

    which can be interpreted as an array composed of twice as many sensor elements M .

    Thus, the computational cost of all the WL beamforming methods increases by nearly

    the same amount, complying with the structure of the augmented covariance matrix

    R̃. As a result, the same observations for the linear methods also apply to the WL

    case. According to Figure 3.5, where the number of iterations for the WL-AVF and

    the WL-CG techniques is now K = 2 and K = 3 respectively, the gap between the

    cost of the WL-RLS and the WL-CG is substantially decreased. The complexity of

    some of the considered algorithms might be further reduced by using symmetries and

    individual properties. However, these complexity reductions are beyond the scope of

    this work and are left for future investigations.

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 30

    3.8 Simulation results

    In this section, the output SINR performance of the proposed WL Krylov-based al-

    gorithms in stationary and non-stationary scenarios is evaluated and compared to the

    WL-RLS, the WL-LMS algorithm, and their linear counterparts. For the simulations,

    we employ a ULA consisting of M = 24 sensor elements whose interelement spacing

    equals half a signal wavelength. Among d sources in the system, the DOA of the desired

    user with signal power σ21 = 1 is θ1 = 0◦, which is known beforehand by the receiver. All

    the signals are assumed to be binary phase-shift keying (BPSK)-modulated, and γ = 1

    is set to satisfy the condition for the MVDR leading to a maximum achievable SINR

    performance. The diagonal loading factor is ξ = 10 · σ2n and each curve is obtained byaveraging a total of 1000 runs. Notice that in the simulations the number of iterations

    K for the WL-AVF and the WL-CG algorithm is fixed for all the realizations. The first

    part of the computer simulations deals with the performance in stationary scenarios

    and the second part is concerned with the performance assessment in non-stationary

    scenarios.

    3.8.1 Performance in stationary scenarios

    In the first experiment, we examine the SINR performance of the proposed WL adaptive

    beamforming algorithms in a stationary system. Assuming that there are d = 15 users

    in the scenario, the d − 1 = 14 interfering signals impinge on the sensor array fromthe directions (±10◦ · [1, . . . , (d − 1)/2]) with an input SNR = 10 dB and a signal-to-interference ratio (SIR) of −20 dB. Due to the stationarity of the system, the forgettingfactor is set to be λ = 1.

    Figure 3.6 shows the output SINR performance as a function of the number of

    snapshots N . The number of iterations K for the AVF algorithm and for the WL-

    AVF algorithm were found to be K = 4 and K = 2 and the iteration number for the

    CG algorithm and the WL-CG algorithm is K = 3 and K = 5, respectively. The

    amount of iterations was optimized to yield the best output performance in terms

    of the convergence speed. The lower number of iterations for the WL version of the

    AVF and the CG algorithm can be explained by the fact that the augmented weight

    vector provides additional information and reaches the optimum solution with fewer

    iterations. It is apparent from Figure 3.6 that all the WL beamforming algorithms

    provide substantial improvements over the linear techniques in general. Specifically,

    the proposed WL-AVF algorithm outperforms the adaptive WL-LMS and the WL-RLS

    in terms of both the convergence and the steady state. The WL-CG method and the

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 31

    0 200 400 600 800 1000

    −2

    0

    2

    4

    6

    8

    10

    12

    14

    Outp

    ut S

    INR

    (dB

    )

    Number of snapshots N

    LMS

    WL−LMS

    RLS

    WL−RLS

    AVF, K = 4

    WL AVF, K = 2

    CG, K = 5

    WL−CG, K = 3

    MCG

    WL−MCG

    SINRmax

    LCMV

    SINRmax

    WL−CMV

    Fig. 3.6: Output SINR versus the number of snapshots N in a stationary scenario withM = 24, d = 15, SNR = 10 dB, SIR = −20 dB, λ = 1.

    0 5 10 15 200

    5

    10

    15

    20

    25

    Outp

    ut S

    INR

    (dB

    )

    SNR (dB)

    LMS

    WL−LMS

    RLS

    WL−RLS

    AVF, K = 4

    WL AVF, K = 2

    CG, K = 5

    WL−CG, K = 3

    MCG

    WL−MCG

    SINRmax

    LCMV

    SINRmax

    WL−CMV

    Fig. 3.7: Output SINR versus the input SNR in a stationary scenario with M = 24,d = 15, N = 1000, SIR = −20 dB, λ = 1.

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 32

    WL-MCG algorithm perform close to the level of the WL-RLS algorithm and outclass

    the WL-LMS technique. However, as discussed in the previous section, the WL-MCG

    algorithm significantly reduces the complexity of the WL-CG method. As seen for the

    SINRWL-CMVmax compared to the SINRLCMVmax , by exploiting the additional information in

    the presence of non-circular sources, a higher gain can be achieved. This is due to the

    WL processing, where the augmented steering vector can be interpreted as a virtual

    array with twice as many sensors.

    The SINR performance as a function of the SNR is illustrated in Figure 3.7, where

    the number of snapshots is fixed to N = 1000. As expected, the SINR increases with

    the SNR, and for the SINRWL-CMVmax the additional gain is expressed by shifting the curve

    for the SINRLCMVmax to a higher SINR. The proposed WL algorithms performs similarly

    to the results in Figure 3.6. The WL-AVF method outperforms the WL-LMS and the

    WL-RLS algorithm, whereas the WL-CG and the WL-MCG techniques almost reach

    the level of the WL-RLS algorithm.

    3.8.2 Performance in non-stationary scenarios

    In the second experiment, the SINR performance in a non-stationary scenario is an-

    alyzed and compared to the WL-LMS and the WL-RLS. The system starts with the

    same scenario as in Figure 3.6, but with different power levels of the interfering signals,

    0 500 1000 1500 2000−2

    0

    2

    4

    6

    8

    10

    12

    14

    Ou

    tpu

    t S

    INR

    (d

    B)

    Number of snapshots N

    LMS

    WL−LMS

    RLS

    WL−RLS

    AVF, K = 4

    WL AVF, K = 2

    CG, K = 5

    WL−CG, K = 3

    MCG

    WL−MCG

    Fig. 3.8: Output SINR versus the number of snapshots N in a non-stationary scenariowith M = 24, d1 = 15, d2 = 12, SNR = 10 dB, λ = 0.998.

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 33

    namely 8 interferers with a power of 20 dB, 4 with 10 dB and 2 with 5 dB above the

    level of the SOI. At the 1000th snapshot, 4 users with a signal power of 20 dB and

    one user with 10 dB above the SOI leave the system, and 2 users with a power of

    20 dB and 5 dB above the SOI enter the system. As can be seen in Figure 3.8, the

    proposed WL-CG and the WL-MCG quickly adapt to the changed system and exhibit

    the convergence speed and the steady-state level of the WL-RLS method. Thus, the

    WL-CG and the WL-MCG approaches provide a better performance after a change in

    the environment. Nevertheless, the same conclusion can not be drawn for the proposed

    WL-AVF algorithm. After the changing event, it barely exceeds the convergence speed

    of the WL-LMS method and only attains the level of the other algorithms after requir-

    ing a significant number of data samples. This behavior is due to the relatively low

    number of iterations of the AVF algorithm per data update. However, a larger iteration

    number would decrease the performance in a stationary system. In consequence of the

    fixed number of iterations K for the experiments, it is evident that the performance

    can be improved by applying an automatic scheme to adaptively adjust K.

    3.9 Conclusions

    In this chapter, an extension of the Krylov subspace-based algorithms employing the

    AVF, the CG and the MCG methods for adaptive beamforming to the WL process-

    ing is developed, resulting in the WL-AVF, the WL-CG and the WL-MCG algorithm.

    After exploring the performance advantages associated with the WL processing in the

    face of implying a loss of the computational efficiency, the WL-CMV optimization

    criterion is introduced to update the augmented beamforming weight vector. Then

    by assuming strictly non-circular signals in the system, the general framework of the

    three proposed adaptive algorithms is discussed and their detailed derivation presented.

    The developed algorithms fully exploit the second-order statistics of the strictly non-

    circular signals and avoid the computation of the inverse of the augmented covariance

    matrix. Whereas the WL-AVF algorithm applies augmented auxiliary vectors within

    a number of iterations to adapt the augmented weight vector at each data sample, the

    WL-CG resorts to augmented residual vectors. The key benefit of the WL-MCG is

    the fact that it only performs one iteration per data snapshot. To assess the compu-

    tational requirements, an analysis of the complexity is conducted, which demonstrates

    that the same computational behavior in the linear case also holds for the WL pro-

    cessing. Specifically, the WL-AVF algorithm requires the highest complexity and the

    WL-MCG the lowest cost. Furthermore, the SINR performance of these algorithms is

    evaluated by computer simulations for stationary and non-stationary scenarios, where

    Masterarbeit Jens Steinwandt

  • 3. Widely-linear adaptive beamforming using Krylov-based algorithms 34

    the number of iterations for the WL-AVF and the WL-CG method is optimized. It is

    illustrated that the WL processing provides significant performance improvements over

    the linear processing. In stationary scenarios, the WL-AVF algorithm outperforms the

    WL-RLS method, and the WL-CG and the WL-MCG algorithm perform close to the

    WL-RLS technique. Considering non-stationary scenarios, the WL-CG and the WL-

    MCG method attain the performance of the WL-RLS approach after a change in the

    system; however, the WL-AVF achieves a slower adaptation due to the lower number of

    iterations. This motivates an automatic scheme for adaptively switching the iteration

    number.

    Masterarbeit Jens Steinwandt