Post on 15-Apr-2018
Astrophysical Applications of Gravitational Microlensing
Dissertation
Presented in Partial Fulfillment of the Requirements for the Degree Doctor ofPhilosophy in the Graduate School of The Ohio State University
By
Subo Dong, M.S.
Graduate Program in Astronomy
The Ohio State University
2009
Dissertation Committee:
Professor Andrew Philip Gould, Advisor
Professor Bernard Scott Gaudi
Professor Krzysztof Zbigniew Stanek
ABSTRACT
In this thesis, I present several astrophysical applications of Galactic and
cosmological microlensing.
The first few topics are on searching and characterizing extrasolar planets
by means of high-magnification microlensing events. The detection efficiency
analysis of the Amax ∼ 3000 event OGLE-2004-BLG-343 is presented. Due to
human error, intensive monitoring did not begin until 43 minutes after peak, at
which point the magnification had fallen to A ∼ 1200. It is shown that, had a
similar event been well sampled over the peak, it would have been sensitive to
almost all Neptune-mass planets over a factor of 5 in projected separation and even
would have had some sensitivity to Earth-mass planets. New algorithms optimized
for fast evaluation of binary-lens models with finite-sources effects have been
developed. These algorithms have enabled efficient and thorough parameter-space
searches in modeling planetary high-magnification events. The detection of the
cool, Jovian-mass planet MOA-2007-BLG-400Lb, discovered from an Amax = 628
event with severe finite-source effects, is reported. Detailed analysis yields a
fairly precise planet/star mass ratio of q = (2.5+0.5−0.3) × 10−3, while the planet/star
ii
projected separation is subject to a strong close/wide degeneracy. Photometric
and astrometric measurements from Hubble Space Telescope, as well as constraints
from higher order effects extracted from the ground-based light curve (microlens
parallax, planetary orbital motion and finite-source effects) are used to constrain the
nature of planetary event OGLE-2005-BLG-071Lb. Our primary analysis leads to
the conclusion that the host is an M = 0.46 ± 0.04 M⊙ M dwarf and that the planet
has mass Mp = 3.8 ± 0.4 MJupiter, which is likely to be the most massive planet yet
discovered that is hosted by an M dwarf.
Next a spaced-based microlens parallax is determined for the first time using
Spitzer and ground-based observations for binary-lens event OGLE-2005-SMC-001.
The parallax measurement yields a projected velocity v ∼ 230 km s−1, the typical
value expected for halo lenses, but an order of magnitude smaller than would be
expected for lenses lying in the Small Magellanic Cloud (SMC) itself.
Finally, I propose using quasar microlensing to probe Mg II and other
absorption “cloudlets” with sizes ∼ 10−4.0 − 10−2.0pc in the intergalactic medium.
I show that significant spectral variability on timescales of months to years can be
induced by such small-scale absorption “cloudlets” toward a microlensed quasar.
With numerical simulations, I demonstrate the feasibility of applying this method
to Q2237+0305, and I show that high-resolution spectra of this quasar in the near
future would provide a clear test of the existence of such metal-line absorbing
“cloudlets”.
iii
ACKNOWLEDGMENTS
In modern times, most of the necessities of people’s daily lives rely upon others,
therefore, no matter how highly the lofty ideal of individual freedom is acclaimed,
most people have to choose their jobs according to the needs of the society rather
than following their inner voices (i.e., what they are interested in the most). I
consider myself very lucky to have five years’ opportunity to pursue my childhood
dream in becoming an astronomer with no financial burden. I am thankful to
everyone who has made this possible.
I thank Andy Gould for being an unparalleled advisor. He has taken immense
care in guiding me along my path to becoming a scientist. His influence and help
have permeated every aspect of my scientific activities as a graduate student.
Whenever I need discussions or advice (or whenever he perceives me as needing
them), he is always ready to offer them in most timely fashion. His excitement by
new findings, creativity in novel ideas, and self-driven “Gung-ho” working spirit
exhibit no trace of his age. It is difficult to describe how exhilarating an experience
it is to work with him.
v
I thank Andy for being a truthful person – for being truthful to himself, to
others, and to nature. I was initially intimidated by the way he spoke in morning
coffee and colloquia. But I have come to realize that, if one is offended by sincere
efforts to pursue the truth, or not offended by distortion or fabrication of facts and
careless or superficial analyses, he/she is not a real scientist. I thank Andy for
teaching me to be honest to data, to listen to what nature has to say rather than
projecting one’s prejudices onto nature.
I came to the United States certainly at an interesting time, when “the best and
brightest” in the White House and on the Wall Street have drastically transformed
the world’s social and economic orders. As a “Stranger in a Strange Land”, I
appreciate Andy for many illuminating discussions that shed light on the perplexing
and intriguing events in the U.S. and on the world stage. He observes the society at a
vantage point beyond his ethnical, national, and cultural background. I never have to
worry about offending him by expressing my honest views. I thank Andy for rallying
a group of people (mostly astronomers) to have discussions on real-world events in
a similar fashion as we discuss science. We attempt to distinguish between real and
bogus information from various sources and analyze the events by confronting ideas
against distilled evidence and data. Although I do not always agree with Andy, it
has always been such a great time talking to him on almost anything.
Next I thank my thesis committee members, Scott Gaudi and Kris Stanek.
vi
I first met Scott back when he was a postdoc at CfA. He has been providing
unreserved encouragement and support to me since then. I have benefited greatly
from his advice on science and professional development. I thank him for having
led me to see the bright side when I was in depression or frustration. I admire
his comprehensive knowledge in almost all branches of astronomy and his often
amazingly deep physical intuition.
I thank Kris for being friendly and humorous and willing to talk to me on any
topic. I thank him for inspiring discussions that led to my first original workable
science idea. I appreciate his insistence that scientific pictures should fundamentally
be simple. He always has great insights into complexity and sees through irrelevant
artificial obscuration. I hope I will be able to acquire even a fraction of his ability in
seizing the moments by recognizing fleeting opportunities in astronomy.
I am grateful to Rick Pogge, Chris Kochanek and many others in the department
for providing a lot of great technical help and having interesting science discussions
together. And I greatly appreciate many faculty in the department for creating an
intellectually stimulating environment which is also very student-friendly. Daily
coffee has become an indispensable part of my life. The department staff are
always helpful, and I especially thank David Will for solving many of my computer
problems.
vii
I have spent a great time with many graduate students and several postdocs at
OSU. I first thank Zheng for having provided me many great advice when I first got
to OSU, and for him being a role-model student and astronomer. He shows me how
high someone with a similar background as me can possibly achieve in astronomy. I
thank Jose, Vimal, Dale, Himel and others for having a lot of interesting discussions
and experience together. And I appreciate having many great office mates, Frank,
Molly, Ondrej, Kelly, Rob, Deokkeun, etc., during the last five years. Szymon and
Xinyu have been great friends and colleagues.
I could not possibly have finished my PhD work without many colleagues
outside Ohio State. In particular, I thank Andrzej Udalski for being an exemplary
astronomer. I never fail to be thrilled by the first-rate data he and the OGLE team
have produced. A lot of crucial data I have analyzed come from many “amateur”
astronomers: Jennie McCormick, Grant Christie and Berto Monard, to name a few.
They are amateurs only in the sense that they are not paid for their observations,
otherwise what they do are extraordinarily professional. Their fascination in the sky,
enthusiasm and dedication have been a constant source of inspiration to me. I thank
all my observer colleagues for offering me the privilege in being the first to see many
beautiful light curves and realize the exotic extrasolar worlds that they reveal. The
precious moments of discovery transcend anything else.
viii
I wish to express my deep gratitude to Profs. Dawei Yang, Tianyi Huang,
Qiusheng Gu and Dr. Jin Zhu for their selfless help and great guidance at various
important stages before I entered the graduate school.
At last, I thank my mother and father, who put my education, both in acquiring
knowledge and in being a decent member of the society, as the first priority in
their life. Without their unequivocal support and sacrifice, I would have never been
anywhere near the position I am at today to write this dissertation.
ix
VITA
March 18, 1982 . . . . . . . . . . . . . . . . Born – Chengde, Hebei, China
2004 . . . . . . . . . . . . . . . . . . . . . . . . . . . B.S. Astronomy, Nanjing University, China
2004 – 2005 . . . . . . . . . . . . . . . . . . . . University Fellow, The Ohio State University
2005 – 2006 . . . . . . . . . . . . . . . . . . . . Graduate Research Assistant,
The Ohio State University
2006 . . . . . . . . . . . . . . . . . . . . . . . . . . . M.S. Astronomy, The Ohio State University
2006 – 2008 . . . . . . . . . . . . . . . . . . . . Graduate Research and Teaching Assistant,
The Ohio State University
2008 – 2009 . . . . . . . . . . . . . . . . . . . . . Presidential Fellow, The Ohio State University
PUBLICATIONS
Research Publications
1. A. Udalski, et al. “A Jovian-Mass Planet in Microlensing Event OGLE-2005-BLG-071”, ApJL, 628, 109L, (2005).
2. Subo Dong, et al. “Planetary Detection Efficiency of the Magnification3000 Microlensing Event OGLE-2004-BLG-343”, ApJ, 642, 842, (2006).
3. A. Gould, A. Udalski, D. An, D. P. Bennett, A.-Y. Zhou, S. Dong, et al.“Microlens OGLE-2005-BLG-169 Implies That Cool Neptune-like Planets AreCommon”, ApJL, 644, L37, (2006).
x
4. Subo Dong, “Probing ∼100 AU Intergalactic Mg II Absorbing ‘Cloudlets’with Quasar Microlensing”, ApJ, 660, 206, (2007).
5. Subo Dong, et al. “First Space-Based Microlens Parallax Measurement:Spitzer Observations of OGLE-2005-SMC-001”, ApJ, 664, 862, (2007).
6. B.S. Gaudi, D. P. Bennett, A. Udalski, A. Gould, G. W. Christie, D.Maoz, S. Dong, et al. “Discovery of a Jupiter/Saturn Analog with GravitationalMicrolensing”, Science, 319, 927, (2008).
7. B. Scott Gaudi, Joseph Patterson, David S. Spiegel, Thomas Krajci, R.Koff, G. Pojmanski, Subo Dong, et al. “Discovery of a Very Bright, NearbyGravitational Microlensing Event”, ApJ, 677, 1268, (2008).
8. Subo Dong, et al. “OGLE-2005-BLG-071Lb, the Most Massive M DwarfPlanetary Companion?”, ApJ, 695, 970, (2009).
9. Subo Dong, et al. “Microlensing Event MOA-2007-BLG-400: Exhumingthe Buried Signature of a Cool, Jovian-Mass Planet”, ApJ, 698, 1826, (2009).
10. A.Gould, A.Udalski, B.Monard, K.Horne, Subo Dong, et al. “The ExtremeMicrolensing Event OGLE-2007-BLG-224: Terrestrial Parallax Observation of aThick-Disk Brown Dwarf”, ApJL, 698, L147, (2009).
FIELDS OF STUDY
Major Field: Astronomy
xi
Table of Contents
Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ii
Dedication . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iv
Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v
Vita . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . x
List of Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xvii
List of Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xix
Chapter 1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Chapter 2 Planetary Detection Efficiency of the Magnification 3000Microlensing Event OGLE-2004-BLG-343 . . . . . . . . . . . . . . . 12
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
2.1.1 High-Magnification Events & Earth-Mass Planets . . . . . . . 13
2.1.2 Planet Detection Efficiencies: Philosophy and Methods . . . . 17
2.1.3 ”Seeing” the Lens in High-Magnification Events . . . . . . . . 22
2.2 Observational Data . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
2.3 Event Modeling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.3.1 Point-Source Point-Lens Model . . . . . . . . . . . . . . . . . 28
2.3.2 Source Properties from Color-Magnitude Diagram . . . . . . . 29
2.3.3 Finite-Source Effects . . . . . . . . . . . . . . . . . . . . . . . 35
2.3.4 Monte-Carlo Simulation . . . . . . . . . . . . . . . . . . . . . 36
xii
2.4 Detecting Planets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
2.4.1 Detection Efficiency . . . . . . . . . . . . . . . . . . . . . . . . 41
2.4.2 Constraints on Planets . . . . . . . . . . . . . . . . . . . . . . 46
2.4.3 No Planet Detected . . . . . . . . . . . . . . . . . . . . . . . 47
2.4.4 Fake Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
2.4.5 Detection Efficiency in Physical Parameter Space . . . . . . . 49
2.5 Luminous Lens? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
2.6 Summary and Discussion . . . . . . . . . . . . . . . . . . . . . . . . 55
Chapter 3 Microlensing Event MOA-2007-BLG-400: Exhuming theBuried Signature of a Cool, Jovian-Mass Planet . . . . . . . . . . . 72
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
3.2 Observations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
3.3 Microlens Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
3.3.1 Hybrid Pixel/Ray Map Algorithm . . . . . . . . . . . . . . . 86
3.3.2 The (w, q) Grid of Lens Geometries . . . . . . . . . . . . . . 88
3.3.3 Best-Fit Model . . . . . . . . . . . . . . . . . . . . . . . . . . 88
3.4 Finite-Source Effects . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
3.5 Limb Darkening . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
3.6 Blended Light . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
3.7 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
Chapter 4 OGLE-2005-BLG-071Lb, the Most Massive M-DwarfPlanetary Companion? . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
4.2 Overview of Data and Types of Constraints . . . . . . . . . . . . . . 115
xiii
4.3 Constraining the Physical Properties of the Lens and its PlanetaryCompanion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118
4.3.1 Microlens Parallax Effects . . . . . . . . . . . . . . . . . . . . 119
4.3.2 Fitting Planetary Orbital Motion . . . . . . . . . . . . . . . . 121
4.3.3 Finite-source effects and Other Constraints on θE . . . . . . . 122
4.3.4 HST Astrometry . . . . . . . . . . . . . . . . . . . . . . . . . 129
4.3.5 “Seeing” the Blend with HST . . . . . . . . . . . . . . . . . . 133
4.3.6 Final Physical Constraints on the Lens and Planet . . . . . . 139
4.3.7 Constraints on a Non-Luminous Lens . . . . . . . . . . . . . . 146
4.3.8 Xallarap Effects and Binary Source . . . . . . . . . . . . . . . 151
4.4 Summary and Future Prospects . . . . . . . . . . . . . . . . . . . . . 154
4.5 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 158
Chapter 5 First Space-Based Microlens Parallax Measurement: SpitzerObservations of OGLE-2005-SMC-001 . . . . . . . . . . . . . . . . . 181
5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
5.2 Observations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 186
5.2.1 Error Rescaling . . . . . . . . . . . . . . . . . . . . . . . . . . 189
5.2.2 Spitzer Data Reduction and Error Determination . . . . . . . 190
5.3 Complications Alter Strategy and Analysis . . . . . . . . . . . . . . 192
5.3.1 Need for Additional Spitzer Observation . . . . . . . . . . . . 193
5.3.2 Eight-fold Way . . . . . . . . . . . . . . . . . . . . . . . . . . 196
5.4 Binary Orbital Motion . . . . . . . . . . . . . . . . . . . . . . . . . . 197
5.4.1 Static Binary Lens Parameters . . . . . . . . . . . . . . . . . 199
5.4.2 Binary Orbital Parameters . . . . . . . . . . . . . . . . . . . 201
5.4.3 Summary of Parameters . . . . . . . . . . . . . . . . . . . . . 203
xiv
5.5 Search for Solutions . . . . . . . . . . . . . . . . . . . . . . . . . . . 204
5.5.1 Convergence . . . . . . . . . . . . . . . . . . . . . . . . . . . 207
5.6 Solution Triage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207
5.6.1 Wide-Binary Solutions . . . . . . . . . . . . . . . . . . . . . . 210
5.6.2 Close-Binary Solutions . . . . . . . . . . . . . . . . . . . . . . 211
5.7 Lens Location . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 211
5.7.1 Halo Lenses . . . . . . . . . . . . . . . . . . . . . . . . . . . . 213
5.7.2 SMC Lenses . . . . . . . . . . . . . . . . . . . . . . . . . . . 215
5.7.3 SMC Structure . . . . . . . . . . . . . . . . . . . . . . . . . . 216
5.7.4 Likelihood Ratios . . . . . . . . . . . . . . . . . . . . . . . . 220
5.7.5 Kepler Constraints for Close Binaries . . . . . . . . . . . . . 221
5.7.6 Kepler Constraints for Wide Binaries . . . . . . . . . . . . . . 223
5.7.7 Constraints From (Lack of) Finite-Source Effects . . . . . . . 225
5.8 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 225
Chapter 6 Probing ∼ 100AU Intergalactic Mg IIAbsorbing “Cloudlets”with Quasar Microlensing . . . . . . . . . . . . . . . . . . . . . . . . . 245
6.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 245
6.2 Varying Microlensed Quasar Image as A “Ruler” . . . . . . . . . . . 248
6.2.1 Basic Geometric Configurations and Motions . . . . . . . . . 249
6.2.2 Bulk Motion of the Un-microlensed “Ray Bundles” . . . . . . 251
6.3 Application to Q2237+0305 . . . . . . . . . . . . . . . . . . . . . . . 252
6.4 Discussion and Conclusion . . . . . . . . . . . . . . . . . . . . . . . . 260
Appendix A Two New Finite-Source Algorithms . . . . . . . . . . . . 264
A.1 Map-Making . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 264
xv
A.2 Loop-Linking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 269
A.3 Algorithm Parameters . . . . . . . . . . . . . . . . . . . . . . . . . . 272
Appendix B MOA-2003-BLG-32/OGLE-2003-BLG-219 . . . . . . . . 275
Appendix C Extracting Orbital Parameters for Circular PlanetaryOrbit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 278
Appendix D Failure of Elliptical-Source Models for MOA-2007-BLG-400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 281
Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 283
xvi
List of Tables
2.1 OGLE-2004-BLG-343 Best-Fit PSPL Model Parameters . . . . . . . 71
3.1 MOA-2007-BLG-400 Best-fit Planetary Models . . . . . . . . . . . . . 110
4.1 OGLE-2005-BLG-071 Light Curve Parameter EstimationsFrom Markov chain Monte Carlo Simulations A (Part 1). 174
4.2 OGLE-2005-BLG-071 Light Curve Parameter EstimationsFrom Markov chain Monte Carlo Simulations A (Part 2). 175
4.3 OGLE-2005-BLG-071 Light Curve Parameter EstimationsFrom Markov chain Monte Carlo Simulations B (Part 1). 176
4.4 OGLE-2005-BLG-071 Light Curve Parameter EstimationsFrom Markov chain Monte Carlo Simulations B (Part 2). 177
4.5 OGLE-2005-BLG-071 Derived Physical Parameters . . . . . 178
4.6 Jovian-mass Companions to M Dwarfs (M∗ < 0.55 M⊙) . . . . 179
4.6 Jovian-mass Companions to M Dwarfs (M∗ < 0.55 M⊙) . . . . 180
5.1 OGLE-2005-SMC-001 Light Curve Models: Free Blending. . . . . . . 229
5.1 OGLE-2005-SMC-001 Light Curve Models: Free Blending. . . . . . . 230
5.1 OGLE-2005-SMC-001 Light Curve Models: Free Blending. . . . . . . 231
5.1 OGLE-2005-SMC-001 Light Curve Models: Free Blending. . . . . . . 232
5.1 OGLE-2005-SMC-001 Light Curve Models: Free Blending. . . . . . . 233
5.2 OGLE-2005-SMC-001 Light Curve Models: Zero Blending. . . . . . . 234
5.2 OGLE-2005-SMC-001 Light Curve Models: Zero Blending. . . . . . . 235
5.2 OGLE-2005-SMC-001 Light Curve Models: Zero Blending. . . . . . . 236
xvii
5.2 OGLE-2005-SMC-001 Light Curve Models: Zero Blending. . . . . . . 237
5.2 OGLE-2005-SMC-001 Light Curve Models: Zero Blending. . . . . . . 238
xviii
List of Figures
2.1 Light curve of OGLE-2004-BLG-343 near its peak on 2004 June 19(HJD 2,453,175.7467) . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
2.2 CMD of the OGLE-2004-BLG-343 field. . . . . . . . . . . . . . . . . 60
2.3 Color-color diagram in a field centered on OGLE-2004-BLG-343. . . . 61
2.4 Likelihood contours for finite-source points-lens models relative to thebest-fit PSPL model for OGLE-2004-BLG-343. . . . . . . . . . . . . . 62
2.5 Probability distributions of various parametrs for Monte Carlo eventstoward the line of sight of OGLE-2004-BLG-343. . . . . . . . . . . . . 63
2.6 OGLE-2004-BLG-343 planetary detection efficiency for three massratios (q) as a function of the planet-star separation bx and by. . . . . 64
2.7 Planet detection efficiency of OGLE-2004-BLG-343 as a function of band q. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
2.8 OGLE-2004-BLG-343 planetary detection efficiency for four massratios with simulated peak data points as a function of bx and by. . . 66
2.9 OGLE-2004-BLG-343 planetary detection efficiency as a function of band q with simulated peak data points. . . . . . . . . . . . . . . . . . 67
2.10 OGLE-2004-BLG-343 planetary detection efficiency as a function of thephysical projected star-planet distance (r⊥) and the planetary mass (mp). 68
2.11 Planetary detection efficiency as a function of r⊥ and mp for OGLE-2004-BLG-343 augmented by simulated data points covering the peak. 69
2.12 Planetary detection efficiency as a function of r⊥ andd mp for OGLE-2004-BLG-343, by assuming that the blended light is due to the lens. 70
3.1 Lightcurve of MOA-2007-BLG-400 . . . . . . . . . . . . . . . . . . . 103
xix
3.2 Comparisons between two photometric reductions of µFUN H-banddata for MOA-2007-BLG-400. . . . . . . . . . . . . . . . . . . . . . . 104
3.3 Magnification differences between of best-fit planetary model andsingle-lens models for MOA-2007-BLG-400 . . . . . . . . . . . . . . . 105
3.4 ∆χ2 Contours as a function of planet-star mass ratio q and projectedplanet-star separation d, as well as “short caustic diameter” w. . . . . 106
3.5 Instrumental color-magnitude diagram of field containing MOA-2007-BLG-400. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
3.6 Bayesian relative probability densities for the physical properties of theplanet MOA-2007-BLG-400Lb and its host star. . . . . . . . . . . . . 108
3.7 Constraining the blended flux from CTIO I-band images for MOA-2007-BLG-400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
4.1 Light curve of planetary microlensing event OGLE-2005-BLG-071 . . 162
4.2 Probability contours of OGLE-2005-BLG-071 microlens parallaxparameters derived from MCMC simulations. . . . . . . . . . . . . . . 163
4.3 CMD for the OGLE-2005-BLG-071 field . . . . . . . . . . . . . . . . 164
4.4 HST ACS astrometric measurements of OGLE-2005-BLG-071 . . . . 165
4.5 Posterior probability contours for OGLE-2005-BLG-071 relative lens-source proper motion. . . . . . . . . . . . . . . . . . . . . . . . . . . . 166
4.6 Differences between OGLE V and HST F555W magnitudes for thematched stars in the field of OGLE-2005-BLG-071. . . . . . . . . . . 167
4.7 Probability contour of OGLE-2005-BLG-071 lens mass M and relativelens-source parallax πrel without assuming the HST blend is the lens. 168
4.8 Probability contour of OGLE-2005-BLG-071 lens mass M and relativelens-source parallax πrel by assuming the blend is the lens. . . . . . . 169
4.9 Probability contours of OGLE-2005-BLG-071Lb projected velocityr⊥γ in the units of vc,⊥ . . . . . . . . . . . . . . . . . . . . . . . . . . 170
4.10 Probability distributions of OGLE-2005-BLG-071Lb planetaryparameters from MCMC realizations assuming circular orbital motion. 171
xx
4.11 χ2 distributions for OGLE-2005-BLG-071 best-fit xallarap solutions atfixed binary-source orbital periods P . . . . . . . . . . . . . . . . . . . 172
4.12 OGLE-2005-BLG-071 xallarap fits results by fixing binary orbitalphase λ and complement of inclination β at 1 yr period and u0 > 0. . 173
5.1 Standard (Paczynski 1986) microlensing fit to the light curve of OGLE-2005-SMC-001 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239
5.2 Why 4 (not 3) Spitzer observations are needed to measure πE =(πE,‖, πE,⊥) for OGLE-2005-SMC-001. . . . . . . . . . . . . . . . . . . 240
5.3 Parallax πE = (πE,N , πE,E) 1 σ error ellipses for all discrete solutionsfor OGLE-2005-SMC-001. . . . . . . . . . . . . . . . . . . . . . . . . 241
5.4 Best-fit binary microlensing model for OGLE-2005-SMC-001. . . . . . 242
5.5 Likelihood contours of the inverse projected velocity Λ ≡ v/v2 forSMC lenses together with Λ values for OGLE-2005-SMC-001. . . . . 243
5.6 Likelihood contours of the inverse projected velocity Λ ≡ v/v2 for halolenses together with Λ values for OGLE-2005-SMC-001. . . . . . . . . 244
6.1 Quasar microlensing caustics network, light curve and images. . . . . 262
6.2 Physical positions of “ray bundles” from a microlensed quasar in theframe of cloud at redshifts 0.1, 0.57, 0.83 and 1.69. . . . . . . . . . . 263
B.1 Planetary exclusion regions for microlensing event MOA-2003-32/OGLE-2003-BLG-219 . . . . . . . . . . . . . . . . . . . . . . . . . 277
xxi
Chapter 1
Introduction
The spatial and temporal scales tangible to mankind are insignificant compared
to those of the observable universe. Nevertheless, we aspire to apprehend all wonders
of the nature and understand her secrets, ranging from the sub-atomic fabric of
matter to the origin and evolution of the entire universe. To break human beings’
in-born limits, scientists utilize new tools designed with the knowledge we have
learned from nature to probe nature. Modern astronomy was born four centuries
ago when Galileo built his telescope according to newly discovered laws of optics and
pointed it to the sky. Moreover, earthbound astronomers also attempt to use special
celestial circumstances bestowed by the universe to study what is often unreachable
from available technologies. Astronomers have long mastered the parallax technique
by using the Earth’s motion to measure distances to faraway stars. Until recently
the cosmic alignment among the Earth, Moon and Sun had been the only chance
for us to glimpse the part of solar atmosphere normally overwhelmed by the glaring
photosphere.
1
Gravitational microlensing is an elegant method that follows the great
astronomy tradition by making use of cutting-edge technologies as well as capturing
rare celestial opportunities. In the last two decades, the technique has opened up
new windows to many aspects of the universe which are often times inaccessible via
other means.
This dissertation is composed of several astrophysical applications of
gravitational microlensing. Chapters 2, 3 and 4 are on discovering and characterizing
extrasolar planets with microlensing toward the Galactic bulge. Chapter 5 presents
the first space-based microlens parallax measurements. The parallax determinations
constrain the distance of a MACHO (MAssive Compact Halo Object) candidate in
the direction of the Small Magellanic Cloud. In Chapter 6, I propose a method to
probe small-scale structures of the intergalactic medium with quasar microlensing.
Microlensing as a field has been rapidly developing during the course of
completion of this dissertation. An overview of microlensing is given in the
following along with brief discussions on how the dissertation topics fit into ongoing
microlensing research and broader context of astrophysics.
Gravitational microlensing occurs when a background “source” (a star for
microlensing in the Local Group or a quasar for cosmological microlensing) and a
foreground object (“lens”) get very closely aligned (within the “Einstein radius” rE)
in the sight line of the observer. The light beams from the source are bent during
2
the passage near the lens according to the General Relativity, and the flux from the
source seen by the observer gets magnified. Typically the angular size of the Einstein
radius θE for a Galactic microlensing event is on the order of a milli-arcsecond, which
makes the “optical depth”, the probability of microlensing events, extremely small in
our Galaxy. Therefore, although the theoretical possibility of Galactic microlensing
has been discussed by numerous people since early last century (e.g., Einstein 1936;
Liebes 1964), it was not until the visionary paper by Paczynski (1986) that the
observational plausibility of stellar microlensing was brought into the sight of most
astronomers.
Overwhelming astronomical evidences suggest that most of the matter in the
universe is in the form of “dark matter”, whose existence has so far only exhibited
from its gravitational effects. Paczynski (1986) demonstrated that, if most of the
dark matter in the Galactic halo were made of dim massive compact objects, the
optical depth of microlensing would be significantly boosted to ∼ 10−6. Paczynski
(1986) proposed that one could search for MACHOs by identifying microlensing
variability, which follows a distinct analytical form, from simultaneously monitoring
brightness changes of millions of stars on a daily basis. It has also been shown that
the optical depth solely due to the contributions from the known stellar populations
toward the Galactic bulge should be of the same order (Kiraga & Paczynski 1994).
The massive data collection and processing required to search for microlensing
events was unprecedented in astronomy. Fortunately, the silicon revolution brought
3
the relevant technology to astronomers in the last decade of 20th century. In
particular, the advent of large format CCD, powerful computer hardware as well
as sophisticated data analysis software that could obtain fast and precise relative
photometry in wide and crowded stellar fields made microlensing surveys feasible.
In 1993, three collaborations announced their independent discoveries of the first
microlensing events (Udalski et al. 1993; Alcock et al. 1993; Aubourg et al. 1993).
Nowadays, OGLE-III and MOA-II surveys discover a total of about one thousand
microlensing events toward the Galactic bulge every year.
Mostly driven by observations, rich astrophysical applications of microlensing
have been revealed theoretically. An important and exciting observational prospect
is to search for extrasolar planets with microlensing, which was recognized by Mao
& Paczynski (1991) (see also earlier work by Liebes 1964). At that time, the Sun
was the only known star that harbored planets. Mao & Paczynski (1991) found
that lenses orbited by planetary companions near rE could give rise to microlensing
light curves easily distinguishable from those by single stellar lenses. In fact, the
first microlensing planet discovered by OGLE and MOA more than a decade later
(Bond et al. 2004) was via an event perturbed by a resonant caustic, which bore
close similarity to what was illustrated in Fig. 1 of Mao & Paczynski (1991).
Gould & Loeb (1992) did a comprehensive study on generic planetary
perturbations in microlensing. They gave the characteristics of the most common
type of planetary signature, whose duration is proportional to the square root of
4
planet-to-star mass ratio. The signal of a Jupiter lasts for a couple of days, so
daily observing cadence by a microlensing survey group is generally not sufficient
to capture a planet. They proposed to form follow-up groups who would do
round-of-clock, intensive monitoring on a selective number of events found by the
survey group to specifically search for planets.
Gould & Loeb (1992) found that microlensing was most sensitive to planets in
the so-called “lensing zone” ([0.6 − 1.6]rE), which corresponds to a few Astronimical
Unit (AU) for a typical Galactic microlensing event. Intriguingly, this is coincident
with the orbital radius of the Jupiter, the dominant planet of our Solar System.
Therefore, solar system analogs should be the easiest targets to find via microlensing.
Furthermore, Bennett & Rhie (1996) found that, an Earth-mass planet could produce
discernible hour-long signals in a Gould & Loeb type event. By contrast, the most
productive planet finding technique, Radial Velocity (RV), which was still then in its
cradle, needs about 10 years of observations to find a Jupiter. And the RV precision
that could enable Earth-mass planet detections was beyond a spectroscopist’s wildest
dreams. Even today, microlensing remains the most powerful technique to search for
Solar System analogs and Earth-mass planets from the ground.
Inspired by these theoretical considerations, the first microlensing planet
follow-up groups were formed according to the Gould & Loeb (1992) strategy in the
mid 1990s. In their early years, these groups made many great discoveries in areas
such as stellar-binary lensing (Albrow et al. 2000a), stellar atmosphere (Albrow et al.
5
1999b), etc, thanks to the high-cadence, high-precision monitoring. However, several
early claims on discoveries of microlensing planets (e.g., Bennett et al. 1999; Sahu
et al. 2001; Rhie et al. 2000) fail to gain acceptance due to inadequate modeling
(Albrow et al. 2000a), erroneous data analysis (Sahu et al. 2002), contradictory
observational data, or simply insufficient statistical significance.
Gaudi et al. (2002) analyzed five-years (1995-2000) of well-covered microlensing
events by the PLANET collaboration. They found no convincing detections of
planets, and they concluded that Jupiter-mass planets at the lensing zone around
stellar lenses are not very common (less than ∼ 33%). The first Gould & Loeb
type planet discovery, OGLE-2005-BLG-390, was led by PLANET and OGLE in
2005 (Beaulieu et al. 2006), which was a Neptune-mass planet (5.5+5.5−2.7M⊕). Since
microlensing is much more sensitive to Jupiters than Neptunes, this appears to
indicate Neptune-mass planets are much more common at a few AU than Jupiters
around M dwarfs – the most common stellar lenses. This conclusion was further
supported by additional microlensing planet discoveries (Gould et al. 2006).
The seminal work by Griest & Safizadeh (1998) proposed a different approach
microlensing planet searches from that of Gould & Loeb (1992). Their idea was
to find planets in high-magnification events (HMEs) via central caustics – then
an overlooked type of planetary perturbation. Central caustics have smaller cross
sections than those of planetary caustics, which are responsible for Gould & Loeb
events. Thus an uniform sample of planetary microlensing events will mostly consist
6
of Gould & Loeb type events. However, Griest & Safizadeh (1998) pointed out that
central caustics always perturb the peak of events that reach high magnification
(Amax>∼ 100), and an individual HME has nearly ∼ 100% sensitivity to any Jupiters
within the lensing zone. An additional practical advantage of this channel is that the
time that a HME reaches maximum magnification can be predicted in advance so
that one can focus limited resource to observe over the peak only, whereas one must
monitor the entire microlensing event to effectively track down Gould & Loeb-type
perturbations.
The significance of Griest & Safizadeh (1998) was immediately grasped by the
microlensing community. Several follow-up studies on HMEs (e.g., Gaudi et al. 1998;
Rattenbury et al. 2002; Bond et al. 2002; Abe et al. 2004) have further enriched the
scope of this planetary detection channel and further demonstrated its effectiveness.
For example, it is found that HMEs are especially sensitive to multiple-planet
systems (Gaudi et al. 1998). However, the potential of HMEs was not fully developed
immediately after Griest & Safizadeh (1998). One major practical difficulty was
that, HMEs are very rare, and the capacity of the survey groups only permitted
detections of a handful of them per year. Furthermore, the perturbations by central
caustics is more complex and much less intuitive than those by planetary caustics.
Central-caustic perturbations were also much less understood than the planetary
caustics ones, so many practitioners of microlensing were concerned with uniqueness
of models from planetary HMEs. Finally, modeling planetary high-magnification
7
events was non-trivial as the two-point-mass-lens calculations with finite-source
effects at high magnification were very time-consuming. The less intuitive light-curve
features and complicated parameter-space structure only exacerbated the modeling
difficulties.
The work presented in Chapter 2 was started in 2004, and it was on analyzing
OGLE-2004-BLG-343 – the single-lens microlensing event with highest peak
magnification (∼ 3000) that had been ever analyzed. Due to observer error, most of
the peak region was not well covered, which significantly reduced its sensitivity to
planets. However, it is shown that had the event been well sampled, it would have
been sensitive to almost all Neptune-mass planets over a factor of 5 of projected
separation and would even have had some sensitivity to Earth-mass planets. This
event highlighted the importance of aggressively covering the peak of HMEs and
showed the great potential of these events achievable with the ongoing projects.
To derive the planetary detection efficiency, I performed the most intensive yet
binary-lens finite-source calculations at high magnification. As a “by-product”, I
developed new algorithms (see APPENDIX A) that improved the speed of such
calculations by a few orders of magnitudes for high-magnification events compared
to previous codes. The algorithms and improved version (see Chapter 3) have played
crucial roles in analyzing the majority of planets discovered from HMEs.
The OSU-based Microlensing Follow-Up Network (µFUN) has started to
primarily focus on HMEs since 2004-2005 season. We have gradually enlarged a
8
global network of 1m-class and smaller (∼ 30− 50 cm) telescopes that offers 24-hour
coverage with some geographical redundancy in longitude. Such redundancy helps
to alleviate weather problems and sometimes provides additional constraints on
microlensing parameters (e.g. terrestrial parallax measurements by Gould et al.
2009). Due to their high peak magnifications, one can more easily obtain precise
photometry over the peak from telescopes of all sizes. And thanks to the increasing
number of HMEs due to upgraded survey telescopes of OGLE and MOA, we are
currently able to intensively follow up a dozen or so HMEs per year. This strategy
has proven quite fruitful. µFUN has led the discoveries of ∼ 10 microlensing
planets in the last 4 years, including the first multiple planetary system through
microlensing, which turned out to be the first extrasolar Jupiter-Saturn analog
(Gaudi et al. 2008).
In Chapter 3, a planetary microlensing event discovered by µFUN is analyzed.
The planet, MOA-2007-BLG-400Lb, was detected in a magnification Amax = 628
event, which exhibited severe finite-source effects. This was the first planetary
high-magnification event in which the central caustics was smaller than the size of
the source. A new parametrization based on central-caustic size was developed that
enabled efficient and thorough parameter space search. It was demonstrated that
the planetary nature of the deviation can be unambiguously ascertained from the
gross features of the residuals. Detailed analysis yields a fairly precise planet/star
9
mass ratio of q = (2.5+0.5−0.3)× 10−3, in accord with the large significance (∆χ2 = 1070)
of the detection.
Chapter 4 is on characterizing the physical properties of Jupiter-mass planet
OGLE-2005-BLG-071Lb and its host. One can directly infer planet/star mass
ratio q and the planet-star projected separation d (in units of the angular Einstein
radius) from a planetary microlensing light curve. By combining the parallax
(Gould 1992) and angular Einstein radius measurements derived from finite-source
effects, one can directly determine the lens mass and distance and subsequently the
planet mass and star-planet physical separation. In this chapter, we obtain such
constraints from microlens parallax, finite-source effects and also planetary orbital
motion from the light curve. In addition, we obtained photometric and astrometric
measurements from Hubble Space Telescope. Our primary analysis leads to the
conclusion that the host of Jovian planet OGLE-2005-BLG-071Lb is an M dwarf in
the foreground disk with mass M = 0.46± 0.04 M⊙, distance Dl = 3.2± 0.4 kpc, and
thick-disk kinematics vLSR ∼ 103 km s−1. From the best-fit model, the planet has
mass Mp = 3.8 ± 0.4 MJupiter. These results from the primary analysis suggest that
OGLE-2005-BLG-071Lb was likely to be the most massive planet yet discovered
that is hosted by an M dwarf.
Chapter 5 presents the first space-based microlens parallax measurements
of OGLE-2005-SMC-001. The parallax measurement yields a projected velocity
v ∼ 230 km s−1, the typical value expected for halo lenses, but an order of magnitude
10
smaller than would be expected for lenses lying in the Small Magellanic Cloud (SMC)
itself. The lens is a weak (i.e., non-caustic-crossing) binary, which complicates the
analysis considerably but ultimately contributes additional constraints. It is found
that the likelihood ratio is Lhalo/LSMC = 20. Hence, halo lenses are strongly favored
but SMC lenses are not definitively ruled out.
Intergalactic Mg II absorbers are known to have structures down to scales
∼ 102.5pc, and there are now indications that they may be fragmented on scales
∼< 10−2.5pc (Hao et al. 2006). Chapter 6 presents a method to probe Mg II
and other absorption “cloudlets” with sizes ∼ 10−4.0 − 10−2.0pc using quasar
microlensing. When a lensed quasar is microlensed, the micro-images of the quasar
experience creation, destruction, distortion, and drastic astrometric changes during
caustic-crossings. It is shown that these small-scale structures in the intergalactic
medium could induce significant spectral variability on timescales of months to
years. With numerical simulations, I demonstrate the feasibility of applying this
method to Q2237+0305, and I show that high-resolution spectra of this quasar in the
near future would provide a clear test of the existence of such metal-line absorbing
“cloudlets”.
11
Chapter 2
Planetary Detection Efficiency of the
Magnification 3000 Microlensing Event
OGLE-2004-BLG-343
2.1. Introduction
Current microlensing planet searches focus significant effort on high-
magnification events, which have great promise for detecting low-mass extrasolar
planets. It is therefore crucial to understand the potential for discovering planets
and to optimize the early identifications and observational strategy of such events.
In previous planetary detection efficiency analyses of high-magnification events,
finite-source effects have often been ignored mainly due to computational limitations.
However, such effects are intrinsically important in these events because the sources
are more likely to be resolved at very low impact parameters. In this study, we
improve the method of Yoo et al. (2004b) by incorporating finite-source effects to
characterize the planetary detection efficiency of the extremely high-magnification
event OGLE-2004-BLG-343, and we develop new efficient algorithms to make the
calculations possible. Moreover, we attempt to find useful observational signatures
12
of high-magnification events so as to help alleviate the difficulties in their early
recognition.
2.1.1. High-Magnification Events & Earth-Mass Planets
Apart from pulsar timing (Wolszczan & Frail 1992), microlensing is at present
one of the few planet-finding techniques that is sensitive to Earth-mass planets.
A planetary companion of an otherwise isolated lens star introduces two kinds of
caustics into the magnification pattern: “planetary caustics” associated with the
planet itself and a “central caustic” associated with the primary lens. 1 When the
source passes over or close to one of these caustics, the light curve deviates from its
standard Paczynski (1986) form, thus revealing the presence of the planet (Mao &
Paczynski 1991; Liebes 1964).
Since planetary caustics are generally far larger than central caustics, a
“fair sample” of planetary microlensing events would be completely dominated
by planetary-caustic events. Nevertheless, central caustics play a crucial role in
current microlensing planet searches, particularly for Earth-mass planets (Griest &
Safizadeh 1998), for the simple reason that it is possible to predict in advance that
the source of a given event will arrive close to the center of the magnification pattern
where it will probe for the presence of these caustics. Hence, one can organize the
1When the planetary companion is close to the Einstein ring, the planetary and central caustics
merge into a single “resonant caustic”.
13
intensive observations required to characterize the resulting anomalies. By contrast,
the perturbations due to planetary caustics occur without any warning. The lower
the mass of the planet, the shorter the duration of the anomaly, and so the more
crucial is the warning to intensify the observations. This is the primary reason that
planet-searching groups give high priority to high-magnification events, i.e., those
that probe the central caustics 2. As a bonus, high-magnification events are also
more sensitive to planetary-caustic perturbations than are typical events (Gould
& Loeb 1992) since their larger images increase the chances that they will be
perturbed by planets. However, this enhancement is relatively modest compared to
the rich potential of central-caustic crossings.
In principle, it is also possible to search for Earth-mass planets from
perturbations due to their larger (and so more common) planetary and resonant
caustics, but this would require a very different strategy from those currently being
carried out. The problem is that these perturbations occur without warning during
otherwise normal microlensing events, and typically last only 1 or 2 hr. Hence, one
2 Note that although high-magnification events are guaranteed to have low impact parameters,
peak magnification for events with low impact parameters are not necessarily high if they have
relatively large source sizes. And large sources will tend to smear out the perturbations induced by
the central caustics, thereby decreasing the planetary sensitivity (Griest & Safizadeh 1998; Chung
et al. 2005). So when central caustics are important in producing planetary signals, the maximum
magnification of microlensing events serves as a better indicator of planetary detection efficiency
than the impact parameter.
14
would have to intensively monitor the entire duration of many events. The only
way to do this practically is to intensively monitor an entire field containing many
ongoing microlensing events roughly once every 10 minutes in order to detect and
properly characterize the planetary deviations. Proposals to make such searches
have been advanced for both space-based (Bennett & Rhie 2002) and ground-based
(Sackett 1997) platforms.
At present, two other microlensing planet-search strategies are being pursued.
Both strategies make use of wide-area (∼> 10 deg2) searches for microlensing events
toward the Galactic bulge. Observations are made once or a few times per night
by the OGLE-III3 (Udalski 2003) and MOA4 (Bond et al. 2001) surveys. When
events are identified, they are posted as “alerts” on their respective Web sites. In
the first approach, these groups check each ongoing event after each observation
for signs of anomalous behavior, and if their instantaneous analysis indicates that
it is worth doing so, they switch from survey mode to follow-up mode. This
approach led to the first reliable detection of a planetary microlensing event, OGLE
2003-BLG-235/MOA-2003-BLG-53 (Bond et al. 2004).
In the second approach, follow-up groups such as the Probing Lensing Anomalies
NETwork (PLANET; Albrow et al. 1998) and the Microlensing Follow-Up Network
(µFUN; Yoo et al. 2004b) monitor a subset of alerted events many times per day
3 OGLE Early Warning System: http://www.astrouw.edu.pl/˜ogle/ogle3/ews/ews.html4MOA Transient Alert Page: http://www.massey.ac.nz/˜iabond/alert/alert.html
15
and from locations around the globe. Generally these groups focus to the extent
possible on high-magnification events for the reasons stated above. The survey
groups can also switch from “survey mode” to “follow-up” mode to probe newly
emerging high-magnification events.
Over the past decade several high-magnification events have been analyzed
for planets. Gaudi et al. (2002) and Albrow et al. (2001) placed upper limits
on the frequency of planets from the analysis of 43 microlensing events, three
of which reached magnifications Amax ≥ 100, including OGLE-1998-BUL-15
(Amax = 170 ± 30), MACHO-1998-BLG-35 (Amax = 100 ± 5), and OGLE-1999-BUL-
35 (Amax = 125 ± 15). However, the first of these was not monitored over its peak.
MACHO-1998-BLG-35 was also analyzed by Rhie et al. (2000) and Bond et al.
(2002), who incorporated all available data and found modest (∆χ2 = 63) evidence
for one, or perhaps two, Earth-mass planets.
Yoo et al. (2004b) analyzed OGLE-2003-BLG-423 (Amax = 256 ± 43), which
at the time was the highest magnification event yet recorded. However, because
this event was covered only intermittently over the peak, it proved less sensitive to
planets than either MACHO-1998-BLG-35 or OGLE-1999-BUL-35.
Abe et al. (2004) analyzed MOA-2003-BLG-32/OGLE-2003-BLG-219, which at
Amax = 525 ± 75, is the current record-holder for maximum magnification. Unlike
OGLE-2003-BLG-423, this event was monitored intensively over the peak: the Wise
16
Observatory in Israel was able to cover the entire 2.5 hr FWHM during the very
brief interval that the bulge is visible from this northern site. The result is that this
event has the best sensitivity to low-mass planets to date.
Recently, Udalski et al. (2005) detected a ∼ 3-Jupiter mass planet by intensively
monitoring the peak of the high-magnification event OGLE-2005-BLG-071. This
was the second robust detection of a planet by microlensing and the first from
perturbations due to a central caustic.
2.1.2. Planet Detection Efficiencies: Philosophy and
Methods
The fundamental aim of microlensing planet searches is to derive meaningful
conclusions about the presence of planets (or lack thereof) from these searches.
Therefore, it is essential to quantitatively assess what planets could have been
detected from the observations of individual non-planetary events if such planets
had been present. Actually, this problem is not as easy to properly formulate as it
might first appear. For example, the event parameters are measured with only finite
precision. Among these, the impact parameter u0 (in units of the angular Einstein
radius θE) is particularly important: if the event really did have a u0 equal to its
best-fit value, then one could calculate whether a planet at a certain separation and
position angle would have given rise to a detectable signal in the observed light
17
curve. But the true value of u0 may differ from the best-fit value by, say, 1σ, and the
same planet may not give rise to a detectable signal for this other, quite plausible
geometry. (In principle, an error in the time of maximum, t0, would cause a similar
problem, displacing the assumed path through the Einstein ring by δt0/tE, where tE
is the Einstein crossing time. However, because u0 is strongly correlated with several
other parameters while t0 is not, the error in u0 is substantially larger than the error
in t0 divided by tE.) Or, as another example, consider finite-source effects. Planetary
perturbations have a fairly high probability of exhibiting finite-source effects, which
then have a substantial impact on whether the deviation can be detected in a
given data stream. If there is such a planetary perturbation, one can measure
ρ∗ = θ∗/θE, the size of the source relative to the Einstein radius. But if there is no
planet detected, no finite-source effects are typically detected, and hence there is
no direct information on ρ∗. Therefore, one cannot reliably determine whether a
given planetary perturbation would have been affected by finite-source effects and
so whether it would have been detected. Finally, there are technical questions as to
what exactly it means that a planet “would have been” detected.
The past decade of microlensing searches has been accompanied by a steady
improvement in our understanding of these questions. Gaudi & Sackett (2000)
developed the first method to evaluate detection efficiencies, which was later
implemented by Albrow et al. (2000b) and Gaudi et al. (2002). In this approach,
binary models are fitted to the observed data with the three “binary parameters”
18
(b, q, α) held fixed and the three “point-lens parameters” (t0, u0, tE) allowed to vary.
Here b is the planet-lens separation in units of θE, q is the planet-star mass ratio, α
is the angle of the source trajectory relative to the binary axis, t0 is the time of the
source’s closest approach to the center of the binary system, u0 = u(t0) is the impact
parameter, tE = θE/µ is the Einstein timescale, and µ is the source-lens relative
proper motion. If a particular (b, q, α) yielded a χ2 improvement ∆χ2 < χ2min = −60,
a planet could be said to be detected. If not, then the ensemble of (b, q, α) for which
∆χ2 > χ2min = 60 was said to be excluded for that event. For each (b, q), the fraction
of angles 0 ≤ α ≤ 2π that was excluded was designated the “sensitivity” for that
geometry.
Gaudi et al. (2002) argued that this method underestimated the sensitivity
because it allowed the fit to move u0 to values for which the source trajectory would
“avoid” the planetary perturbation but still be consistent with the light curve. That
is, u0 has some definite value, even if it were not known to the modelers exactly
what that value should be. Yoo et al. (2004b) followed up on this by holding u0
fixed at a series of values and estimated planetary detection efficiency at each. The
total efficiency would then be the average of these weighted by the probability of
each value of u0. In principle, one should also integrate over t0 and tE. In practice,
Yoo et al. (2004b) found that, at least for the event they analyzed, t0 and teff ≡ u0tE
were determined very well by the data, so that once u0 was fixed, so were t0 and tE.
19
Yoo et al. (2004b) departed from all previous planet-sensitivity estimates by
incorporating a Bayesian analysis that accounts for priors derived from a Galactic
model of the mass, distance, and velocity properties of source and lens population
into the analysis. They simulated an ensemble of events and weighted each by both
the prior probability of the various Galactic model parameters (lens mass, lens and
source distances, lens and source velocities) and the goodness of fit of the resulting
magnification profile to the observed light curve. This approach was essential
to enable a proper weighting of different permitted values of u0. As a bonus, it
allowed one, for the first time, to determine the sensitivities as a function of the
physical planetary parameters (such as planet mass mp and planet-star separation
r⊥) as opposed to the microlensing parameters, the planet-star mass ratio q and the
planet-star projected separation (in the units of θE) b.
Rhie et al. (2000) introduced a procedure for evaluating planet sensitivities that
differs qualitatively from that of Gaudi & Sackett (2000). For each trial (b, q, α) and
observed point-lens parameters (t0, u0, tE), they created a simulated light curve with
epochs and errors similar to those of the real light curve. They then fitted this light
curve to a point-lens model with (t0, u0, tE) left as free parameters. If the point-lens
model had ∆χ2 > χ2min, then this (b, q, α) combination was regarded as excluded.
That is, they mimicked their planet-detection procedure on simulated planetary
events.
20
Abe et al. (2004) carried out a similar procedure except that they did not fit for
(t0, u0, tE), but rather just held these three parameters fixed at their point-lens-fit
values. Of course, this procedure necessarily yields a higher ∆χ2 than that of Rhie
et al. (2000), but Abe et al. (2004) expected that the difference would be small.
While all workers in this field have recognized that finite-source effects are
important in principle, they have generally concluded that these did not play a
major role in the particular events that they analyzed. This has proved fortunate
because the source size is generally unknown, and even a single trial value for the
source size typically requires several orders of magnitude more computing time than
does a point-source model. Gaudi et al. (2002) estimated angular sizes θ∗ of each
of their 43 source stars from their positions on an instrumental color-magnitude
diagram (CMD) by adopting µ = 12.5 km s−1 kpc−1 for all events and evaluating
ρ∗ = θ∗/(µtE). They made their sensitivity estimates for both this value of ρ∗ and
for a point source (ρ∗ = 0) and found that generally the differences were small. They
concluded that a more detailed finite-source evaluation was unwarranted (and also
computationally prohibitive). Using their Monte Carlo technique, Yoo et al. (2004b)
were able to evaluate the probability distribution of the parameter combination z0,
which is equal to impact parameter over source size. This analysis showed that
z0 ≫ 1 (no finite-source effects) with high confidence for their event. This implied
that the source did not pass close to the central caustic and hence that finite-source
21
effects were not important. Again, computation for additional values of ρ∗ would
have been computationally prohibitive.
2.1.3. ”Seeing” the Lens in High-Magnification Events
In the very first paper on microlensing, Einstein (1936) already realized that it
might be difficult to observe the magnified source due to “dazzling by the light of
the much nearer [lens] star.”
Seventy years later, more than 2000 microlensing events have been discovered,
but only for two of these has the “dazzling” light of the lens star been definitively
observed. The best case is MACHO-LMC-5, for which the lens was directly imaged
by the Hubble Space Telescope (HST) (Alcock et al. 2001a; Drake et al. 2004), which
yielded mass and distance estimates of the lens that agreed to good precision with
those derived from the microlensing event itself (Gould et al. 2004).
The next best case is OGLE-2003-BLG-175/MOA-2003-BLG-45, for which
Ghosh et al. (2004) showed that the blended light was essentially perfectly aligned
with the source. This would be expected if the blend actually was the lens, but it
would be most improbable if it were just an ambient field star. In this case, the
blend was about 1 mag brighter than the baseline source in I and 2 mag brighter in
V , perhaps fitting Einstein’s criterion of a “dazzling” presence.
22
Intriguingly, the above two events positively identified to harbor luminous lenses
are both high-magnification. It is quite plausible that events with luminous lenses
are biased towards high magnification since they will more likely be missed if the
magnifications are too low. This raises the question of whether OGLE-2003-BLG-423
has a luminous lens. In addition, identifying the lens star would allow us to directly
determine the physical properties of the lens, which in turn would help better
constrain parameters in the planetary detection analysis.
Here we analyze OGLE-2004-BLG-343, whose maximum magnification
Amax ∼ 3000 is by far the highest of any observed event and the first to exceed the
A = 1000 benchmark initially discussed by Liebes (1964) as roughly the maximum
possible magnification for typical Galactic sources and lenses.5
As we describe below, the event was alerted as a possibly anomalous, very-
high-magnification event in time to trigger intensive observations over the peak, but
due to human error, the actual observations caught only the falling side of the peak.
We analyze both the actual observations made of this event (in order to evaluate
its actual sensitivity to planets) and the sequence of observations that should have
been initiated by the trigger. The latter calculation illustrates the potential of
5Liebes (1964) derived that for perfect lens-source alignment, Au=0 = 2θE/θ∗ by approximating
the source star as having uniform surface brightness, and he evaluated this expression for several
typical examples.
23
state-of-the-art microlensing observations, although unfortunately this potential was
not realized in this case.
We analyze the event within the context of the Yoo et al. (2004b) formalism
with several major modifications. First, we adopt the Rhie et al. (2000) criterion
of planet-sensitivity in place of that of Gaudi & Sackett (2000). That is, we say a
planet configuration is ruled out if simulated data generated by this configuration
are inconsistent with a point-lens light curve at ∆χ2 > ∆χ2min . Second, using a
Monte Carlo simulation, we show that for this event, z0 ∼ 1, and hence finite-source
effects are quite important. This requires us to generalize the Yoo et al. (2004b)
procedure to include a two-dimensional grid of trial parameters (u0, ρ∗) in place
of the one-dimensional u0 sequence used by Yoo et al. (2004b). From what was
said above, it should be clear that the required computations would be completely
prohibitive if they were carried out using previous numerical techniques. Therefore,
third, we develop new techniques for finite-source calculations that are substantially
more efficient than those used previously.
In § 2.2, we describe the data. Next, we discuss modeling of the event in § 2.3.
Then in § 2.4, we present our procedures and results related to planet detection.
We explore the possibility that the blended light is due to the lens in § 2.5. In
§ 2.6 we summarize our results and make suggestions on monitoring extremely
high-magnification events in the future. Finally, the two new binary-lens finite-source
algorithms that we have developed are described in Appendix A.
24
2.2. Observational Data
The first alert on OGLE-2004-BLG-343 was triggered by the OGLE-III Early
Warning System (EWS; Udalski 2003) on 2004 June 16, about 3 days before its
peak on HJD′ ≡ HJD−2450000 = 3175.7467. On June 18, after the first observation
of the lens was taken, the OGLE real time lens monitoring system (Early Early
Warning System [EEWS]; Udalski 2003) triggered an internal alert, indicating a
deviation from the single lens light curve based on previous data. Two additional
observations were made after that, but the new fits to all of the data were still fully
compatible with single-mass lens albeit suggesting high magnification at maximum.
Therefore, an alert to the microlensing community was distributed by OGLE on
HJD′ = 3175.1 suggesting OGLE-2004-BLG-343 as a possible high-magnification
event. The observation at UT 0:57 (HJD′ 3175.54508) the next night showed a
large deviation from the light-curve prediction based on previous observations, and
an internal EEWS alert was triggered again. Usually further observations would
have been made soon after such an alert, but unfortunately no observations were
performed until UT 6:29 (HJD′ 3175.77626), about 0.71 hr after the peak. At that
time, the event had brightened by almost 3 mag in I relative to the previous night’s
observation, and therefore it was regarded very likely to be the first caustic crossing
of a binary-lens event. As a consequence, no V -band photometry was undertaken
to save observation time and in the hope that observations in V could be done
when it brightened again. After two post-peak observations confirmed the event’s
25
extremely high magnification, OGLE began maximally intensive observations with
a cadence of 4.3 minutes. However, after it was clear that the event was falling in
a regular fashion, it was then observed less intensively. A total of 14 observations
were performed during 3.39 hr, and a new alert to the microlensing community was
immediately released by OGLE as well. However, the next day the event faded
drastically, by about 3 mag from the maximum point of the previous night, implying
that if the event were a binary, the peak had probably been a cusp crossing rather
than a caustic crossing. After being monitored for a few more days, it became
clearer that OGLE-2004-BLG-343 was most probably a point-lens event of very high
magnification and therefore very sensitive to planets. This recognition prompted
OGLE to obtain a V -band point, but by this time (HJD′ 3179.51) the source had
fallen 6 mag from its peak, so that only a weak detection of the V flux was possible.
Hence, this yielded only a lower limit on the V − I color.
By chance, µFUN made one dual-band observation in I and H 1 day before peak
(HJD′ 3174.74256) solely as a reference point to check on the future progress of the
event. After the event peaked, µFUN also concluded that it was uninteresting until
OGLE/µFUN email exchanges led to the conclusion that the event was important.
Since the source was magnified by A∼40 at this pre-peak µFUN observation, it
enabled a clear H-band detection and so yielded an (I − H) color measurement,
which can be translated to (V − I).
26
The OGLE data are available at the OGLE EWS Web site mentioned above
and the µFUN data are available at the µFUN Web site 6.
There were 195 images in I and eight images in V , both from OGLE, as well
as three images in H from µFUN. Since only OGLE I-band observational data are
available near the peak, the following analysis is entirely based on the OGLE I-band
data except that the OGLE V -band data and µFUN H-band data are used to
constrain the color of the source star. The OGLE errors are renormalized by a factor
of 1.42 so that the χ2 per degree of freedom for the best-fit point-source/point-lens
(PSPL) model is close to unity. We also eliminate the two OGLE points that are 3σ
outliers. These are both well away from the peak and therefore their elimination
has no practical impact on our analysis.
2.3. Event Modeling
Yoo et al. (2004b) introduced a new approach to model microlensing events
for which u0 is not perfectly measured. As distinguished from previous analyses,
this method establishes the prior probability of the event parameters by performing
a Monte-Carlo simulation of the event using a Galactic model rather than simply
assuming uniform distributions. This approach is not only more realistic but also
makes possible the estimation of physical parameters, which are otherwise completely
6http://www.astronomy.ohio-state.edu/˜microfun
27
degenerate. Following the procedures of Yoo et al. (2004b), we begin our modeling
by fitting the event to a PSPL model, evaluating the finite-source effects and
performing a Monte-Carlo simulation. We then improve that method by considering
finite-source effects when combining the simulation with the light-curve fits.
2.3.1. Point-Source Point-Lens Model
The PSPL magnification is given by (Paczynski 1986)
A(u) =u2 + 2
u√
u2 + 4, u(t) =
√
√
√
√u20 +
(t − t0)2
t2E, (2.1)
where u is the projected lens-source separation in units of the angular Einstein radius
θE, t0 is the time of maximum magnification, u0 = u(t0) is the impact parameter,
and tE is the Einstein timescale.
The predicted flux is then
F (t) = FsA[u(t)] + Fb, (2.2)
where Fs is the source flux and Fb is the blended-light flux.
The observational data are fitted in the above model with five free parameters
(t0, u0, tE, Fs, and Fb). The results of the fit are shown in Table 2.6 (also see
Fig. 2.1). The best-fit u0 is remarkably small, u0 = 0.000333 ± 0.000121, which
indicates that the maximum magnification is Amax = 3000 ± 1100. As discussed
28
below in § 2.3.3, the 3σ lower limit is Amax ∼> 1450. This is the first microlensing
event ever analyzed in the literature with peak magnification higher than 1000. The
uncertainties in u0, tE and Fs are fairly large, roughly 35%. As pointed out in Yoo
et al. (2004b), these errors are correlated, while combinations of these parameters,
teff ≡ u0tE and Fmax ≡ Fs/u0, have much smaller errors:
teff = 0.0141 ± 0.0008 days, Imin = 13.805 ± 0.065. (2.3)
Here Imin is the calibrated I-band magnitude corresponding to Fmax.
2.3.2. Source Properties from Color-Magnitude Diagram
It is by now standard practice to determine the dereddened color and magnitude
of a microlensed source by putting the best-fit instrumental color and magnitude of
the source on an instrumental (I, V − I) CMD. The dereddened color and magnitude
can then be determined from the offset of the source position from the center of the
red clump, which is locally measured to be [MI , (V − I)0] = (−0.20, 1.00). We adopt
a Galactocentric distance R0 = 8 kpc. However, at Galactic longitude l = +4.21, the
red clump stars in the OGLE-2004-BLG-343 field are closer to us than the Galactic
center by 0.15 mag (Stanek et al. 1997). We derive (I, V − I)0,clump = (14.17, 1.00).
Although the source instrumental color and magnitude are both fit parameters,
only the magnitude is generally strongly correlated with other fit parameters. By
contrast, the source instrumental color can usually be determined directly by a
29
regression of V on I flux as the magnification changes. No model of the event is
actually required to make this color determination. In the present case, we exploit
both (V − I) and (I − H) data. Hence, in order to make use of this technique, we
must convert the (I − H) to (V − I). This will engender some difficulties.
As discussed in § 2.2, however, V -band measurements were begun only
when the source had fallen nearly to baseline. Hence, the measurement of
the (V − I) color obtained by this standard procedure has very large errors
and indeed is consistent with infinitely red (Fs,V = 0) at the 2σ level (see
Fig. 2.2). The CMD itself is based on OGLE-II photometry, and we have therefore
shifted the OGLE-III-derived fluxes by ∆I = IOGLE−II − IOGLE−III = 0.26 mag.
On this now calibrated CMD, the clump is at (I, V − I)clump = (15.51, 2.04).
Hence, the dereddened source color and magnitude are given by
(I, V − I)0 = (I, V − I)+ (I, V − I)0,clump − (I, V − I)clump = (I, V − I)− (1.34, 1.04),
the final offset being the reddening vector. This vector corresponds to
RV I = 1 + 1.34/1.04 = 2.29, which is somewhat high compared to values
obtained by Sumi (2004) for typical bulge fields. However, we will present below
independent evidence for this or a slightly higher value of RV I . Figure 2.2 also shows
the position of the blended light, which lies in the so-called reddening sequence of
foreground disk main sequence stars. This raises the question as to whether this
blended star is actually the lens. We return to this question in § 2.5.
30
The source star is substantially fainter than any of the other stars in the
OGLE-II CMD. In order to give a sense of the relation between this source CMD
position and those of main-sequence bulge stars, we also display the Hipparcos main
sequence (ESA 1997), placed at 10−0.15/5R0 = 7.5 kpc and reddened by the reddening
vector derived from the clump. At the best-fit value, V − I = 3.09, the source lies
well in front of (or to the red of) the bulge main-sequence. However, given the large
color error, it is consistent with lying on the bulge main sequence at the 1 σ level.
To obtain additional constraints on the color, we consider the µFUN
instrumental H-band data. The single highly-magnified (A ∼ 40) H-band point
(together with a few baseline points) yields IOGLE−II − HµFUN = 0.59 ± 0.11 source
color. To be of use, this must be translated to a (V − I)OGLE−II color using a
(V − I)/(I − H) color-color diagram of the stars in the field.
Unfortunately, there are actually very few field stars in the appropriate color
range. This partly results from the small size (∼ 2 arcmin2) of the H-band image
and partly from the fact that a large fraction of stars are either too faint to
measure in V -band or saturated in H-band. We therefore calibrate the µFUN
H-band data by aligning them to Two Micron All Sky Survery (2MASS) data and
generate a (V − I)/(I − H2MASS) color-color diagram by matching stars from the
2MASS H-band data with OGLE-II V, I photometry in a larger field centered on
OGLE-2004-BLG-343. We find that (H2MASS − HµFUN) = −1.99 ± 0.01 from 48
stars in common in the field, with a scatter of 0.08 mag. We transform the above
31
I−HµFUN color to I−H2MASS and plot it as a vertical line on a (V −I)/(I−H2MASS)
color-color diagram (see Fig. 2.3). From the intersection of the vertical line with the
diagonal track of stars in the field, we infer V − I = 2.40 ± 0.15.
Since the field stars used to make the alignment are giants, this transformation
would be strictly valid only if the source were a giant as well. However, the source
star is certainly a dwarf (see Fig. 2.2). After transforming 2MASS to standard
infrared bands (Carpenter 2001), we use the data from Tables II and III of Bessell &
Brett (1988) to construct dwarf and giant tracks on a (V − I)0/(I − H)0 color-color
diagram. These are approximately coincident for blue stars (I − H)0 < 1.6 but
rapidly separate by 0.28 mag in (V − I)0 by (I − H)0 = 1.7. In principle, we should
just adjust our estimate (V − I) by the difference between these two tracks at the
dereddened (I − H)0 color of the source. Unfortunately, there are two problems
with this seemingly straightforward procedure. First, the Bessell & Brett (1988)
giant track displays a modest deviation from its generally smooth behavior close
to the color of our source, a deviation that is not duplicated by either the giants
in our field or the color-color diagram formed by combining Hipparcos and 2MASS
data, which both show the same smooth behavior at this location. Second, if the
Hipparcos/2MASS diagram or the Bessell & Brett (1988) diagram is reddened using
the selective and total extinctions determined above from the position of the clump,
then the giant tracks do not align with our field giants. To obtain alignment, one
32
must use RV I = 2.4.7 The conflict among these three determinations of RV I (1.9–2.1
[Sumi 2004], 2.29 [clump], ∼ 2.4 [Gould et al. 2001]) is quite a puzzle, but not one
that we can explore here.
The bottom line is that there is considerable uncertainty in the dwarf-minus-
giant adjustment but only in the upward direction. To take account of this,
we add 0.2 mag error in quadrature to the upward error bar and finally adopt
V − I = 2.4+0.25−0.15 for the indirect color determination via the (I − H) measurement.
Finally, we combine this with the direct measurement of V − I = 3.09 based on
the combined V and I light curve. Because the errors on the latter measurement
are extremely large (and are Gaussian in flux rather than magnitudes), we
determine the probability distribution for the combined determination numerically
in a flux-based calculation and then convert to magnitudes. We finally find
V − I = (V − I)best ± σ(V − I) = 2.60 ± 0.20, which we show as a magenta point in
Figure 2.2. Hence,
(V − I)0 = 1.56 ± 0.20. (2.4)
7 Gould et al. (2001) found a similar value using the same method but a different data set.
However, the RV I we obtained at the beginning of this section is based on the dereddened magnitude
of the red clump, which depends on the distance to the Galactic center R0. If we were to adopt the
new geometric measurement of R0 = 7.62 kpc (Eisenhauer et al. 2005), rather than the standard
value of R0 = 8.0 kpc, we would then have I0,clump = 14.07, which would give RV I = 2.39. However
this value conflicts still more severely with the typical values of RV I = 1.9–2.1 in bulge fields found
by Sumi (2004).
33
In contrast to most microlensing events that have been analyzed for planets,
the color of OGLE-2004-BLG-343 is fairly uncertain. The color enters the analysis
in two ways. First, it indicates the surface brightness and so determines the
relation between dereddened source flux and angular size. Second, it determines the
limb-darkening coefficient.
Given the color error, we consider a range of colors in our analysis and integrate
over this range, just as we integrate over a range of impact parameters u0 and source
sizes (normalized to θE) ρ∗. We allow colors over the range 2.2 < (V − I) < 3.0
corresponding to 1.16 < (V − I)0 < 1.96. We integrate in steps of 0.1 mag. For each
color, we adopt a surface brightness such that the source size θ∗ is given by
θ∗ = θ(V −I) 10−0.2(I−Ibest), (2.5)
where I is the (reddened) apparent magnitude in the model, Ibest = 22.24, and
θ2.2 . . . θ3.0 = (0.350, 0.371, 0.391, 0.421, 0.466, 0.515, 0.546, 0.580, 0.615)µas. These
values are derived from the color/surface-brightness relations for dwarf stars given
in Kervella et al. (2004) using the method as described in Yoo et al. (2004a). In our
actual calculations, we use the full distribution of source radii, but for reference we
note that the 1σ range of this quantity is
θ∗ = 0.47 ± 0.13µas. (2.6)
34
We find from the models of Claret (2000) and Hauschildt et al. (1999) that the
linear limb-darkening coefficients for dwarfs in our adopted color range vary by only
a few hundredths. Therefore, for simplicity, we adopt the mean of these values
ΓI = 0.50 (2.7)
for all colors. This corresponds to c = 3Γ/(2 + Γ) = 0.60 in the standard
limb-darkening parameterization (Afonso et al. 2000).
Finally, each model specifies not only a color and magnitude for the source, but
also a source distance. Evaluation of the likelihood of each specific combination of
these requires a color-magnitude relation. We adopt (Reid 1991)
MI = 2.37(V − I)0 + 2.89 (2.8)
with a scatter of 0.6 mag. The ridge of this relation is shown as a red line segment in
Figure 2.2, with the sources placed at 10−0.15/5R0 = 7.5 kpc and reddened according
to the red-clump determination, just as was done for the Hipparcos stars. This track
is in reasonable agreement with the Hipparcos stars.
2.3.3. Finite-Source Effects
Yoo et al. (2004b) define z0 ≡ u0/ρ∗ (where ρ∗ = θ∗/θE is the angular size
of the source θ∗ in units of θE), which is a useful parameter to characterize the
35
finite-source effects in single-lens microlensing events. We fit the observational data
to a set of point-lens models on a grid of (u0, z0) and then compare the resulting χ2
with the best-fit PSPL model. As mentioned in § 2.3.2, we adopt the limb-darkening
formalism of Yoo et al. (2004b) and for simplicity choose ΓI = 0.50.
Figure 2.4 displays the resulting ∆χ2 contours. It shows that the 1 σ contour
extends from z0 ≃ 0.2 to arbitrarily large z0. This is qualitatively similar to
OGLE-2003-BLG-423 as analyzed in Yoo et al. (2004b). However, as we demonstrate
in § 2.3.4, the range of z0 that is consistent with the Galactic model is quite different
for these two events. This is to be expected since z0 = u0/ρ∗ = (u0/θ∗)θE, and u0/θ∗
is roughly 8 times smaller for this event, while θE is generally of the same order.
Figure 2.4 shows contours for both Amax and u−10 . For z0 ≡ u0/ρ∗ > 2, these
are very similar, which is expected because in the absence of finite-source effects
(and for u0 ≪ 1), Amax = u−10 . Note that the Amax contours are roughly rectangular,
so that while z0 is not well constrained, the 3σ lower limit Amax > 103.16 ∼ 1450 is
quite well defined. This shows that although the blending is very severe, it is also
very well constrained, implying that the event’s high magnification is secure.
2.3.4. Monte-Carlo Simulation
We perform a similar Monte-Carlo simulation using a Han & Gould (1996, 2003)
model as described in Yoo et al. (2004b) by taking into account all combinations of
36
source and lens distances, Dl < Ds, uniformly sampled along the line of sight toward
the source (l, b) = (4.21,−3.47). The simulation adopts the Gould (2000a) mass
function taking into account the bulge main sequence stars, white dwarfs (distributed
around 0.6 M⊙), neutron stars (narrowly peaked at 1.35 M⊙), and stellar-mass black
holes. This mass function is adequate to describe mass distributions of disk lenses
except that the disk contains stars with masses greater than 1 M⊙ while the bulge
does not. However, a disk main sequence star more massive than the Sun will be
too bright to be the lens star for this event (see Fig. 2.2); so for simplicity, we use
this mass function for both disk and bulge in our simulation. In Yoo et al. (2004b),
the source flux is determined from the tE for each Monte-Carlo event since only the
PSPL model is considered at this step. However, when finite-source effects are taken
into account, each tE corresponds to a series of Fs depending on the source size ρ∗,
so there is no longer a 1-1 correspondence between Fs and tE. As discussed in Yoo
et al. (2004a), θ∗ can be deduced from the source’s dereddened color and magnitude.
Since θE is known for each simulated event, ρ∗ is a direct function of Fs and the
(V − I) color of the source,
ρ∗ =θ(V −I)
θE
√
Fs
Fbest
, (2.9)
where Fbest corresponds to Ibest in equation (2.5). Using this constraint, we fit the
k-th Monte-Carlo event to a point-lens model with finite-source effects, holding tE,k
fixed at the value given by the simulation, for a variety of (V − I) color values
37
inferred from § 2.3.2. Hence, for the j-th (V − I) color and k-th Monte-Carlo
event, we have best-fit single-lens light-curve parameters t0,j,k, u0,j,k, ρ∗,j,k, Fs,j,k,
Fb,j,k, as well as ∆χ2j,k ≡ χ2
j,k − χ2PSPL. We construct a three-dimensional table that
includes these six quantities as well as the other parameters from the Monte-Carlo
simulation (tE,k, θE,k, Ds,k, Dl,k, Ml,k, Γk), the Einstein timescale and radius, the
source and lens distances, the lens mass, and the event rate. From these can also
be derived two other important quantities, the source absolute magnitude MI,j,k
and the physical Einstein radius rE,k ≡ θE,k × Dl,k. This three-dimensional table
is composed of nine two-dimensional tables, one for each (V − I)j color. Each
table contains approximately 200,000 rows, one for each simulated event. To make
the notation more compact, we refer to the parameters a, b, c, ... lying in the bin
a ∈ [amin, amax]; b ∈ [bmin, bmax]; c ∈ [cmin, cmax]... as bin(a, b, c, ...).
Similarly to Yoo et al. (2004b), the posterior probability of ai lying in bin(ai) is
given by
P [bin(ai)] ∝∑
j,k
(PV −I)j × (PReid)j,k × exp[−∆χ2j,k/2] (2.10)
×BC[bin(aij,k)] × Γk,
where (PReid)j,k = exp(−(MI)j,k −MI,Reid[(V − I)0,j]2/2[σ2Reid + (σMI
)2j,k]) accounts
for the scatter (σReid = 0.6) in MI about the Reid relation plus the dispersion
(σMI)j,k from light-curve fitting, (PV −I)j = exp−[(V − I)j − (V − I)best]
2/2σ2(V −I)
38
reflects the uncertainty in V − I color, and BC is a boxcar function defined by
BC[bin(a)] ≡ Θ (a − amin) × Θ (amax − a).
Figure 2.5 shows the posterior probability distributions of various parameters,
including u0, dereddened apparent I-band magnitude of the source I0, proper motion
µ, z0, source distance modulus, lens distance modulus, absolute I-band magnitude
of the source MI , and lens mass. The blue and red histograms represent bulge-disk
events and bulge-bulge events, respectively. The relative event rate is normalized
in the same way for both bulge-disk and bulge-bulge events. The total rate for
bulge-disk events is about 6 times larger than that for bulge-bulge events, which
means that the Monte-Carlo simulation tends strongly to favor bulge-disk events.
The Einstein radii are on average smaller for bulge-bulge events than for
bulge-disk events, and as a result, the bulge-bulge events tend to have bigger ρ∗
and hence smaller z0. However, the top right panel of Figure 2.5 shows that the z0
probability distributions have similar shapes for both bulge-bulge and bulge-disk
events. This is because the (lack of) finite-source effects constrain z0 ∼> 0.7 at
the 3σ level (see Fig. 2.4), which cuts off the lower end of the z0 distributions for
both categories of events. Since bulge-disk events have smaller ρ∗ than bulge-bulge
events, the u0 posterior probability distribution peaks at a lower value for the
former. Furthermore, since z0 ∼> 0.7, the proper motion is constrained to be
µ = θ∗z0/teff ∼> 7mas/yr, which is typical of bulge-disk events but ∼> 2 times the
proper motion of typical bulge-bulge events.
39
Figure 2.5 also shows the distributions of u0 and I0 from the light-curve
data alone by a black solid line. In strong contrast to the corresponding diagrams
for OGLE-2003-BLG-423 presented by Yoo et al. (2004b), the light-curve based
parameters agree quite well with the Monte-Carlo predictions. In the source
distance-modulus panel, the prior distributions for bulge-disk and bulge-bulge
events are shown in purple and green histograms, respectively. Again, distinct
from OGLE-2003-BLG-423, the most likely source distances of this event agree
reasonably well with typical values from the prior distributions. Moreover, from
Figure 2.5, the peak values of source MI distributions are also in good agreement
with those derived from the Reid relation (MI = 6.59 for [V − I]0 = 1.56, see
eq. [2.8]). Therefore, the source of this event shows very typical characteristics as
represented by the Monte-Carlo simulation. Also unlike OGLE-2003-BLG-423, the
probability that z0 ∼< 1 is very high for both bulge-disk and bulge-bulge events.
Therefore, finite-source effects must be taken into account in the analysis of this
event. In addition to the posterior probability distribution (orange) of the lens
mass, the prior distribution (dark green) is displayed in Figure 2.5 as well. In
microlensing analyses, the lens mass is commonly assigned a “typical” value (for
example, 0.3 M⊙). However, Figure 2.5 shows that, lenses with relatively high mass
are strongly favored for this event as compared to the prior distribution. Detailed
discussions on the lens properties are presented in § 2.5.
40
2.4. Detecting Planets
While there are no obvious deviations from point-lens behavior in the light
curve of OGLE-2004-BLG-343 at our adopted threshold of ∆χ2min = 60, planetary
deviations might be difficult to recognize by eye. We must therefore conduct a
systematic search for such deviations. Logically, this search should precede the
second step of testing to determine what planets we could have detected had they
been there. However, as a practical matter it makes more sense to first determine
the range of parameter space for which we are sensitive to planets because it is
only this range that needs to be searched for planets. We therefore begin with this
detection efficiency calculation.
2.4.1. Detection Efficiency
As reviewed in § 2.1.2, a variety of methods have been proposed to calculate
the planetary sensitivities of microlensing events, either in predicting planetary
detection efficiencies theoretically or in analyzing real observational data sets. In
those methods, ∆χ2 is often calculated by subtracting the χ2 of single-lens models
from that of the binary-lens models to evaluate detection sensitivities. However,
the ways in which single-lens and binary-lens models are compared differ from
study to study. As noted by Griest & Safizadeh (1998) and Gaudi & Sackett
(2000), for real planetary light curves, the lens parameters are not known a priori.
41
Therefore, ∆χ2 will generally be exaggerated if one subtracts from the binary lens
model the single-lens model that has the same t0, u0, and tE instead of the best-fit
single-lens model to the binary light curve. One important factor contributing to this
exaggeration is that the center of the magnification pattern (referred to as the center
of the caustic in Yoo et al. 2004b) in the binary-lens case is no longer the position of
the primary star as it is in the single-lens model (Dominik 1999b; An & Han 2002).
Therefore light-curve parameters such as u0 and t0 will shift correspondingly. We
find that by not taking into account this effect and directly comparing the simulated
binary (i.e., planetary) light curve with the best-fit single-lens model to the data,
Abe et al. (2004) exaggerate the planetary sensitivity of MOA 2003-BLG-32/OGLE
2003-BLG-219, although it remains the most sensitive event analyzed to date (see
Appendix B).
Following Gaudi & Sackett (2000), planetary systems are characterized by
a planet-star mass ratio q, planet-star separation in Einstein radius b, and the
angle α of the source trajectory relative to the planet-star axis. In our binary-lens
calculations, u0 and t0 are defined with respect to the center of magnification
discussed above. According to Gaudi & Sackett (2000), the next step is to fit the
data to both PSPL models and binary-lens models with a variety of (b, q, α) and
calculate ∆χ2(b, q, α) = χ2(b, q, α) − χ2PSPL. Then ∆χ2(b, q, α) is compared with a
threshold value χ2thres: if ∆χ2 > χ2
thres then a planet with parameters b, q, and α is
claimed to be excluded while it is detected if ∆χ2 < −χ2thres. The (b, q) detection
42
efficiency is then obtained by integrating Θ(∆χ2 − ∆χ2thres) over α in the exclusion
region at fixed (b, q), where Θ is a step function. However, Gaudi et al. (2002)
point out that for events with poorly constrained light-curve parameters, which is
the case for OGLE-2004-BLG-343, this method will significantly underestimate the
sensitivities since the binary-lens models will minimize the χ2 over the relatively
large available parameter space. As discussed in Yoo et al. (2004b), the detection
efficiency should be evaluated at a series of allowed u0 values. To take finite-source
effects into account, we generate a grid of permitted (u0, ρ∗), and each (u0,m, ρ∗,m) bin
is associated with the probability P [bin(u0,m, ρ∗,m)] obtained using the following
equation:
P [bin(u0,m, ρ∗,m)] ∝∑
j,k
Pm,j,k (2.11)
where
Pm,j,k = (PV −I)j × (PReid)j,k × exp[−∆χ2j,k/2] × BC[bin(u0,mj,k)]
×BC[bin(ρ∗,mj,k)] × Γk (2.12)
If the light-curve parameters were well-constrained, the approaches of Gaudi
& Sackett (2000) and Rhie et al. (2000) would be very nearly equivalent, with the
former retaining a modest philosophical advantage, since it uses only the observed
light curve and does not require construction of light curves for hypothetical events.
43
However, because in our case these parameters are not well constrained, the Gaudi
& Sackett (2000) approach would require integration over all binary-lens parameters
(except Fs and Fb). Regardless of its possible philosophical advantages, this approach
is therefore computationally prohibitive in the present case. We therefore do not
follow Gaudi & Sackett (2000), but instead construct a binary light curve with the
same observational time sequence and photometric errors as the OGLE observations
of OGLE-2004-BLG-343, for each (b, q, α ; u0, ρ∗) combination and the associated
probability-weighted parameters alc: t0, tE, Fs and Fb in the m-th (u0, ρ∗) bin,
alc(weighted),m =
∑
j,kPm,j,kalc,j,k
∑
j,kPm,j,k
. (2.13)
Then each simulated binary light curve (b, q, α ; u0,m, ρ∗,m) is fitted to a
single-lens model with finite-source effects whose best fit yields χ2(b, q, α ; u0,m, ρ∗,m).
Another set of artificial binary light curves is generated under the assumption
that OGLE had triggered a dense series of observations following the internal
alert at HJD′ 3175.54508. These cover the peak of the event with the normal
OGLE frequency and are used to compare results with those obtained from the real
observations.
Magnification calculations for a binary lens with finite-source effects are very
time-consuming. Besides (b, q, α), our calculations are also performed on (u0, ρ∗)
grids, two more dimensions than in any previous search of a grid of models with
finite-source effects included. This makes our computations extremely expensive,
44
comparable to those of Gaudi et al. (2002), which equaled several years of processor
time. Therefore, we have developed two new binary-lens finite-source algorithms to
perform the calculations, as discussed in detail in Appendix A.
In principle, we should consider the full range of b, i.e., 0 < b < ∞; in practice,
it is not necessary to directly simulate b < 1 due to the famous b ↔ b−1 degeneracy
(Dominik 1999a; An 2005). Instead, we just map the b > 1 results onto b < 1
except for the isolated sensitive zones along the x−axis caused by planetary caustics
perturbations.
We define the planetary detection efficiency ǫ(b, q) as the probability that an
event with the same characteristics as OGLE-2004-BLG-343, except that the lens is
a planetary system with configuration of (b, q), is inconsistent with the single-lens
model (and hence would have been detected),
ǫ(b, q, α) = ∑
m
Θ[
χ2(b, q, α ; u0,m, ρ∗,m) − ∆χ2thres
]
× P [bin(u0,m, ρ∗,m)]
×∑
m
P [bin(u0,m, ρ∗,m)]−1 (2.14)
and
ǫ(b, q) =1
2π
∫ 2π
0ǫ(b, q, α)dα. (2.15)
45
2.4.2. Constraints on Planets
Figure 2.6 shows the planetary detection efficiency of OGLE-2004-BLG-343
for planets with mass ratios q = 10−3, 10−4, and 10−5, as a function of b, the
planet-star separation (normalized to θE), and α, the angle that the moving source
makes with the binary axis passing the primary lens star on its left. Different colors
indicate 10%, 25%, 50%, 75%, 90% and 100% efficiency. Note that the contours are
elongated along an axis that is roughly 60 from the vertical (i.e., the direction of
the impact parameter for α = 0). This reflects the fact that the point closest to
the peak occurs at t = 2453175.77626 when (t − t0)/tE = 2.16u0, and so when the
source-lens separation is at an angle tan−1 2.16 = 65. For q = 10−3, the region
of 100% efficiency extends through 360 within about one octave on either side of
the Einstein ring. However, at lower mass ratios there is 100% efficiency only in
restricted areas close to the Einstein ring and along the above-mentioned principal
axis.
Figure 2.7 summarizes an ensemble of all figures similar to Figure 2.6, but with
q ranging from 10−2.5 to 10−5.0 in 0.1 increments. To place this summary in a single
figure, we integrate over all angles α at fixed b. Comparison of this figure to Figure 8
from Gaudi et al. (2002) shows that the detection efficiency of OGLE-2004-BLG-343
is similar to that of MACHO-1998-BLG-35 and OGLE-1999-BUL-35 despite the fact
that their maximum magnifications are Amax ∼ 100, roughly 30 times lower than
46
OGLE-2004-BLG-343. Of course, part of the reason is that OGLE-2004-BLG-343
did not actually probe as close as u = u0 ∼ 1/3000 because no observations were
taken near the peak. However, observations were made at u ∼ 1/1200, about 12
times closer than in either of the two events analyzed by Gaudi et al. (2002). One
problem is that because the peak was not well covered, there are planet locations
that do not give rise to observed perturbations at all. But this fact only accounts for
the anisotropies seen in Figure 2.6. More fundamentally, even perturbations that do
occur in the regions that are sampled by the data can often be fitted to a point-lens
light curve by “adjusting” the portions of the light curve that are not sampled.
Note the central “spike” of reduced detection efficiency plots near b = 1. As
first pointed out by Bennett & Rhie (1996), this is due to the extreme weakness of
the caustic for nearly resonant (b ∼ 1) small mass-ratio (q ≪ 1) binary lenses.
2.4.3. No Planet Detected
Based on the detection efficiency levels we obtained in § 2.4.2, we fit the
observational data to binary-lens models to search for a planetary signal in the
regions with efficiency greater than zero from q = 10−5 to q = 10−2.5. We find
no binary-lens models satisfying our detection criteria. In fact, the total χ2
contributions to the best-fit single-lens model of the observational points over the
peak ( HJD = 2453175.5 − 2453176.0) are no more than 30, so even if all of these
47
deviations were due to a planetary perturbation, such a binary-lens solution would
not easily satisfy our ∆χ2 = 60 detection criteria. Therefore there are no planet
detections in OGLE-2004-BLG-343 data.
2.4.4. Fake Data
Partly to explore further the issue of imperfect coverage of the peak, and partly
to understand how well present microlensing experiments can probe for planets, we
now ask what would have been the detection efficiency of OGLE-2004-BLG-343 if
the internal alert issued on HJD′ 3175.54508 had been acted upon.
Of course, since the peak was not covered, we do not know exactly what u0
and ρ for this event are. However, for purposes of this exercise, we assume that they
are near the best fit as determined from a combination of the light-curve fitting and
the Galactic Monte Carlo, and for simplicity, we choose u0 = 0.00040, ρ∗ = 0.00040
which is very close to the best-fit combination. We then form a fake light curve
sampled at intervals of 4.3 minutes, starting from the alert and continuing to the
end of the actual observations that night. This sampling reflects the intense rate
of OGLE follow-up observations actually achieved during this event (see § 2.2). We
assume errors similar to those of the actual OGLE data at similar magnifications.
For those points that are brighter than the brightest OGLE point, the minimum
actual photometric errors are assigned. We also assume that the color information is
48
known exactly in this case to be V − I = 2.6. We then analyze these fake data in
exactly the same way that we analyze the real data. In contrast to the real data,
however, we do not find a finite range of z0 ≡ u0/ρ∗ that are consistent with the fake
data. Rather, we find that all consistent parameter combinations have z0 = 1 almost
identically. We therefore consider only a one-dimensional set of (u0, ρ∗) combinations
subject to this constraint.
Figure 2.8 is analogous to Figure 2.6 except that the panels show planet
sensitivities for q = 10−3, 10−4, 10−5, and 10−6, that is, an extra decade. In sharp
contrast to the real data, these sensitivities are basically symmetric in α, except for
the lowest value of q. Sensitivities at all mass ratios are dramatically improved.
For example, at q = 10−3, there is 100% detection efficiency over 1.7 dex in b
(1/7 ∼< b ∼< 7). Even at q = 10−5 (corresponding to an Earth-mass planet around an
M star), there is 100% efficiency over an octave about the Einstein radius.
Figure 2.9 is the fake-data analog of Figure 2.7. It shows that this event would
have been sensitive to extremely low mass-ratios, lower than those accessible to any
other technique other than pulsar timing.
2.4.5. Detection Efficiency in Physical Parameter Space
One of the advantages of the Monte Carlo approach of Yoo et al. (2004b) is
that it permits one to evaluate the planetary detection efficiency in the space of
49
the physical parameters, planet mass and projected physical separation (mp, r⊥),
rather than just the microlensing parameters (b, q). Figures 2.10 and 2.11 show
this detection efficiency for the real and fake data, respectively. The fraction
of Jupiter-mass planets that could have been detected from the actual data
stream is greater than 25% for 0.8AU ∼< r⊥ ∼< 10AU and is greater than 90% for
2AU ∼< r⊥ ∼< 6AU. There is also marginal sensitivity to Neptune-mass planets.
However, the detection efficiencies would have been significantly enhanced had the
FWHM around the peak been observed, as previously discussed by Rattenbury et
al. (2002). For the fake data, more than 90% of Jupiter-mass planets in the range
0.7AU ∼< r⊥ ∼< 20AU and more than 25% with 0.3AU ∼< r⊥ ∼< 30AU would have
been detected. Indeed, some sensitivity would have extended all the way down to
Earth-mass planets.
2.5. Luminous Lens?
Understanding the physical properties of their host stars is a major component
of the study of extra-solar planets. It is especially important to know the mass and
distance of the lens star for planets detected by microlensing because only then
can we accurately determine the planet’s mass and physical separation from the
star. Obtaining similar information for microlensing events that are unsuccessfully
searched for planets enables more precise estimates of the detection efficiency. There
50
are only two known ways to determine the mass and distance of the lens: either
measure both the microlensing parallax and the angular Einstein radius (which are
today possible for only a small subset of events) or directly image the lens. In most
cases the lens is either entirely invisible or is lost in the much brighter light of the
source.
A simple argument suggests, however, that in extremely high-magnification
events like OGLE-2004-BLG-343, the lens will often be easily visible and, indeed, it
is the lens that is unknowingly being monitored, with the source revealing itself only
in the course of the event. Events of magnification Amax require that the source be
much smaller than the Einstein radius, θ∗ ∼< 2θE/Amax. Since θE =√
κMπrel, large
θE requires a lens that is either massive or nearby, both of which suggest that it is
bright. On the other hand, a small θ∗ implies that the source is faint. Generally, if
a faint source and a bright potential lens are close on the sky, only the lens will be
seen, until it starts to strongly magnify the source. This has important implications
for the real time recognition of extreme magnification events, as we discuss in § 2.6.
Here we review the evidence as to whether the blended light in OGLE-2005-BLG-343
is in fact the lens.
As was true for OGLE-2003-BLG-175/MOA-2003-BLG-45 mentioned in § 2.1.3,
the blended light in OGLE-2004-BLG-343 lies in the “reddening sequence” of
foreground disk stars. It is certainly “dazzling” by any criterion, being about 50
51
times brighter than the source in I and 150 times brighter in V (see Fig. 2.2). Is the
blended light also due to the lens in this case?
There is one argument for this hypothesis and another against. We initiate
the first by estimating the mass and distance to the blend as follows. We
model the extinction due to dust at a distance x along the line of sight by
dAI/dx = 0.4 kpc−1e−qx and set q = 0.26 kpc−1 in order to reproduce the measured
extinction to the bulge AI(8 kpc) = 1.34. Using the Reid (1991) color-magnitude
relation, we then adjust the distance to the blend until it reproduces the observed
color and magnitude of the blend. We find a distance modulus of 12.6 (∼ 3.3 kpc),
and with the aid of the Cox (2000) mass-luminosity relation, we estimate a
corresponding mass Ml = 0.9 M⊙. Inspecting Figure 2.5, we see that this is almost
exactly the peak of the lens-distance distribution function predicted by combining
light-curve information and the Galactic model. This is quite striking because, in the
absence of light-curve information, the lens would be expected to be relatively close
to the source. From our Monte-Carlo simulation toward the line of sight of this
event, the total prior probability of the bulge-bulge events is about 1.5 times higher
than the prior probability of the bulge-disk events, and furthermore, only about 7%
of all events have lenses less than 3.3 kpc away (see green and purple histograms in
source and lens distance modulus panels of Fig. 2.5). It is only because the light
curve lacks obvious finite-source effects (despite its very high-magnification) that one
is forced to consider lenses with large θE, which generally drives one toward nearby
52
lenses in the foreground disk. Based on our experience analyzing many blended
microlensing events, the blended light is most often from a bulge star rather than
a disk star, which simply reflects the higher density of bulge stars. In brief, it is
quite unusual for lenses to be constrained to lie in the disk, and it is quite unusual
for events to be blended with foreground disk stars. This doubly unusual set of
circumstances would be more easily explained if the blend were the lens.
However, if the blend were the lens, then the source and lens would be aligned
to better than 1 mas during the event, and one would therefore expect that the
apparent position of the source would not change as the source first brightened
and then faded. In fact, we find that the apparent position does change by about
73 ± 9 mas. However, since the apparent source (i.e., combined source and blended
light at baseline) has a near neighbor at 830 mas, which is almost as bright as the
source/blend, it is quite possible that the lens actually is the blend, but that this
neighbor is corrupting the astrometry.
Thus, the issue cannot be definitively settled at present. However, it could be
resolved in principle by, for example, obtaining high-resolution images of the field a
decade after the event when the source and lens have separated sufficiently to both
be seen. If the blend is the lens, then they will be seen moving directly apart with
a proper motion given µ = θE/tE, where θE is derived from the estimated mass and
distance to the lens and tE is the event timescale.
53
Since the blend cannot be positively identified as the lens, we report our main
results using a purely probabilistic estimate of the lens parameters. However, for
completeness, we also report results here based on the assumption that the lens is
the blend. Compared to the previous simulation, in which we considered the full
mass function and full range of distances, we sample only the narrow intervals of
mass and distance that are consistent with the observed color and magnitude of the
lens/blend. To implement these restrictions, we repeat the Monte Carlo, but with
the additional constraint that the predicted apparent magnitudes agree with the
observed blend magnitude (with an error of 0.5 mag) and that the predicted colors
(using the above extinction law and the Reid 1991 color-magnitude relation) also
show good agreement with the observed color (with 0.2 mag error). These errors are,
of course, much larger than the observational errors. They are included to reflect the
fact that the theoretical predictions for color and magnitude at a given mass are not
absolutely accurate.
Figure 2.12 is the resulting version of Figure 2.10 when the Monte Carlo is
constrained to reproduce the blend color and magnitude. The sensitivity contours
are narrower and deeper, reflecting the fact that the diagram no longer averages over
a broad range of lens masses but rather is restricted effectively to a single mass (and
single distance).
54
2.6. Summary and Discussion
In this paper we present our analysis of microlensing event OGLE-2004-BLG-
343, with the highest peak magnification (Amax = 3000±1100) ever analyzed to date.
The light curve is consistent with the single-lens microlensing model, and no planet
has been detected in this event. We demonstrate that if the peak had been well
covered by the observations, the event would have had the best sensitivity to planets
to date, and it would even have had some sensitivity to Earth-mass planets (§ 2.4.4,
§ 2.4.5). However, this potential has not been fully realized due to human error
(§ 2.2), and OGLE-2004-BLG-343 turns out to be no more sensitive to planets than
a few other high-magnification events analyzed before (§ 2.4.2, § 2.4.5). Thus, while
ground-based microlensing surveys are technically sufficient to detect very low-mass
planets, the relatively short timescale of the sensitive regime of high-magnification
microlensing events demands a rapidity of response that is not consistently being
achieved. In the final paragraph below, we develop several suggestions to rectify this
situation.
In § 2.3 we show that finite-source effects are important in analyzing this
event, so we extend the method of Yoo et al. (2004b) to incorporate such effects in
planetary detection efficiency analysis. Moreover, since magnification calculations
of binary-lens models with finite-source effects are computationally remarkably
expensive, and applying previous finite-source algorithms, it would have taken
55
of order a year of CPU time to do the detection efficiency calculation required
by this event. We therefore develop two new binary-lens finite-source algorithms
(Appendix A) that are considerably more efficient than previous ones. The
“map-making” method (Appendix A.1) is an improvement on the conventional
inverse ray-shooting method, which proves to be especially efficient for use in
detection efficiency calculations, while the “loop-linking” method (Appendix A.2)
is more versatile and could be easily implemented in programs aimed at finding
best-fit finite-source binary-lens solutions. Using these algorithms, we were able to
complete the computations for this paper in about 4 processor-weeks, roughly an
order of magnitude faster than would have been required using previous algorithms.
Finally, we show in § 2.5 that the blend, which is a Galactic disk star, might
very possibly be the lens, and that this case also proves to be highly probable
from the Monte-Carlo simulation. However, it seems to contradict the astrometric
evidence, and we point out that this issue could in principle be solved by future
high-resolution images. Among the high-magnification events discovered by current
microlensing survey groups, it is very likely that the lens star, which is also the
apparent source, of those events is in the Galactic disk. Thus the blended light
is usually far brighter than the source, thereby increasing the difficulty in early
identification of such events. This fact motivates the first of several suggestions
aimed at improving the recognition of very high-magnification events:
56
1) When events are initially alerted they should be accompanied by instrumental
CMD of the surrounding field, with the location of the apparent “source”
highlighted. Events whose apparent sources lie on the “reddening sequence”
of foreground disk stars (see Fig. 2.2) have a high probability to actually be
lenses of more distant (and fainter) bulge sources. These events deserve special
attention even if their initial light curves appear prosaic.
2) For each such event it is possible to measure the color (but not immediately
the magnitude) of the source by the standard technique of obtaining two-band
photometry and measuring the slope of the relative fluxes in the two bands.
If the color is different from that of the apparent “source” at baseline, that
will prove that this baseline light is not primarily due to the source, and it will
increase the probability that this baseline object is the lens. Moreover, if the
source color is relatively red, it will show that the source is probably faint and
so is (1) most likely already fairly highly magnified (thereby making it possible
to detect above the foreground blended light) and (2) capable, potentially at
least, of being magnified to very high magnification (see § 2.5). This would
motivate obtaining more data while the event was still faint to help predict
its future behavior and would enable a guess as to how to “renormalize” the
event’s apparent magnification to its true magnification. This is important
because generally one cannot accurately determine this renormalization until
57
the event is within 0.4 mag (when the event is teff before its peak), at which
point it may well be too late to act on this knowledge.
3) Both survey groups and follow-up groups should issue alerts on suspected
high-magnification events guided by a relatively low threshold of confidence,
recognizing that this will lead to more “false alerts” than at present. If such
alerts are accompanied by a cautionary note, they will promote intergroup
discussions that could lead to more rapid identification of high-magnification
events without compromising the credibility of the group.
58
Fig. 2.1.— Light curve of OGLE-2004-BLG-343 near its peak on 2004 June 19(HJD 2,453,175.7467). Only OGLE I-band data (open circles) are u sed in mostof the analysis, except OGLE V -band data (open triangles) and µFUN H-band data(crosses) are used to constrain the color of the source star. All bands are linearlyrescaled so that Fs and Fb are the same as the OGLE I-band observations. The solidline shows the bes t-fit PSPL model. The upper right inset shows the peak of thelight curve, wit h the range of the simulated data points plotted by the thick line.
59
Fig. 2.2.— CMD of the OGLE-2004-BLG-343 field. Hipparcos main-sequence stars(blue dots), placed at 10−0.15/5R0 = 7.5 kpc and reddened by the reddening vectorderived from the clump, are displayed with the OGLE-II stars (black dots). The Reid(1991) relation is plotted by the red solid line over the Hipparcos stars. On the CMD,the magenta filled circle is the red clump and the green filled circle is the blendedstar. The large black filled circle is the OGLE V measurement of the source with of1σ error bars, which sets a lower limit for the source V − I color. The magenta filledcircle with error bars is the result of combining the (I −H)/(V − I) information (seeFig. 2.3) with the OGLE measurement.
60
Fig. 2.3.— (VOGLE − IOGLE)/(IOGLE − H2MASS) color-color diagram. All points arefrom matching 2MASS H-band data with OGLE-II V, I photometry in a field centeredon OGLE-2004-BLG-343. Stars on the giant branch are shown by open circles. Thesolid and dashed vertical lines represent the source IOGLE−H2MASS color transformedfrom its IOGLE−HµFUN value and its 1σ ranges. Their intersections with the diagonaltrack of stars give corresponding VOGLE − IOGLE colors, which are represented by thehorizontal lines.
61
Fig. 2.4.— Likelihood contours (1σ, 2σ, 3σ) for finite-source points-lens modelsrelative to the best-fit PSPL model. Contours with x-axis as log u−1
0 and log Amax
are displayed in solid and dashed lines, respectively.
62
b-b(posterior)b-d(posterior)
bigger finite-sourceeffects
b-b(prior)b-d(prior)
priorposterior
Fig. 2.5.— Probability distributions of u0, dereddened apparent I-band magnitudeof the source I0, proper motion µ, log(z0), source distance modulus, lens distancemodulus, absolute I-band magnitude of the source MI , and lens mass for Monte Carloevents toward the line of sight of OGLE-2004-BLG-343. Blue histograms representthe posterior probability distributions for bulge-disk microlensing events while redones represent the posterior probability distributions for bulge-bulge events. In thesource and lens distance-modulus panels, histograms in purple and green representthe prior probability distributions for bulge-disk and bulge-bulge events, respectively.The black Gaussian curves in the u0 and I0 panels show probability distributions fromPSPL light-curve fitting alone. In the lens mass panel, the dark green histogram showsthe prior probability distribution, while the orange histogram represents the posteriordistribution.
63
Fig. 2.6.— (For real data) Planetary detection efficiency for mass ratios q = 10−3,10−4, and 10−5 for OGLE-2004-BLG-343 as a function of the planet-star separationbx = b× cos α and by = b× sin α in the units of θE where α is the angle of planet-staraxis relative to the source-lens direction of motion. Different colors indicate 10% (red),25% (yellow), 50% (green), 75% (cyan), 90% (blue) and 100% (magenta) efficiency.The black circle is the Einstein ring, i.e., b = 1.
64
Fig. 2.7.— (For real data) Planet detection efficiency of OGLE-2004-BLG-343 as afunction of the planet-star separation b (in the units of rE) and planet-star mass ratioq. The contours indicate 25%, 50%, 75%, and 90% efficiency.
65
Fig. 2.8.— (For fake data) Planetary detection efficiency for mass ratios q = 10−3,10−4, 10−5 and 10−6 of OGLE-2004-BLG-343 augmented by simulated data pointscovering the peak as a function of the planet-star separation bx and by in the units ofθE. Different colors indicate 10% (red), 25% (yellow), 50% (green), 75% (cyan), 90%(blue) and 100% (magenta) efficiency. The black circle is the Einstein ring, i.e., b = 1.
66
Fig. 2.9.— (For fake data) Planetary detection efficiency of OGLE-2004-BLG-343augmented by simulated data points covering the peak. as a function of the planet-star separation b (in the units of θE) and planet-star mass ratio q. The contoursrepresent 25%, 50%, 75%, and 90% efficiency.
67
Fig. 2.10.— (For real data) Planetary detection efficiency as a function of r⊥, thephysical projected star-planet distance and mp, the planetary mass for OGLE-2004-BLG-343. The contours represent 25%, 50%, 75%, and 90% efficiency.
68
Fig. 2.11.— (For fake data) Planetary detection efficiency as a function of r⊥,the physical projected star-planet distance and mp, the planetary mass for OGLE-2004-BLG-343 augmented by simulated data points covering the peak. The contoursrepresent 25%, 50%, 75%, and 90% efficiency.
69
Fig. 2.12.— Planetary detection efficiency as a function of r⊥, the physical projectedstar-planet distance, and mp, the planetary mass for OGLE-2004-BLG-343, byassuming that the blended light is due to the lens. The contours represent 25%,50%, 75%, and 90% efficiency.
70
t0(HJD’) u0 tE(days) Is Ib χ2 Degree of Freedom
3175.7467 ± 0.0005 0.000333 ± 0.000121 42.5 ± 15.6 22.24 ± 0.40 18.08 ± 0.01 200.1 188
Table 2.1. OGLE-2004-BLG-343 Best-Fit PSPL Model Parameters
71
Chapter 3
Microlensing Event MOA-2007-BLG-400:
Exhuming the Buried Signature of a Cool,
Jovian-Mass Planet
3.1. Introduction
In the currently favored paradigm of planet formation, the location of the snow
line in the protoplanetary disk plays a pivotal role. Beyond the snow line, ices can
condense, and the surface density of solids is expected to be higher by a factor of
several relative to its value just inside this line. As a result of this increased surface
density, planet formation is expected to be most efficient just beyond the snow
line, whereas for increasing distances from the central star the planet formation
efficiency drops, as the surface density decreases and the dynamical time increases
(Lissauer 1987). In this scenario, gas-giant planets must form in the region of the
protoplanetary disk immediately beyond the snow-line, as the higher surface density
is required to build icy protoplanetary cores that are sufficiently massive to accrete
a substantial gaseous envelope while there is remaining nebular gas (Pollack et al.
1996). Low-mass primaries are expected to be much less efficient at forming gas
72
giants because of the longer dynamical times and lower surface densities at the snow
lines of these stars (Laughlin et al. 2004; Ida & Lin 2005; Kennedy & Kenyon 2008).
Migration due to nebular tides and other dynamical processes can then bring the icy
cores or gas giants from their formation sites to orbits substantially interior to the
snow line (Lin et al. 1996; Ward 1997; Rasio & Ford 1996).
The precise location of the snow line in protoplanetary disks is a matter of
some debate (e.g., Lecar et al. 2006), and is even likely to evolve during the epoch
of planet formation, particularly for low-mass stars (Kennedy et al. 2006; Kennedy
& Kenyon 2008). The condensation temperature of water is ∼ 170 K, and a fiducial
value for the location of the snow line in solar-mass stars motivated by observations
in our solar system is ∼ 2.7 AU. This may scale linearly with the stellar mass M ,
since the stellar luminosity during the epoch of planet formation scales as ∼ M2
for stars with M ∼< M⊙ (Burrows et al. 1993, 1997). Whereas the radial velocity
and especially transit methods are most sensitive to planets that are close to their
parent star at distances well inside the snow line, the sensitivity of the microlensing
method peaks at planetary separations near the Einstein ring radius of the primary
lens (Mao & Paczynski 1991; Gould & Loeb 1992), which is ∼ 3.5 AU(M/M⊙)1/2 for
typical lens and source distances of 6 kpc and 8 kpc, respectively. This corresponds
to a peak sensitivity at equilibrium temperatures of Teq ∼ 150 K(M/M⊙) for a
mass-luminosity relation of the form L ∝ M5, and distances relative to the snow line
of ∼ 1.3(M/M⊙)−1/2 if the location of the snow line at the epoch of planet formation
73
scales as M . Thus microlensing is currently the best method of probing planetary
systems in the critical region just beyond the snow line (Gould & Loeb 1992).
Planetary perturbations in microlensing events come in two general classes. The
majority of planetary perturbations are expected to occur when a planet directly
perturbs one of the two images created by the primary lens, as the image sweeps
by the planet during the microlensing event (Gould & Loeb 1992). Although these
perturbations are more common, they are also unpredictable and can occur at any
time during the event. Early microlensing planet searches focused on this class of
perturbations, as it was the first to be identified and explored theoretically (Gould
& Loeb 1992; Bennett & Rhie 1996; Gaudi & Gould 1997). The second class of
planetary perturbations occurs in high-magnification events, in which the source
becomes very closely aligned with the primary lens (Griest & Safizadeh 1998).
In such events, the two primary-lens images become highly distorted and sweep
along nearly the entirety of the Einstein ring (Liebes 1964). These sweeping images
probe subtle distortions of the Einstein ring caused by nearby planets, which will
give rise to perturbations within the full-width half-maximum of the event (Bond
et al. 2002; Rattenbury et al. 2002). Although high-magnification events are rare
and so contribute a minority of the planetary perturbations, they are individually
more sensitive to planets because the images probe nearly the entire Einstein ring.
Furthermore, since the perturbations are localized to the peak of the event which
74
can be predicted beforehand, they can be monitored more efficiently with limited
resources than the more common low-magnification events.
For these reasons, current microlensing planet searches tend to deliberately
focus on high-magnification events. Thus, of the seven prior microlensing planets
discovered to date (Bond et al. 2004; Udalski et al. 2005; Beaulieu et al. 2006;
Gould et al. 2006; Gaudi et al. 2008; Bennett et al. 2008), five have been found
in high-magnification events, with peak magnifications ranging from A = 40 to
A = 800. However, despite the fact that this planet-search strategy has proven
to be so successful, the properties of the planetary perturbations generated in
high-magnification events are less well-understood than those in low-magnification
events.
Most of the studies of the properties of planetary perturbations in high-
magnification events have focused on the properties of the caustics, the locus of
points defining one or more closed curves, upon which the magnification of a point
source is formally infinite. The morphology and extent of the region of significant
perturbation by the planetary companion can be largely understood by the shape and
size of these caustic curves. Planetary perturbations in high-magnification events are
caused by a central caustic located near the position of the primary, and thus several
authors have considered the size and shape of these central caustics as a function
of the parameters of the planet (Griest & Safizadeh 1998; Dominik 1999b; Chung
et al. 2005). These and other authors have identified several potential degeneracies
75
that complicate the unique interpretation of central caustic perturbations. The
first to be identified is a degeneracy such that the caustic structure (and so light
curve morphology) of a planet with mass ratio q ≪ 1 and projected separation in
units of the Einstein ring d not too close to unity is essentially identical under the
transformation d ↔ d−1 (Griest & Safizadeh 1998). A second degeneracy arises
from the fact that very close or very wide roughly equal-mass binaries also produce
perturbations near the peak of the light curves. These give rise to perturbations that
have the same gross observables as planetary perturbations.
The severity of these degeneracies depends on both the specific parameters
of the planetary/binary companion, as well as on the data quality and coverage.
Griest & Safizadeh (1998) and Chung et al. (2005) demonstrated that the d ↔ d−1
degeneracy is less severe for more massive planets with separations closer to the
Einstein ring (d ∼ 1). Empirically, this degeneracy was broken at the ∆χ2 ∼ 4 level
for the relatively large mass-ratio planetary companion OGLE-2005-BLG-071Lb
(Dong et al. 2009), for which the light curve was well-sampled, but was essentially
unresolved for the low mass-ratio planetary companion MOA-2007-BLG-192Lb
(Bennett et al. 2008), for which the planetary perturbation was poorly sampled.
For the planetary/equal-mass binary degeneracy, Han & Gaudi (2008) argued
that, although the gross features of central caustic planetary perturbations can be
reproduced by very close or very wide binary lenses, the morphologies differ in detail,
and thus this degeneracy can be resolved with reasonable light curve coverage and
76
moderate photometric precision. Indeed for every well-sampled high-magnification
event containing a perturbation near the peak (and that is not in the Chang-Refsdal
(1979) limits), this degeneracy has been resolved (Albrow et al. 2002; Gould et
al. 2006; Dong et al. 2009). Even for the relatively poorly-sampled light curve of
MOA-2007-BLG-192Lb, an equal-mass binary model is ruled out at ∆χ2 ∼ 120
(Bennett et al. 2008), although in this case this is partially attributable to the
exquisite photometric precision (< 1%).
One complication with searching for planets in high-magnification events is
that, the higher the magnification, the more likely it is that the primary lens will
transit the source. When this happens, the peak of the event is suppressed and
smoothed out, as the lens strongly magnifies only a small portion of the source.
If the source is also larger than the region of significant perturbations due to a
planetary companion (roughly the size of the central caustic), then the planetary
deviations will also be smoothed out and suppressed (Griest & Safizadeh 1998;
Han 2007). These finite source effects have potential implications for both the
detectability of central caustic perturbations, as well as the ability to uniquely
determine the planetary parameters, and in particular resolve the two degeneracies
discussed above. In practice, the caustic structures of all four high-magnification
planetary events (containing five planets) were larger than the source. Hence,
while there were detectable finite-source effects in all cases (which helped constrain
the angular Einstein radius and so the physical lens parameters), the planetary
77
perturbations were in all four cases quite noticeable. Thus the effect of large sources
on the the detectability and interpretation of central-caustic perturbations has not
been explored in practice.
Theoretical studies of detectability of central caustic perturbations when
considering finite source effects have been performed by Griest & Safizadeh (1998),
Chung et al. (2005), and Han (2007). These authors demonstrated that the
qualitative nature of planetary perturbations from central caustics is dramatically
different for sources that are larger than the caustic. In particular, the detailed
structure of the point-source magnification pattern, which generally follows the shape
of the caustic, is essentially erased or washed out. Rather, the perturbation structure
is characterized by a roughly circular region of very low-level, almost imperceptible
deviation from the single-lens form that is roughly the size of the source and centered
on the primary lens. This region is surrounded by an annular rim of larger deviations
that has a width roughly equal to the width of the caustic (Griest & Safizadeh 1998;
Chung et al. 2005). Finally, there are less pronounced deviations that extend to a
few source radii. Planets are detectable even if their central caustics are quite a bit
smaller than the source, provided that the χ2 deviation is sufficiently high. (Often
∆χ2 > 60 is adopted, although ∆χ2 > 150 may be more realistic.) The magnitude
of these perturbations decrease as the ratio between the source size and caustic size
increases, making it difficult to detect very small planets for large sources (Chung et
al. 2005; Han 2007).
78
Although central caustics may formally be detectable when the source is
substantially larger than the caustic, it remains a significant question whether
these very washed out caustics can be recognized in practice, and even if they can,
whether they can be uniquely interpreted in terms of planetary parameters. Indeed,
it is unknown whether a washed out central caustic due to a planet can actually
be distinguished from one due to a binary companion. This question is especially
important with regard to low-mass planets. The size of the central caustic scales
as the product of the planet/star mass ratio and a definite function of planet-star
separation. Hence, taken as a whole, smaller planets produce smaller caustics,
meaning that events of higher magnification are required to detect them. These are
just the events that are most likely to have their peaks washed out by finite-source
effects.
Here we analyze the first high-magnification event with a buried signature of
a planet, in which the source size is larger than the central caustic of the planet.
The caustic is indeed so washed out that the event appears unperturbed upon
casual inspection. However, the residuals to a point-lens fit are clear and highly
significant. We show that one can infer the planetary (as opposed to binary) nature
of the perturbation from the general pattern of these residuals, and that a detailed
analysis constrains the mass ratio of the planet quite well, but leaves the close/wide
(d ↔ d−1) degeneracy intact. Hence, at least in this case, the fact that the caustic is
79
buried in the source does not significantly hinder one’s ability to uncover the planet
and measure its mass ratio.
3.2. Observations
MOA-2007-BLG-400 [(α, δ)J2000.0 = (18h09m41.s98,−2913′26.95′′),
(l,b)=(2.38,−4.70)] was announced as a probable microlensing event by the
Microlensing Observations in Astrophysics (MOA) collaboration on 5 Sept 2007
(HJD′ ≡ HJD - 2450000 = 4349.1), about 5 days before peak. The source star proves
to be a bulge subgiant and so is somewhat brighter than average, but the event
timescale was relatively short (tE ∼ 15 days) and observations had been interrupted
for 6 days by bad weather. Taken together, the two latter facts account for the
relatively late alert. By coincidence, the triggering observations took place on the
same night that another event, OGLE-2007-BLG-349 (aka MOA-2007-BLG-379),
was peaking at extremely high magnification with an already-obvious planetary
anomaly. After focusing exclusively on the latter event for the first 5 hours of the
night, MOA resumed its normal field rotation for the last 1.5 hours, which led to the
discovery of MOA-2007-BLG-400.
The Microlensing Follow Up Network (µFUN) began observing this as a possible
high-magnification event on 7 Sept, but did not mobilize intensive observations until
UT 08:55 10 Sept, just 15 hours before peak, following a high-mag alert issued
80
by MOA a few minutes earlier. Even at that point, the predicted minimum peak
magnification was only Amax > 90, which would have enabled only modest sensitivity
to planets. Nevertheless, all stops were pulled and it was observed as intensively as
possible from 7 observatories, µFUN CBA Perth (Australia) 0.25m unfiltered, µFUN
Bronberg (South Africa) 0.35m unfiltered, µFUN SMARTS (CTIO, Chile) 1.3m V ,
I, H, µFUN Campo Catino Austral (CAO, Chile) 0.50m unfiltered, µFUN Farm
Cove (New Zealand) 0.35m unfiltered, µFUN Auckland (New Zealand) 0.4m R, and
µFUN Southern Stars (Tahiti) 0.28m unfiltered.
The source star lies just outside one of the OGLE fields (as defined by their
field templates) and for this reason was not recognized as a microlensing event by
the OGLE Early Warning System. However, due to small variations in pointing,
there are a total of 452 OGLE images containing this source. Only two of these are
significantly magnified, ten days and nine days before peak. Hence, the OGLE data
do not help constrain the light curve parameters. However, they are useful to study
of the baseline behavior of the source (see Appendix D).
Essentially all of the “action”, both the peak of the event and the planetary
anomalies, occurred during the µFUN SMARTS (Chile) observations at CTIO,
4354.47 < HJD′ < 4354.69, using the ANDICAM optical/IR dual-channel camera,
and µFUN CAO (Chile) observations 4354.50 < HJD′ < 4354.70. Most (45) of the
optical CTIO observations over the peak were carried out in I band, with a few (8)
taken in V in order to measure the (V − I) color. Each of these was a 5 minute
81
exposure, with approximately 1 minute read-out time between exposures. During
each optical exposure, there were 5 dithered H-band exposures, each of 50 seconds,
almost equally spaced over the 6 minute cycle time. That is, 53 × 5 = 265 H-band
observations in all. Unfortunately, the source became so bright as it transited the
lens (i.e., when the planetary anomalies were the strongest), that 14 I-band images
were affected by non-linearities and saturation in the detector response. We exclude
these 14 I-band data points from analysis. The H-band photometry are not affected
by this problem, therefore, with higher time resolution and more continuous coverage
than the I-band data, the H-band data provide most of the constraining power to
the microlens model.
There were 84 CAO un-filtered observations taken during the peak night.
Unfortunately, the peak of the event was severely saturated and the clock zero point
is not securely known, therefore, these data are not used in the analysis. However,
the exposures during the times of maximum deviation from a point lens (i.e., when
the caustic was crossing the stellar limb) are not saturated, and these qualitatively
confirm the interpretation from the more detailed CTIO data.
The µFUN CTIO and MOA data are used in the analysis since they provide
essentially all the constraints to the microlens model. Data from other µFUN
sites are checked for consistency with the final models, and they are found to be
well fitted by the best-fit model. The MOA data were reduced using the standard
MOA difference imaging analysis (DIA) pipeline. The µFUN CTIO data were
82
reduced using DIA developed by Wozniak (2000). The H-band data are affected
by intrapixel sensitivity variations at the 1% level. Fortunately, the dither pattern
was repeated almost exactly over the night of the peak, so that these variations
follow the 5-element dither pattern quite well. We therefore treat the H-band data
as 5 independent data sets, which reduces χ2 by 180 for 8 degrees of freedom. The
H-band images show a triangular PSF, which is likely to introduce systematic errors
into the photometry. As a crosscheck, we also use the DIA package developed by
Bond et al. (2001) to independently reduce these images.
3.3. Microlens Model
Despite the fact that the peak of MOA-2007-BLG-400 was “flattened” by finite-
source effects, it nevertheless reached a very high peak magnification, Amax = 628.
However, even to the experienced eye, it looks like an ordinary point-lens light curve
with pronounced finite-source effects. More detailed modeling is required to infer
that it actually contains a Jovian mass-ratio planet.
Figure 3.1 shows the light curve together with the best-fit point-lens model
(blue) and planetary model (red). Both include finite-source effects. The most
pronounced features of the point-lens model residuals are a short positive spike on
the rising side and a short negative spike on the falling side, each lasting about 30
minutes, which leave very similar traces in I and H. As displayed in Figure 3.2, these
83
features clearly stand out in the reductions using both the Wozniak (top panel) and
Bond (bottom panel) DIA packages. Each package introduces its own systematic
deviations, but there are no obvious trends besides the above features that are
supported by both reductions. These occur very close to the times that the point
lens begins and ends its transit of the source (within the framework of this model).
The timing of these deviations strongly suggests that they are due to microlensing
rather than stellar variability. There are then two possible explanations: either the
source is actually being transited by a more complicated caustic than a point lens
(due to a binary or planetary companion) or the limb of the source is not being
properly modeled. However, if one assumes a circular source, the latter explanation
would imply symmetric residuals, whereas the actual residuals are closer to being
antisymmetric. (We address the possibility of an elliptical source in Appendix D.)
Indeed, this approximate antisymmetry extends to the less pronounced residual
features, including the sustained deficit prior to the first spike (and sustained excess
following the second one) as well as the declining residuals between the two spikes.
The similarity of the I and H residuals in itself argues that the deviations are due to
microlensing rather than some sort of stellar variability, which would not generally
be expected to be achromatic.
The short durations of the spikes tell us that the central caustic is quite
small, with “caustic width” w ∼ 30 min/15 day = 1.4 × 10−3. This implies that
the companion either has low mass ratio, or is a very wide or very close binary
84
companion. Formally, w is given by equation (12) of Chung et al. (2005) as a very
good approximation to the “short diameter” or “width” of a central caustic (see
Fig. 3.3). However, here the estimate is quite inexact not only because the width of
the spikes is not precisely defined, but also because we do not know, at this point,
the exact orientation of the caustic.
After some algebra, one finds that in two limiting regimes, the Chung et al.
(2005) formula takes the forms,
w(d, q) → 4q
d2(d ≫ 1), w(d, q) → 4qd2 (d ≪ 1), (3.1)
and
w(d, q) →√
27
16
q
|d − 1| (d ∼ 1), (3.2)
where q is the companion/primary mass ratio and d is the separation in units of
the Einstein ring. Note that in the first limit (eq. [3.1]), w → 4γ, where γ is the
shear. The crossover point for these approximations is d = 2.3 (or d = 0.43), at
which point each is in error by about 15%. (For simplicity, we restrict the discussion
here to the case d > 1. There is a well-known d ↔ d−1 degeneracy between the
d ≪ 1 and d ≫ 1 limits, as can be guessed from the forms for w in eq. [3.1]. This
degeneracy will prove to be almost perfect in this case, see § 3.3.3). Hence, in the
first limit, d ∼ 50q1/2, implying that if q were in the “binary range” (| log q| < 1),
then d would be quite large. That is, the central caustic would be generated by a
85
nearly pure shear and therefore would have a nearly symmetric, diamond-shaped,
Chang-Refsdal (1979) form. In the point-lens model, the lens passes almost directly
over the center of the source. For this trajectory, a symmetric caustic would yield
symmetric residuals, in sharp contrast to Figure 3.1. On the other hand, for the
opposite limit, q ∼ 1 × 10−3(d − 1), which lies squarely in the planetary regime.
Thus, simple arguments already argue strongly in favor of a planetary companion
that is fairly near the Einstein ring.
3.3.1. Hybrid Pixel/Ray Map Algorithm
Notwithstanding these arguments, we conduct a massive blind search for
companions over a very broad range of masses using a modified version of the
“magnification map” technique of Dong et al. (2006), which was specifically designed
for high-magnification events. The original approach was, for each given (d, q)
pair, to shoot rays over a fairly narrow annulus (say, 0.01 Einstein radii) around
the Einstein ring in the image plane and to sort these rays in pixels on the source
plane. Then for each source position being modelled (i.e., each data point), one
would identify the pixels that intersected the source and would check each ray
contained in these pixels to determine whether it landed on the source and, if so,
evaluate the source surface brightness at that position. In the initial broad search,
three parameters (d, q, α) are held fixed on a grid of values, while the remaining
parameters (t0, u0, tE, ρ, and possibly others) are varied to minimize χ2 at each grid
86
point. Here, α is the angle of the source trajectory relative to the binary axis, t0 is
the time of closest approach to the adopted center of the lens geometry (usually the
center of mass), u0 is the source-lens separation at this time in units of the Einstein
radius, ρ is the source radius in the same units, and tE is the Einstein crossing time.
This division is efficient because 1) (t0, u0, tE, ρ) are usually approximately known
from the general structure of the light curve, so minimization over these parameters
is straightforward once (d, q, α) are fixed; 2) (d, q) define the map, which naturally
facilitates minimization of other parameters except for α, whose value is not usually
even approximately obvious from the light curve.
The new approach differs principally in that the pixels that are contained
entirely within the source are now evaluated as a whole, i.e., by the total number
of rays in that pixel. Pixels that cross the source boundary are still evaluated
ray-by-ray, as previously. This primary change then leads to several other changes.
First, for each pixel, we record not only the geometric center but also the centroid
of the rays. The surface brightness is then evaluated at the latter position. Second,
the pixels are made much smaller, to minimize both the number of rays that must
be evaluated individually and the surface-brightness variations across the pixel
(which are corrected only to first order by the ray-centroid scheme just mentioned).
Typically, there are a few hundreds of pixels per source. Third, the pixels are
hexagonal, since this is the most compact tiling possible, i.e., the closest tile shape
to a circle. Fourth, the source positions outside the map region are evaluated using
87
the hexadecapole and quadrupole approximation of Gould (2008) (see also Pejcha &
Heyrovsky 2008). Finally, we use Markov chain Monte Carlo (MCMC) for the χ2
minimization.
3.3.2. The (w, q) Grid of Lens Geometries
The initial search for solutions is conducted over a rectilinear grid in (w, q)
rather (d, q). Since the short diameter, or “caustic width”, w, is a monotonic
function of the star-planet separation d (at fixed planet/star mass ratio q), these
formulations are in some sense equivalent. However, for many events (including the
present one) the short diameter w can be estimated by simple examination of the
data. In these cases, the search space is both more regular and easier to define in
terms of the (w, q) grid. In particular, equation (3.2) shows that at fixed w, d moves
very close to 1 for very low q.
3.3.3. Best-Fit Model
We consider short diameters w over the range −3.5 ≤ log w ≤ −2 and
companion mass ratios −4 ≤ log q ≤ 0, focusing on the regime d ≥ 1. We find that
there is only one local minimum in this range. The range of allowed solutions is well
localized around this minimum, with
q = (2.5+0.5−0.3) × 10−3, d = 2.9 ± 0.2, w = (1.30 ± 0.06) × 10−3, (3.3)
88
with the last quantity being, of course, dependent on the first two. Figure 3.4 shows
the ∆χ2 = 1, 4, 9 contours with respect to the mass ratio q and the projected
planet-star separation d along with the short diameter w. The main point to note
is that these parameters are quite well constrained. Note that, as expected, d and q
are strongly correlated, while w and q are basically uncorrelated. We also perform
similar searches using the alternative H-band reduction by Bond’s DIA package. The
solutions agree with the above to well within one sigma, but the parameters have
larger uncertainties: q = (2.6 ± 0.7) × 10−3, d = 2.9 ± 0.3, w = (1.26 ± 0.09) × 10−3.
We therefore adopt results from the Wozniak-based reductions, noting that they
may subject to systematic errors ∼< 1σ.
Figure 3.3 shows the source trajectory and the central caustics as well as
the differences in magnification between the best-fit planetary model and its
corresponding single-lens model. This geometry nicely accounts for the main features
of the point-lens residuals seen in Figure 3.1. The regions beyond the “back walls”
(long segments) of the caustic are somewhat de-magnified, which accounts for the
initial depression of the light curve. As the source crosses the “back wall” of the
caustic, it spikes. After the source has exited the caustic, it continues to suffer
additional magnification due to the “ridge” of magnification that extends from the
trailing cusp.
We also conducted a similar blind search as above, but concentrating on the
regime d < 1. As expected, we recover the well-known d ↔ d−1 degeneracy, and
89
find a solution with essentially the same mass ratio q = (2.6 ± 0.4) × 10−3, but with
d = 2.9−1 = 0.34+0.03−0.02, and the wide solution is slightly preferred by ∆χ2 = 0.2.
Thus, although each solution is well-localized to its respective minimum, this
discrete degeneracy implies that the projected separation can take on two values
that differ by a factor of ∼ 8.5. The severity of the degeneracy can be traced to
the planetary parameters. Although the planet/star mass ratio is quite large, which
tends to reduce the severity of the degeneracy, the planet lies quite far from the
Einstein ring, which tends to make it more severe. Actually, a better measure of the
overall expected asymmetry between the d and d−1 solutions is the short diameter
w, which in this case is small, implying a severe degeneracy. Indeed, the caustic
structure and magnification pattern of the two solutions are nearly identical. In this
case, the large size of the source has competing influences on the ability to resolve
the degeneracy. On one hand, the large size of the source serves to suppress the
planetary deviations, thus making subtle differences more difficult to distinguish.
On the other hand, the large source implies that a large fraction of the planetary
perturbation region is probed. In this case, the source probes essentially the entire
region of significant planetary perturbation, as can be seen in Figure 3.3. This is
important for distinguishing between the solutions, as the largest difference between
the magnification patterns of the two degenerate solutions occurs in the region near
the tip of the arrow-shaped caustic (Griest & Safizadeh 1998). From Figure 3.3
90
it is clear that this region would have been entirely missed if the source had been
substantially smaller than the caustic.
3.4. Finite-Source Effects
In addition to (d, q), the model also yields the source radius relative to the
Einstein radius,
ρ = θ∗/θE = (3.29 ± 0.08) × 10−3, (3.4)
We then follow the standard (Yoo et al. 2004a) technique to determine the angular
source radius,
θ∗ = 1.05 ± 0.05 µas. (3.5)
That is, we first adopt [(V − I)0, I0]clump = (1.00, 14.32) for the dereddened position
of the clump. We then measure the offset of the source relative to the clump centroid
∆[(V − I), I] = (−0.19, 3.25), to obtain [(V − I)0, I0]s = (0.81, 17.57). See Figure
3.5. The instrumental source color is derived from model-independent regression of
the V and I flux, while the instrumental magnitude is obtained from the light-curve
model. We convert (V − I) to (V − K) using the color-color relations of Bessell &
Brett (1988), yielding (V − K)0 = 1.75, and then obtain equation (3.5) using the
color/surface-brightness relations of Kervella et al. (2004). Combining equations
91
(3.4) and (3.5) gives θE = θ∗/ρ = 0.32 mas. And combining this with the definition
θ2E = κMπrel, where M is the lens mass, πrel is the source-lens relative parallax,
and κ = 4G/c2AU ∼ 8.1 mas M−1⊙ , together with the measured Einstein timescale,
tE = 14.3 ± 0.3days, we obtain
M = 0.10 M⊙
(
πrel
125 µas
)−1
(3.6)
and
µrel =θE
tE= 8 mas yr−1. (3.7)
The relatively high lens-source relative proper motion µrel is mildly suggestive of a
foreground disk lens, but still quite consistent with a bulge lens. Since πrel = 125µas
corresponds to a lens distance DL = 4 kpc (assuming source distance DS = 8 kpc),
equation (3.6) implies that if the lens did lie in the foreground, then it would be a
very low-mass star or a brown dwarf.
Assuming that the source lies at a Galactocentric distance modulus 14.52, its
dereddened color and magnitude imply that [(V − I)0,MI ] = (0.81, 3.07), making it
a subgiant.
92
3.5. Limb Darkening
As illustrated in Figure 3.1, the principal deviations from a point-lens light
curve occur at the limb of the star. This prompts us to investigate the degree to
which the planetary solution is influenced by our treatment of limb darkening. The
results that we report are based on a fit to the H-band surface brightness profile of
the form
S(ϑ)
Sλ
= 1 − Γ(
1 − 3
2cos ϑ
)
− Λ(
1 − 5
4cos1/2 ϑ
)
, (3.8)
where Γ and Λ are the linear and square-root parameters, respectively, and where
ϑ is the angle between the normal to the stellar surface and the line of sight, i.e.,
sin ϑ = θ/θ∗. See An & Han (2002) for the relation between (Γ, Λ) and the usual
(c, d) formalism.
In deriving the reported results, we fix the H-band limb-darkening parameters
(Γ, Λ) = (−0.15, 0.69), corresponding to (c, d) = (−0.21, 0.79) from Claret (2000)
for a star with effective temperature Teff = 5325 K and log g = 4.0. These stellar
parameters are suggested by comparison to Yale-Yonsei isochrones (Demarque et
al. 2004) for the dereddened color and absolute magnitude reported in § 3.4. We
also perform fits in which Γ and Λ are allowed to be completely free. From these
fits, we find that our best-fit model has (Γ, Λ) = (−0.64, 1.47). Γ and Λ are highly
correlated, so their individual values are not of interest, and the surface-brightness
93
profiles generated by these two sets of (Γ, Λ) are qualitatively similar. In the present
context, however, the key point is that when we fix the limb-darkening parameters
at the Claret (2000) values, the contours in Figure 3.4 remain essentially identical
and the best fit values change by much less than 1σ.
Because of lower point-density and the aforementioned problems with the
I data over the peak, we only attempt a linear limb-darkening fit, i.e., we use
equation (3.8) with Λ ≡ 0, and we adopt Γ = 0.47 from Claret (2000).
3.6. Blended Light
In the crowded fields of the Galactic bulge, the photometered light of a
microlensing event rarely comes solely from the lensed source. Rather there is
typically additional light that is blended with the source but is not being lensed.
This light can arise from unrelated stars that happen to be projected close enough
to the line of sight to be blended with the source, or it can come from companions
to the source, companions to the lens, or the lens itself. This last possibility is most
interesting because, if the lens flux can be isolated and measured, it provides strong
constraints on the lens properties, and in this case would enable a complete solution
of the lens mass and distance, when combined with the measurement of θE (e.g.,
Bennett et al. 2007).
94
To investigate the blended light, we begin by using the method of Gould &
An (2002) to construct an image of the field with the source (but not the blended
light) removed, and compare this to a baseline image, which of course contains both
the source and the blended light (see Fig. 3.7). In these images, the source/blend is
immersed in the wings of a bright star (roughly 3.7 mag brighter than the source),
which lies about 2′′ away. On the baseline image, the source/blend is noticeable
against this background, but hardly distinct. On the source-subtracted image, the
blend is not directly discernible.
To make a quantitative estimate of the blend flux, we fit the region in the
immediate vicinity of the bright star to the form F = a1 + a2 × PSF, where “PSF”
is the point-spread-function determined from the DIA analysis. We then subtract
the best fit flux profile from the image. This leaves a clear residual at the position
of the source/blend in the baseline image, but just noise in the source-subtracted
image. We add all the flux in a 1.8′′ square centered on the lens, finding 564 ADU
and −28 ADU, respectively. We conclude that the I-band flux blend/source ratio is
fb/fs < 0.05. Of course, even if we had detected blended light, it would be impossible
to tell whether it was directly coincident with the source. If it were, this would imply
that this light would be directly associated with the event, i.e., being either the lens
itself or a companion to the lens or the source. Hence, this measurement is an upper
limit on the light from the lens in two senses: no light is definitively measured, and
if it were we do not know that it came from the lens. Combining this limit with
95
equation (3.6), and assuming the lens is a main-sequence star, it must then be less
massive than M < 0.75 M⊙, and so must have relative parallax πrel > 15 mas. This
implies a lens-source separation DS − DL > 1 kpc, which certainly does not exclude
bulge lenses. Indeed, if the lens were a K dwarf in the Galactic bulge, it would
saturate this limit.
3.7. Discussion
MOA-2007-BLG-400 is the first high-magnification microlensing event for which
the central caustic generated by a planetary companion to the lens is completely
enveloped by the source. As a comparison, the planetary caustic of OGLE-2005-
BLG-390 (Beaulieu et al. 2006) is smaller than its clump-giant source star in angular
size. When the planetary caustics is covered by the source, the finite-source effects
broaden the “classic” Gould & Loeb (1992) planetary perturbation features (Gaudi
& Gould 1997). By contrast, planet-induced deviations in MOA-2007-BLG-400 are
mostly obliterated, rather than being broadened, because the source crosses the
central caustic rather than the planetary caustic. We showed, nevertheless, that the
planetary character of the event can be inferred directly from the light-curve features
and that the standard microlensing planetary parameters (d, q) = (2.9, 2.5 × 10−3)
can be measured with good precision, up to the standard close/wide d ↔ d−1
degeneracy. We demonstrated that, in this case, the close/wide degeneracy is quite
96
severe, and the wide solution is only preferred by ∆χ2 = 0.2. This is unfortunate,
since the separations of the two solutions differ by a factor of ∼ 8.5. We argued that
the severity of this degeneracy was primarily related to the intrinsic parameters of
the planet, rather than being primarily a result of the large source size.
Although the mass ratio alone is of considerable interest for planet formation
theories, one would also like to be able to translate the standard microlensing
parameters to physical parameters, i.e., the planet mass mp = qM , and planet-star
projected separation r⊥ = dθEDL. Clearly this requires measuring the lens mass
M and distance DL. In this case, the pronounced finite source effects have already
permitted a measurement of the Einstein radius θE = 0.32 mas, which gives a relation
between the mass and lens-source relative parallax (eq. [3.6]). This essentially
yields a relation between the lens mass and distance, since the source distance is
close enough to the Galactic center that knowing DL is equivalent to knowing πrel.
Therefore, a complete solution could be determined by measuring either M or DL,
or some combination of the two.
One way to obtain an independent relation between the lens mass and distance
is to measure the microlens parallax, πE. There are two potential ways of measuring
πE. First, one can measure distortions in the light curve arising from the acceleration
of the Earth as it moves along its orbit. Unfortunately, this is out of the question
in this case because the event is so short that these distortions are immeasurably
small. Second, one can measure the effects of terrestrial parallax, which gives
97
rise to differences between the light curves simultaneously observed from two or
more observatories separated by a significant fraction of the diameter of the Earth.
Practically, measuring these differences requires a high-magnification event, which
would appear to make this event quite promising. Unfortunately, although we
obtained simultaneous observations from two observatories separated by several
hundred kilometers during the peak of the event, one of these datasets suffers from
large systematic errors and an unknown time zero point, rendering it unusable for
this purpose.
The only available alternative for breaking the degeneracy between the lens
mass and distance would be to measure the lens flux, either under the glare of the
source or, at a later date, to separately resolve it after it has moved away from the
line of sight to the source (Alcock et al. 2001a; Kozlowski et al. 2007). Panels (e)
and (f) of Figure 3.6 show the Bayesian estimates of the lens brightness in I-band
and H-band, respectively. If the lens flux is at least 2% of the source flux, then the
former kind of measurement could be obtained from a single epoch Hubble Space
Telescope observation, provided it were carried out in the reasonably near future. At
roughly 99% probability, the blended light would be either perfectly aligned with
the source (and so associated with the event) or well separated from it. HST images
can be photometrically aligned to the ground-based images using comparison stars
with an accuracy of better than 1%. Hence, photometry of the source+blend would
detect the blend, unless it were at least 4 mag fainter than the source. In principle,
98
the blend could be a companion to either the source or lens. Various arguments can
be used to constrain either of those scenarios. We do not explore those here, but see
Dong et al. (2009). If the lens is not detectable by current epoch HST observations
(or no HST observations are taken), then it will be detectable by ground-based AO
H-band observations in about 5 years. This is because the lens-source relative proper
motion is measured to be µrel = 8 mas yr−1, and the diffraction limit at H band on
a 10m telescope is roughly 35 mas. If the lens proves to be extremely faint, then a
wider separation (and hence a few years more time baseline) would be required.
In the absence of additional observational constraints, we must rely on a
Bayesian analysis to estimate the properties of the host star and planet, which
incorporates priors on the distribution of lens masses, distances, and velocities
(Dominik 2006; Dong et al. 2006). This is a standard procedure, which we only
briefly summarize here. We adopt a Han & Gould (1995) model for the Galactic
bar, a double-exponential disk with a scale height of 325 pc, and a scale length
of 3.5 kpc, as well as other Galactic model parameters as described in Bennett &
Rhie (2002). We incorporate constraints from our measurement of the lens angular
Einstein radius θE and the event timescale, as well as limits on the microlens parallax
and I-band magnitude of the lens. In practice, only the measurements of θE and tE
provide interesting constraints on these distributions. In addition, we include the
small penalty on the close solution, exp(−∆χ2/2), where the wide solution is favored
99
by ∆χ2 = 0.2. For the estimates of the planet semimajor axis, we assume circular
orbits and that the orbital phases and cos(inclinations) are randomly distributed.
The resulting probability densities for the physical properties of the host
star, as well as selected properties of the planet, are shown in Figure 3.6. The
Bayesian analysis suggests a host star of mass M = 0.30+0.19−0.12M⊙ at distance of
DL = 5.8+0.6−0.8 kpc. In other words, given the available constraints, the host is most
likely an M-dwarf, probably in the foreground Galactic bulge. Given that the
planet/star mass ratio is measured quite precisely, the probability distribution for
the planet mass is essentially just a rescaled version of the probability distribution
for the host star mass. We find mp = 0.83+0.49−0.31 MJup. The close/wide degeneracy
is apparent in the probability distribution for the semimajor axis a. We estimate
aclose = 0.72+0.38−0.16 AU for the close solution, and awide = 6.5+3.2
−1.2 AU for the wide
solution. The equilibrium temperatures for these orbits are Teq.,close = 103+28−26 K and
Teq.,wide = 34 ± 9 K for the close and wide solutions, respectively.
Thus our Bayesian analysis suggests that this system is mostly likely a
bulge mid-M-dwarf, with a Jovian-mass planetary companion. The semimajor
axis of the planetary companion is poorly constrained primarily because of the
close/wide degeneracy, but the implied equilibrium temperatures are cooler than the
condensation temperature of water. Specifically we find that Teq ∼< 173 K at 2σ
level. Alternatively, if we assume the snow line is given by asnow = 2.7 AU(M/M⊙),
we find for this system a snow line distance of ∼ 0.84 AU, essentially the same as
100
the inferred semimajor axis of the close solution. Thus this planet is quite likely to
be located close to or beyond the snow line of the system.
Although we cannot distinguish between the close and wide solutions for the
planet separation, theoretical prejudice in the context of the core-accretion scenario
would suggest that a gas-giant planet would be more likely to form just outside
the snow line, thus preferring the close solution. However, we have essentially no
observational constraints on the frequency and distribution of Jupiter-mass planets
at the separations implied by the wide solution (∼ 5.3 − 9.7 AU), for such low-mass
primaries. Unfortunately, the prospects for empirically resolving the close/wide
degeneracy in the future are poor. The only possible method of doing this would
be to measure the radial velocity signature of the planet. Given the faintness of
the host star (see §3.6 and Fig. 3.6), this will likely be impossible with current or
near-future technology.
The mere existence of a gas-giant planet orbiting a mid-M-dwarf is largely
unexpected in the core-accretion scenario, as formation of such planets is thought
to be inhibited in such low-mass primaries (Laughlin et al. 2004). Observationally,
however, although the frequency of Jovian companions to M-dwarfs with a ∼< 3 AU
does appear to be smaller than the corresponding frequency of such companions
to FGK dwarfs (Endl et al. 2006; Johnson et al. 2007b; Cumming et al. 2008),
several Jovian-mass companions to M dwarfs are known (see Dong et al. 2009 for
a discussion), so this system would not be unprecedented. Furthermore, it must
101
be kept in mind that the estimates of stellar (and so planet) mass depend on the
validity of the priors, and even in this context have considerable uncertainties.
Most of the ambiguities in the interpretation of this event would be removed
with a measurement of the host star mass and distance, which could be obtained by
combining our measurement of θE with a measurement of the lens light as outlined
above. The Bayesian analysis informs the likelihood of success of such an endeavor.
This analysis suggests that, if the host is a main-sequence star, its magnitude will
be IL = 23.9+0.8−1.0 and HL = 21.4+0.7
−1.0, which corresponds to 0.6% and 1.7% of the
source flux, respectively. If initial efforts to detect the lens fail, more aggressive
observations would certainly be warranted: microlensing is the most sensitive
method for detecting planets around very low-mass stars simply because it is the
only method that does not rely on light from the host (or the planet itself) to detect
the planet. And given equation (3.6), even an M dwarf at the very bottom of the
main sequence M = 0.08 M⊙, would lie at DL = 3.5 kpc and so would be H ∼ 24.
102
HJD - 2454354.0
Fig. 3.1.— Top: Lightcurve of MOA-2007-BLG-400 with data from µFUN CTIO(Chile) simultaneously taken in H (cyan), I (DIA, black). Models are shown for apoint lens (blue) and planet-star system (red). There are 5 50-second H exposures foreach 300-second I (or V – not shown) exposure in 6 minutes cycles. Some I-band dataat the peak suffer from saturation, and those points are therefore removed from theanalysis (see text). Middle: Residuals for best-fit point-lens model and its differencewith the planetary model. Note that in the top panel, the H data are shown asobserved, while the I data are aligned. Normally, such alignment is straightforwardbecause microlensing of point sources is achromatic. However, here there is significantchromaticity due to different limb-darkening. The I-band points in the top panel areactually the residuals to the I-band limb-darkened model (middle panel), added tothe H-band model curve (top panel). Bottom: Residuals from a point-lens modelwith the same parameters as the planetary model, which can be directly compared tothe “magnification map” in Fig. 3.3. These “didactic residuals” are naturally morepronounced than those from the best-fit point lens.
103
Fig. 3.2.— Comparisons of residuals to the best-fit point-lens models between twophotometric reductions of µFUN H-band data, using the DIA packages developedby Wozniak (2000) (top) and Bond et al. (2001) (bottom). The red curve in eachpanel represents the difference of the best-fit planetary and point-lens models for thatpanel’s reduction. Both reductions agree on the main planetary features, but eachpackage introduces its own systematics. For example, the systematic deviations fromthe planetary model at HJD ∼ 2454354.57 shown in the top panel are not supportedby the reductions of the Bond’s package. During 2454354.46 < HJD < 2454354.51,most images have low transparency (∼< 50%), which causes relatively large scatter inBond’s DIA reductions. In comparison, the reduction by Wozniak’s DIA has smallerscatter during this period. However, the data exhibit some low-level systematics,which are not supported by the other reduction.
104
Fig. 3.3.— Magnification differences between of best-fit planetary model [(q, d) =(0.0025, 2.9) and (q, d) = (0.0026, 0.34) being nearly identical] and single-lens models,in units of the measured source size, ρ = 0.0033 Einstein radii. Contours show 1%, 2%,3%, and 4%, deviations in the positive (brown) and negative (blue) directions. Toppanel: Single-lens geometry (t0, u0, tE) is taken to be the same as in the planetarymodel, with no finite source effects. Caustic (contour of infinite magnification) isshown in white. The deviations are very pronounced. Bottom Panel: Same as toppanel, but including finite-source effects, which now explain the main features of thelight curve. The trajectory begins with a negative deviation, then hits a narrow“brown ridge” causing the spike seen in the bottom panel of Fig. 3.1, as the edgeof the source first hits the caustic. Then there are essentially no deviations (white)while the source covers the caustic. The caustic exit induces a narrow “blue ridge”corresponding to the negative-deviation spike seen in Fig 3.1. Finally, the source runsalong the long “brown ridge” corresponding to the prolonged post-peak mild excessseen in Fig 3.1.
105
Fig. 3.4.— Contours of ∆χ2 = 1, 4, 9 relative to the minimum as a function of planet-star mass ratio q and projected planet-star separation d (top), as well as “short causticdiameter” (see Fig. 3.3) w (bottom). w (in units of θE) is a function of q and d (seetext). The solution shown here corresponds to q = (2.5+0.5
−0.3)×10−3 and d = 2.9±0.2 (ord = 0.34+0.03
−0.02). These values of d correspond to physical separations and equilibriumtemperatures of ∼ 5.3 − 9.7 AU, ∼ 34 K and ∼ 0.6 − 1.1 AU, ∼ 103 K for the closeand wide solutions, respectively.
106
(V−I) [instrumental]
I [in
stru
men
tal]
−.6 −.4 −.2 0 .2
22
20
18
16
Fig. 3.5.— Instrumental color-magnitude diagram of field containing MOA-2007-BLG-400. The color and magnitude of the source (black filled circle) are derived fromthe fit to the light curve, which also yields an upper limit for the I-band blendedflux (green triangle). The large error bar on the latter point indicates a completelack of information about its V -band flux. The clump centroid is indicated in redsquare. From the source-clump offset, we estimate [I, (V − I)]0,s = (17.57, 0.81),implying it has angular radius θ∗ = 1.05 µas. Assuming the source lies at 8 kpc, ithas [MI , (V − I)]0,s = (3.07, 0.81), making it a subgiant. The lack of blended lightallows us to place an upper limit on the lens flux, which implies that it has massM < 0.75 M⊙. See text.
107
Fig. 3.6.— Bayesian relative probability densities for the physical properties of theplanet MOA-2007-BLG-400Lb and its host star. (a) Mass of the host star. (b) Planetsemimajor axis. (c) Distance to the planet/star system. (d) Equilibrium temperatureof the planet. (e) I-band magnitude of the host star. (f) H-band magnitude of thehost star. In panel (a), we also show the probability density for the planet mass, whichis essentially a rescaling of that of the host star, because the mass ratio is measuredso precisely q = (2.5+0.5
−0.3) × 10−3. In all panels, the solid vertical lines show themedians, and the 68.3% and 95.4% confidence intervals are enclosed in the dark andlight shaded regions, respectively. In panel (b) and (d), the probability distributionsfor wide and close degenerate solutions are computed separately, and then the closesolution is weighted by exp(−∆χ2/2), where ∆χ2 = 0.2 is the difference betweenthem. These distributions are derived assuming priors obtained from standard modelsof the mass, velocity, and density distributions of stars in the Galactic bulge and disk,and include constraints from the measurements of lens angular Einstein radius θE andthe timescale of the event tE, as well as limits on the I-band magnitude of the lens.In practice, only the measurements of θE and tE provide interesting constraints onthese distributions.
108
B) BASELINE A) PEAKPEAK
D) BASELINE - SOURCE C) PEAK - BASELINE
Fig. 3.7.— Constraining the blended flux from CTIO I-band images. A good-seeingbaseline image B (upper right) is subtracted from an image taken at magnification245 (A, upper left), and the resulting image C is shown in the bottom-left panel.The white circles on images A and C indicate the source positions. Then C is scaledby 1/244 in flux and subtracted from image B to generate image D, which has thesource contribution removed from the baseline image. As described in the text, we fitPSF to the stars close to the source, and subtract them from images B and D. Aftersubtraction, the flux sums in a 1.8” square (shown as black boxes on right two panels)are 564 ADU and -28 ADU, respectively. Non-detection of the blend constrains theblend-source flux ratio to be less than 5%.
109
Model1 t0 − tref2 u0 tE d q α3 ρ
day day deg
Close 0.08107 0.00025 14.41 0.34 0.0026 227.06 0.00326
Wide 0.08106 0.00027 14.33 2.87 0.0025 226.99 0.00329
1The wide solution is preferred over best-fit single-lens model by ∆χ2 =
1070.04 and the close solution by ∆χ2 = 1069.84
2tref = HJD 2454354.5 (Sep 11, 2007, 00:00 UT).
3The geometry of the source trajectory is visualized in Figure 3.3, in
which the planet is to the right of the lens star. t0, u0, and α are defined
with respect to the “center of magnification”, which is the center of mass
of the star/planet system for the close model and q/(1 + q)/d away from
the position of the lens star toward the direction of the planet for the wide
model.
Table 3.1. MOA-2007-BLG-400 Best-fit Planetary Models
110
Chapter 4
OGLE-2005-BLG-071Lb, the Most Massive
M-Dwarf Planetary Companion?
4.1. Introduction
Microlensing provides a powerful method to detect extrasolar planets. Although
only six microlens planets have been found to date (Bond et al. 2004; Udalski et al.
2005; Beaulieu et al. 2006; Gould et al. 2006; Gaudi et al. 2008; Dong et al. 2009),
these include two major discoveries. First, two of the planets are “cold Neptunes”,
a high discovery rate in this previously inaccessible region of parameter space,
suggesting this new class of extrasolar planets is common (Gould et al. 2006; Kubas
et al. 2008). Second, the discovery of the first Jupiter/Saturn analog via a very
high-magnification event with substantial sensitivity to multiple planets indicates
that solar system analogs may be prevalent among planetary systems (Gaudi et al.
2008). Recent improvements in search techniques and future major upgrades should
increase the discovery rate of microlensing planets substantially (Gaudi 2008).
111
Routine analysis of planetary microlensing light curves yields the planet/star
mass ratio q and the planet-star projected separation d (in units of the angular
Einstein radius). However, because the lens-star mass M cannot be simply extracted
from the light curve, the planet mass Mp = qM remains, in general, equally
uncertain.
The problem of constraining the lens mass M is an old one. When microlensing
experiments were initiated in the early 1990s, it was generally assumed that
individual mass measurements would be impossible and that only statistical
estimates of the lens mass scale could be recovered. However, Gould (1992) pointed
out that the mass and lens-source relative parallax, πrel ≡ πl − πs, are simply related
to two observable parameters, the angular Einstein radius, θE, and the Einstein
radius projected onto the plane of the observer, rE,
M =θE
κπE
, πrel = θEπE. (4.1)
Here, πE = AU/rE is the “microlens parallax” and κ ≡ 4G/(c2 AU) ∼ 8.1 mas/M⊙.
See Gould (2000b) for an illustrated derivation of these relations.
In principle, θE can be measured by comparing some structure in the light curve
to a “standard angular ruler” on the sky. The best example is light-curve distortions
due to the finite angular radius of the source θ∗ (Gould 1994a), which usually can be
estimated very well from its color and apparent magnitude (Yoo et al. 2004a). While
112
such finite-source effects are rare for microlensing events considered as a whole, they
are quite common for planetary events. The reason is simply that the planetary
distortions of the light curve are typically of similar or smaller scale than θ∗. In fact,
all six planetary events discovered to date show such effects. Combining θE with the
(routinely measurable) Einstein radius crossing time tE yields the relative proper
motion µ in the geocentric frame,
µgeo =θE
tE, (4.2)
From equation (4.1), measurement of θE by itself fixes the product Mπrel = θ2E/κ.
Using priors on the distribution of lens-source relative parallaxes, one can then make
a statistical estimate of the lens mass M and so the planet mass Mp.
To do better, one must develop an additional constraint. This could be
measurement of the microlens parallax πE, but this is typically possible only for long
events. Another possibility is direct detection of the lens, either under the “glare” of
the source during and immediately after the event, or displaced from the source well
after the event is over. Bennett et al. (2006) used the latter technique to constrain the
mass of the first microlensing planet, OGLE-2003-BLG-235/MOA-2003-BLG-53Lb.
They obtained Hubble Space Telescope (HST) Advanced Camera for Surveys (ACS)
images in B, V , and I at an epoch ∆t = 1.78 years after the event. They found
astrometric offsets of the (still overlapping) lens and source light among these images
of up to 0.7 mas. Knowing the lens-source angular separation ∆θ = µ∆t from the
113
already determined values of θE and tE, they were able to use these centroid offsets to
fix the color and magnitude of the lens and so (assuming that it was a main-sequence
star) its mass.
While the planet mass is usually considered to be the most important
parameter that is not routinely derivable from the light curve, the same degeneracy
impacts two other quantities as well, the distance and the transverse velocity of
the lens. Knowledge of these quantities could help constrain the nature of the
lens, that is, whether it belongs to bulge, the foreground disk, or possibly the
thick disk or even the stellar halo. Since microlensing is the only method currently
capable of detecting planets in populations well beyond the solar neighborhood,
extracting such information would be quite useful. Because the mass, distance,
and transverse velocity are all affected by a common degeneracy, constraints on
one quantity are simultaneously constraints on the others. As mentioned above,
simultaneous measurements of θE and πE directly yield the mass. However,
clearly from equation (4.1) they also yield the distance, and hence (from eq. [4.2])
also the transverse velocity. Here, we assemble all available data to constrain
the mass, distance and transverse velocity of the second microlensing planet,
OGLE-2005-BLG-071Lb, whose discovery we previously reported (Udalski et al.
2005, hereafter Paper I).
114
4.2. Overview of Data and Types of Constraints
The light curve consists of 1398 data points from 9 ground-based observatories
(see Fig. 2.1), plus two epochs of HST ACS data in the F814W (I) and F555W
(V ) filters. The primary ground-based addition relative to Paper I is late-time data
from OGLE, which continued to monitor the event down to baseline until HJD
= 2453790.9.
These data potentially provide constraints on several parameters in addition to
those reported in Paper I. First, the light curve shows a clear asymmetry between
the rising and falling parts of the light curve, which is a natural result of microlens
parallax due to the Earth’s accelerated motion around the Sun (see the best-fit
model without parallax effects plotted in dotted line in Fig. 2.1). However, it
is important to keep in mind that such distortions are equally well produced by
“xallarap” due to accelerated motion of the source around a companion. Poindexter
et al. (2005) showed that it can be difficult to distinguish between the two when, as
in the present case, the effect is detected at ∆χ2 <∼ 100.
Second, the two pronounced peaks of the light curve, which are due to “cusp
approaches” (see the bottom inset of Fig. 2.1), are relatively sharp and have good
coverage. These peaks would tend to be “rounded out” by finite-source effects, so
in principle it may be possible to measure ρ (i.e., θ∗ in the units of θE) from these
distortions.
115
Third, the orbital motion of the planet can give rise to two effects: rotation
of the caustic about the center of the mass and distortion of the caustic due to
expansion/contraction of the planet-star axis. The first changes the orientation of
the caustic structure as the event evolves while the second changes its shape. These
effects are expected to be quite subtle because the orbital period is expected to be
of an order of 10 years while the source probes the caustic structure for only about 4
days. Nevertheless, they can be very important for the interpretation of the event.
Finally, the HST data cover two epochs, one at 23 May 2005 (indicated by the
arrow in Fig. 2.1) when the magnification was about A = 2 and the other at 21 Feb
2006 when the event was very nearly at baseline, A ∼ 1. These data could potentially
yield four types of information. First, they can effectively determine whether the
blended light is “associated” with the event or not. The blended light is composed
of sources in the same photometric aperture as the magnified source, but that do
not become magnified during the event. If this light is due to the lens, a companion
to the lens, or a companion to the source, it should fall well within the ACS point
spread function (PSF) of the source. On the other hand, if it is due to a random
interloper along the line of sight, then it should be separately resolved by the ACS
or at least give rise to a distorted PSF. Second, the HST data can greatly improve
the estimate of the color of the blended light. The original model determined the
source fluxes in both OGLE V and I very well, and of course the baseline fluxes
are also quite well determined. So it would seem that the blended fluxes, which are
116
the differences between these two, would also be well determined. This proves to be
the case in the I band. However, while the source flux is derivable solely from flux
differences over the light curve (and so is well determined from OGLE difference
image analysis – DIA – Wozniak 2000), the baseline flux depends critically on the
zero point of PSF-fitting photometry, whose accuracy is fundamentally limited in
very crowded bulge fields. The small zero-point errors turn out to have no practical
impact for the relatively bright I background light, but are important for the V
band. Third, one might hope to measure a centroid shift between the two colors in
the manner of Bennett et al. (2006). Last, one can derive the source proper motion
µs from HST data (at least relative to the mean motion of bulge stars). This is
important, because the event itself yields the source-lens relative proper motion,
µgeo. Hence, precise determination of µl requires knowledge of two proper-motion
differences, first the heliocentric proper motion
µhel = µl − µs, (4.3)
and second, the offset between the heliocentric and geocentric proper motions
µhel − µgeo =v⊕πrel
AU. (4.4)
Here, v⊕ is the velocity of the Earth relative to the Sun at the time of peak
magnification t0. Note that, if the lens-source relative parallax πrel is known, even
117
approximately, then the latter difference can be determined quite well, since its total
magnitude is just 0.6 mas yr−1(πrel/0.17 mas).
4.3. Constraining the Physical Properties of the Lens
and its Planetary Companion
In principle, all the effects summarized in § 4.2 could interact with each
other and with the parameters previously determined, leading potentially to a very
complex analysis. In fact, we will show that most effects can be treated as isolated
from one another, which greatly facilitates the exposition. In the following sections,
we will discuss the higher order microlens effects in the order of their impact on
the ground-based light curve, starting with the strongest, that is, parallax effects
(§ 4.3.1), followed by planetary orbital motion (§ 4.3.2) and finally the weakest,
finite-source effects (§ 4.3.3). To study these effects, we implement Markov chain
Monte Carlo (MCMC) with an adaptive step-size Gaussian sampler (Doran &
Mueller 2003) to perform the model fitting and obtain the uncertainties of the
parameters. The HST astrometry is consistent with no (V − I) color-dependent
centroid shift in the first epoch, while such a shift is seen in the second epoch
observations (§ 4.3.4). In addition, the PSF of the source shows no sign of broadening
due to the blend, suggesting that the blend is associated with the event (§ 4.3.5).
Therefore, the HST observations provide good evidence that the blend is likely due
118
to the lens. In § 4.3.4, it is shown, under such an assumption, how the astrometry
can be used in conjunction with finite-source and microlens parallax measurements
to constrain the angular Einstein radius and proper motion (§ 4.3.4). In § 4.3.5,
we discuss using HST photometric constraints in the form of χ2 penalties to the
MCMC runs to extract the color and brightness of the blend. The results of these
runs, which include all higher order effects of the ground-based light curve and the
HST photometric constraints, are summarized as “MCMC A” in Table 4.1 and
4.2. Subsequently in § 4.3.6, by making the assumption that the blended light seen
by HST is due to the lens, we combine all constraints discussed above to obtain
physical parameters of the lens star and its planet. The corresponding best-fit model
parameters are reported as “MCMC B” in Table 4.3 and 4.4. The results for the
physical parameters from these runs are given in Table 4.5. Finally we discuss some
caveats in the analysis in § 4.3.7 and § 4.3.8.
4.3.1. Microlens Parallax Effects
A point-source static binary-lens model has 6 “geometric-model” parameters:
three “single-lens” parameters (t0, u0, tE), where we define the time of “peak”
magnification (actually lens-source closest approach) t0 and the impact parameter u0
with respect to the center of mass of the planet-star systems; and three “binary-lens”
parameters (q, d, α), where α is the angle between the star-planet axis and the
trajectory of the source relative to the lens. In addition, flux parameters are included
119
to account for light coming from the source star (Fs) and the blend (Fb) for each
dataset. In this paper, we extend the fitting by including microlens parallax, orbital
motion and finite-source effects. Paper I reported that, within the context of the
point-source static binary-lens models, the best-fit wide-binary (d > 1) solution is
preferred by ∆χ2 = 22 over the close-binary (d < 1) solution. Remarkably, when we
take account of parallax, finite-source and orbital effects, this advantage is no longer
as significant. We discuss the wide/close degeneracy with more detail in § 4.3.6.
The microlens parallax effects are parametrized by πE,E and πE,N, following the
geocentric parallax formalism by An et al. (2002) and Gould (2004). To properly
model the parallax effects, we characterize the “constant acceleration degeneracy”
(Smith et al. 2003) by probing models with u0 → −u0 and α → −α. We find that
all other parameters remain essentially unchanged under this form of degeneracy.
In the following sections, if not otherwise specified, parameters from models with
positive u0 are adopted.
As shown in Figure 4.2, microlens parallax is firmly detected in this event at
> 8σ level. Not surprisingly, the error ellipse of πE is elongated toward πE,⊥, i.e., the
direction perpendicular to the position of the Sun at the peak of event, projected
onto the plane of the sky (Gould et al. 1994; Poindexter et al. 2005). As a result,
πE,E is much better determined than πE,N,
πE,E = −0.26 ± 0.05, πE,N = −0.30+0.24−0.28 (wide), (4.5)
120
πE,E = −0.27 ± 0.05, πE,N = −0.36+0.24−0.27 (close). (4.6)
Xallarap (light-curve distortion from reflex motion of the source due to a binary
companion) could provide an alternate explanation of the detected parallax signals.
In § 4.3.8, we find that the best-fit xallarap parameters are consistent with those
derived from the Earth’s orbit, a result that favors the parallax interpretation.
4.3.2. Fitting Planetary Orbital Motion
To model orbital motion, we adopt the simplest possible model, with uniform
expansion rate b in binary separation b and uniform binary rotation rate ω. Because
orbital effects are operative only for about 4 days, while the orbital period is of
order 10 years, this is certainly adequate. Interestingly, the orbital motion is more
strongly detected for the close solutions (at >∼ 5.5σ level) than the wide solutions (at
∼ 3σ level), and as a result, it significantly lessens the previous preference of the
wide solution that was found before orbital motion was taken into account. Further
discussions on planetary orbital motion are given in § 4.3.6.
121
4.3.3. Finite-source effects and Other Constraints on
θE
Color-Magnitude Diagram
We follow the standard procedure to derive dereddened source color and
magnitude from the color-magnitude diagram (CMD) of the observed field.
Figure 4.3 shows the calibrated OGLE CMD (black), with the baseline source
being displayed as a green point. The V − I color of the source can be determined
in a model-independent way from linear regression of the I-band and V -band
observations. The I-band magnitude of the source is also precisely determined from
the microlens model, and it is hardly affected by any higher order effects. The center
of red clump (red) is at (V − I, I)clump = (1.89, 15.67). The Galactic coordinates of
the source are at (l, b) = (355.58,−3.79). Because the Galactic bulge is a bar-like
structure that is inclined relative to the plane of the sky, the red clump density
at this sky position peaks behind the Galactic center by 0.15 mag (Nishiyama
2005). Hence, we derive (V − I, I)0,clump = (1.00, 14.47), by adopting a Galactic
distance R0 = 8 kpc. We thereby obtain the selective and total extinction toward
the source [E(V − I), AI ] = (0.89, 1.20) and thus RV I = AV /E(V − I) = 2.35. The
dereddened color and magnitude of the source is ((V − I), I)s,0 = (0.45, 18.31). From
its dereddened color (V − I)0 = 0.45, as well as its absolute magnitude (assuming it
is in the bulge) MI ∼ 3.65, we conclude that the source is a main-sequence turnoff
122
star. Following the method of Yoo et al. (2004a), we transform (V − I)0 = 0.45 to
(V −K)0 = 0.93 (Bessell & Brett 1988), and based on the empirical relation between
the color and surface brightness for subgiant and main-sequence stars (Kervella et
al. 2004), we obtain the angular size of the source
θ∗ = 0.52 × 100.2(19.51−Is) ± 0.05 µas, (4.7)
where Is is the apparent magnitude of the source in the I band. Other features on
the CMD shown in Figure 4.3 are further discussed in § 4.3.5.
Photometric Systematics of the Auckland Data set
The Auckland data set’s excellent coverage over the two peaks makes it
particularly useful for probing the finite-source effects. Unlike the more drastic
“caustics crossings” that occur in some events, the finite-source effects during “cusp
approaches” are relatively subtle. Hence one must ensure that the photometry is not
affected by systematics at the few percent level when determining ρ = θ∗/θE. The
Auckland photometry potentially suffers from two major systematic effects.
First, the photometry of constant stars reduced by µFUN’s DoPHOT pipeline
are found to show sudden “jumps” of up to ∼ 10% when the field crossed the
meridian each night. The signs and amplitudes of the “jumps” depend on the
stars’ positions on the CCD. The Auckland telescope was on a German equatorial
mount, and hence the camera underwent a meridian flip. Due to scattered light,
123
the flat-fielded images were not uniform in illumination for point sources, an effect
that can be corrected by making “superflats” with photometry of constant stars
(Manfroid 1995). We have constructed such “superflats” for each night of Auckland
observations using 71 bright isolated comparison stars across the frame. The
DoPHOT instrumental magnitude mi,j for star i on frame j is modeled by the
following equation:
mi,j = m0,i − f(xi,j, yi,j) − Zj − fwhmi,j × si (4.8)
where m0,i is the corrected magnitude for star i, f(x, y) is a biquartic illumination
correction as a function of the (x, y) position on the CCD frame with 14 parameters,
Zj is a zero-point parameter associated with each frame (but with Z1 set to be zero),
and si is a linear correlation coefficient for the seeing fwhmi,j. A least-squares fit
that recursively rejects 4-σ outliers is performed to minimize χ2. The best-fit f(x, y)
is dominated by the linear terms and has small quadratic terms, while its cubic and
quartic terms are negligible. The resulting reduced χ2 is close to unity, and the
“jumps” for all stars are effectively eliminated. We apply the biquartic corrections
to the images and then reduce the corrected images using the DIA pipeline. The
resulting DIA photometry of the microlens target is essentially identical (at the
∼ 1% level) to that from the DIA reductions of the original Auckland images.
As we now show, this is because DIA photometry automatically removes any
artifacts produced by the first- and second-order illumination distortions if the
124
sources are basically uniformly distributed across the frame. For the first-order
effect, a meridian flip about the target (which is very close to center of the frame)
will induce a change in the flux from the source, but it will also induce a change in
the mean flux from all other stars in the frame, which for a linear correction will
be the same as the change in position of the “center of light” of the frame light.
If the frame sources are uniformly distributed over the frame, the “center of light”
will be the center of the frame, which is the same position as the source, therefore
introducing no effects. The second-order transformation is even under a rotation of
180 degrees, whereas a meridian flip is odd under this transformation. Hence the flip
has no effects at second order.
Second, the Auckland observations were unfiltered. The amount of atmospheric
extinction differs for stars with different colors. As shown in Figure 4.3, the source is
much bluer than most of the bright stars in the field, which dominate the reference
image. So the amount of extinction for the source is different from the average
extinction over the whole frame. This difference varies as the airmass changes over
time during the observations. Coincidentally, the times of the two peaks were both
near maximum airmass when the “differential extinction” effect is expected to be the
most severe. To investigate this effect, we match the isolated stars in the Auckland
frame with CTIO I and V photometry. We identify 33 bright, reasonably isolated
stars with |(V − I) − (V − I)s| < 0.25. We obtain a “light curve” for each of these
stars, using exactly the same DIA procedure as for the source. We measure the
125
mean magnitude of each of the 33 light curves and subtract this value from each of
the 508 points on each light curve, thereby obtaining residuals that are presumably
due primarily to airmass variation. For each of the 508 epochs, we then take the
mean of all of these residuals. We recursively remove outliers until all the remaining
points are within 3σ of the mean, as defined by the scatter of the remaining points.
Typically, 1 or 2 of the 33 points are removed as outliers. The deviations are well
fitted by a straight line,
dMag
dZ= 0.0347 ± 0.0016 (4.9)
where Z is airmass. The sense of the effect is that stars with the color of the
microlensed source are systematically fainter at high airmass, as expected. (We also
tried fitting the data to a parabola rather than a line, but the additional [quadratic]
parameter was detected at substantially below 1σ.) Finally, we apply these
“differential extinction” corrections to the “superflat”-adjusted DIA photometry to
remove both photometric systematics.
In general, the finite-source effects depend on the limb-darkening profile of the
source star in the observed passbands. We find below that in this case, the impact
proves to be extremely weak. Nevertheless, using the matched Auckland and CTIO
stars, we study the difference between Auckland magnitudes and I-band magnitude
as a function of the V − I color. We find the Auckland clear filter is close to the R
band.
126
Blending in Palomar and MDM Data
Palomar data cover only about 80 minutes, but these include the cresting of
the second peak, from which we derive essentially all information about finite-source
effects. The Palomar data are sensitive to these effects through their curvature.
The curvature derived from the raw data can be arbitrarily augmented in the fit
(and therefore the finite-source effects arbitrarily suppressed) by increasing the
blending. In general, the blending at any observatory is constrained by observations
at substantially different magnifications, typically on different nights. However, no
such constraints are available for Palomar, since observations were carried out on
only one night.
We therefore set the Palomar blending fb = 0.2 fs, that is, similar to the OGLE
blending. That is, we assume that the observed flux variation of 9%, over the Palomar
night, actually reflects a magnification variation of 9%/[1 − fb/Afs] = 9% + 0.026%,
where A ∼ 70 is the approximate magnification on that night. If our estimate of the
blending were in error by of order unity (i.e. either fb = 0 or fb = 0.4 fs), then the
implied error in the magnification difference would be 0.026%, which is more than an
order of magnitude below the measurement errors. Hence, the assumption of fixed
blending does not introduce “spurious information” into the fit even at the 1 σ level.
MDM data cover the second peak for only ∼ 18 minutes. For consistency, we treat
127
its blending in the same way as Palomar, although the practical impact of this data
set is an order of magnitude smaller.
Modeling the Finite-Source Effects
After careful tests that are described immediately below, we determined that
all finite-source calculations can be carried out to an accuracy of 10−4 using the
hexadecapole approximation of Gould (2008) (see also Pejcha & Heyrovsky 2008).
This sped up calculations by several orders of magnitude. We began by conducting
MCMC simulations using the “loop linking” finite-source code described in Appendix
A of Dong et al. (2006). From these simulations, we found the 4.5 σ upper bound
on the finite-source parameter ρ(4.5 σ) = 0.001. We then examined the differences
between loop-linking (set at ultra-high precision) and hexadecapole for light curves at
this extreme limit and found a maximum difference of 10−4. Based on Claret (2000),
we adopt linear limb-darkening coefficients ΓI = 0.35 for the I-band observations and
ΓR = 0.43 for the observations performed in the R-band and the clear filters, where
the local surface brightness is given by S(θ) ∝ 1 − Γ[1 − 1.5(1 − θ2/θ2∗)
1/2]. Ten
additional MCMC runs are performed with ΓI and ΓR that differ from the above
values by 0.1 or 0.2. They result in essentially the same probability distributions of
ρ. Therefore, the choice of limb-darkening parameters has no effect on the results.
The source size is found to be ρ = 3.9+1.8−2.7 for the wide solution and ρ = 3.1+1.7
−2.5 for the
close solution. Solutions with ρ > 0.0009 are ruled out at more than 3σ. The angular
128
Einstein radius is given by θE = θ∗/ρ. Hence, the lack of pronounced finite-source
effects yields a 3σ lower limit: θE > 0.6 mas. The lens-source relative proper
motion in the geocentric frame is simply µgeo = θE/tE. The posterior probability
distributions of µgeo derived from these MCMC simulations are compared with those
derived from astrometry in § 4.3.4.
4.3.4. HST Astrometry
HST observations were taken at two epochs (HJD = 2453513.6 and
HJD = 2453788.2) with the ACS High Resolution Camera (HRC). For each epoch,
4 dithered images were acquired in each of F814W and F555W with individual
exposure times of 225s and 315s respectively. The position of the microlens on the
HST frame is in excellent agreement with its centroid on the OGLE difference image
(within ∼ 0′′.01). The closest star to the source is about 0′′.6 away. This implies
that the OGLE photometry of the target star does not contain additional blended
light that would be identifiable from the HST images. Data analysis was carried
out using the software program img2xym HRC (Anderson & King 2004) in a manner
similar to that described in Bennett et al. (2006). Stars are fitted with an empirical
“library” PSF that was derived from well-populated globular cluster fields. These
positions are then corrected with precise distortion-correction models (accurate to
∼ 0.01 pixel). We adopted the first F555W frame of the first epoch as the reference
frame, and used the measured positions of stars in this frame and the frame of each
129
exposure to define a linear transformation between the exposure frame and the
reference frame. This allowed us to transform the position of the target star in each
exposure into the reference frame, so that we see how the target star had moved
relative to the other stars. The centroid positions of the target star in each filter
and epoch are shown in Figure 4.4. For convenience, in this figure, the positions are
displayed relative to the average of the centroid positions. The error bars are derived
from the internal scatter of the four dithered images. The probability is P = 38%
of measuring the observed separation (or larger) between F814W and F555W under
the assumption that the true offset is zero. The fact that the blended light is aligned
with the source argues that it is associated with the event (either it is the lens itself
or a companion to the lens or the source). We give a more quantitative statement
of this constraint in § 4.3.5. For the present we simply note that the P = 38%
probability is compatible with the picture that the blend is due to the lens since the
first epoch was only about half of the Einstein-radius crossing time after t0, implying
that the lens-source separation induces only a very small centroid offset, well below
the HST detection limit. For the second epoch, the centroid offset is,
∆rF814W−F555W,East = −0.52 ± 0.20 mas,
∆rF814W−F555W,North = 0.22 ± 0.20 mas. (4.10)
130
We also calculate the error in the centroid offsets from the scatter in such
offsets among all comparison stars with F555W magnitudes within 0.5 mag of the
target and find that it is consistent with the internally-based error quoted above.
At the peak of the event, the angular separation between the lens and the
source was negligible, since u0 ≪ 1. We therefore fix the angular positions of the
lens and source at a common θ0. From the CMD (Fig. 4.3), most of the stars in the
HST field are from the bulge. So we set a reference frame that is fixed with respect
to the bulge field at distance Ds. The source and lens positions at time t are then,
θs(t) = θ0 + µs(t − t0),
θl(t) = θ0 + µl(t − t0) + πrel[s(t) − s(t0)], (4.11)
where s(t) is the Earth-to-Sun vector defined by Gould (2004). Then by applying
equations (4.3) and (4.4), the angular separation between the lens and source is,
θrel(t) = θl(t) − θs(t) = µgeo(t − t0) + πrel∆s(t) (4.12)
where ∆s(t) is given by eq. (5) in Gould (2004).
The centroid of the source images θ′s is displaced from the source position by
(Walker 1995),
131
∆θs(t) = θ′
s(t) − θs(t) =
−θrel(t)
[θrel(t)/θE]2 + 2(4.13)
Therefore, one can obtain the centroid position of the lens and the source at
time t,
θc(t) = [1 − fl(t)][θs(t) + ∆θs(t)] + fl(t)θl(t)
= θ0 + µs(t − t0) + θrel(t)[fl(t) +1 − fl(t)
[θrel(t)/θE]2 + 2] (4.14)
where fl(t) is the fraction of the total flux due to the lens.
The centroid offset between the two passbands, F814W and F555W, is related
to the properties of the system by,
∆θc(t)F814W−F555W = [f(t)l,F814W−f(t)l,F555W]×[1− 1
[θrel(t)/θE]2 + 2]θrel(t).(4.15)
The difference of the blend’s fractional flux between F814W and F555W is obtained
from “MCMC A” described in § 4.3.5. Consequently, under the assumption that the
blend is the lens, we can use the measurement of the second-epoch HST centroid
offset to estimate the relative proper motion from equation (4.15) for a given πrel.
For purposes of illustration, we temporarily adopt πrel = 0.2 when calculating the
probability distribution of µgeo (black contours in the upper panels of Fig. 4.5). The
132
centroid shift generally favors faster relative proper motion than that derived from
the source size measurement (green contours in Fig. 4.5), but the difference is only
at the ∼ 1σ level. We then get a joint probability distribution of µgeo from both
finite-source effects and astrometry, which is shown as the red contours in the upper
panels of Figure 4.5.
We then derive the distribution of the µgeo position angle φµgeo
(North through
East), which is shown by the red histograms in the lower panels of Figure 4.5. Since
the direction of the lens-source relative proper motion µgeo is the same as that of
the microlens parallax πE in the geocentric frame, we have an independent check
on the φµgeo
from our parallax measurements, whose distribution is plotted as blue
histograms in Figure 4.5. Both constraints favor the lens-source proper motion to
be generally West, but they disagree in the North-South component for which both
constraints are weaker. The disagreements between two histograms is at about 2.5σ
level.
4.3.5. “Seeing” the Blend with HST
If the blend were not the lens (or otherwise associated with the event), the
PSF of the source would likely be broadened by the blended star. We examine the
HST F814W images of the target and 45 nearby stars with similar brightness for
each available exposure. We fit them with the library PSF produced by Anderson
133
& King (2004). In order to account for breathing-related changes of focus, we fit
each of these 45 nearby stars with the library PSF, and construct a residual PSF
that can be added to the library PSF to produce a PSF that is tailor-made for
each exposure. For both epochs, the source-blend combination shows no detectable
broadening relative to the PSFs of other isolated stars in the field. From the
ground-based light curve, it is already known that ∼ 16% of this light comes from
the blend. We add simulated stars with the same flux as the blended light from 0
to 2.0 pixels away from the center of the source. We find that the blend would have
produced detectable broadening of the PSF if it were more than 15 mas apart from
the source at the second epoch. Hence, the source-blend separation must then be
less than about 15 mas. From the HST image itself, the density of ambient stars
at similar magnitudes is ∼< 1 arcsec−2. The probability of a chance interloper is
therefore < 0.07%, implying that the blended light is almost certainly associated
with the event, i.e., either the lens itself, a companion to the lens, or a companion
to the source. Both of the latter options are further constrained in § 4.3.7 where, in
particular, we essentially rule out the lens-companion scenario.
As discussed in § 4.2, the blended flux in I is relatively well determined from
the ground-based OGLE data alone, but the blended V flux is poorly determined,
primarily because the systematic uncertainty in the zero point of the baseline flux
(determined from PSF fitting) is of the same order as the blended flux. Because the
HST image is very sparse, there is essentially no zero-point error in the HST V -band
134
flux. The problem is how to divide the baseline V flux into source and blend fluxes,
Fbase = Fs + Fb.
The standard method of doing this decomposition would be to incorporate the
HST V light curve into the overall fit, which would automatically yield the required
decomposition. Since this “light curve” consists of two points, the “fit” can be
expressed analytically
Fs =F (t1) − F (t2)
A1 − 1, Fb = F (t2) − Fs. (4.16)
where we have made the approximation that the second observation is at
baseline. Let us then estimate the resulting errors in Fs and Fb, ignoring for the
moment that there is some uncertainty in A1 due to uncertainties in the general
model. Each of the individual flux measurement is determined from 4 separate
subexposures, and this permits estimates of the errors from the respective scatters.
These are σ1 = 0.01 and σ2 = 0.03 mag. Hence, the fractional error in Fs is
(2.5/ ln 10)σ(Fs)/Fs = [σ21(A1 + r)2 + σ2
2(1 + r)2]1/2/(A1 − 1), where r ≡ Fb/Fs.
Adopting, for purposes of illustration, A1 = 2 and r = 0.1, this implies an error
σ(Vs,HST ) of 0.04 mag. This may not seem very large, but after the subtraction
in equation (4.16), it implies an error σ(Vb,HST ) ∼ σ(Vs,HST )/r ∼ 0.4 mag. And
taking into account of the uncertainties introduced by model fitting in determining
the magnifications, the error is expected to be even larger. Hence, we undertake an
alternate approach.
135
Because the HST and OGLE V filters have very nearly the same wavelength
center, Vs,HST should be nearly identical to Vs,OGLE up to a possible zero-point
offset on their respective magnitude scales. Because the OGLE data contain many
more points during the event, some at much higher magnification than the single
HST event point, Vs,OGLE is determined extremely well (for fixed microlensing
model), much better than the 0.04 mag error for Vs,HST . Thus, if the zero-point
offset between the two systems can be determined to better than 0.04 mag, this
method will be superior. Although the I-band blend is much better measured than
the V -band blend from the ground-based data, for consistency we determine the
zero-point offset in I by the same procedure.
Figure 4.6 shows differences between OGLE and HST V magnitudes for
matched stars in the HST image. The error for each star and observatory is
determined from the scatter among measurements of that star. We consider only
points with V < 19.5 because at fainter magnitudes the scatter grows considerably.
Each star was inspected on the HST images, and those that would be significantly
blended on the OGLE image were eliminated. The remaining points are fit to an
average offset by adding a “cosmic error” in quadrature to the errors shown. We
carry out this calculation twice, once including the “outlier” (shown as a filled
circle) and once with this object excluded. For the V band, we find offsets of
136
VHST − VOGLE = 0.17 ± 0.01 and 0.18 ± 0.01, respectively. We adopt the following
the V -band offset
∆V = VHST − VOGLE = 0.18 ± 0.01. (4.17)
A similar analysis of the I band leads to
∆I = IHST − IOGLE = 0.08 ± 0.01. (4.18)
We find no obvious color terms for either the V -band or I-band transformations. As
a check, we perform linear regression to compare the OGLE and HST (V − I) colors,
and we find they agree within 0.01 mag, which further confirms the color terms are
unlikely to be significant in the above transformations.
We proceed as follows to make HST-based MCMC (“MCMC A”) estimates of
Vb,OGLE and Ib,OGLE that place the blending star on the OGLE-based CMD. Since
flux parameters are linear, they are often left free and fitted by linear least-squares
minimization, which significantly accelerates the computations. However, for
“MCMC A”, the source fluxes from OGLE and HST are treated as independent
MCMC parameters so that they can help align the two photometric systems as
described below. Since HST blended light is not affected by light from ambient
stars (as OGLE is), we also leave HST blended fluxes as independent. Therefore, in
“MCMC A”, we include the following independent MCMC flux parameters, FI,s,OGLE,
FV,s,OGLE, FI,s,HST , FV,s,HST , FI,b,HST , and FV,b,HST , which for convenience we express
137
here as magnitudes. For each model on the chain, we add to the light-curve
based χ2 two additional terms ∆χ2V = (Vs,HST − Vs,OGLE − ∆V )2/[σ(∆V )]2 and
∆χ2I = (Is,HST − Is,OGLE −∆I)2/[σ(∆I)]2 to enforce the measured offset between the
two systems. Finally, we evaluate the V -band blended flux from HST and convert
it to OGLE system, Vb,OGLE/HST = Vs,OGLE − Vs,HST + Vb,HST (and similarly for I
band), where all three terms on the rhs are the individual Monte Carlo realizations
of the respective parameters.
The result is shown in Figure 4.3, in which the blend (magenta) is placed on
the OGLE CMD. Also shown, in cyan points, is HST photometry (aligned to the
OGLE system) of the stars in the ACS subfield of the OGLE field. Although this
field is much smaller, its stars trace the main sequence to much fainter magnitudes.
The blend falls well within the bulge main sequence revealed by the HST stars on
the CMD, so naively the blend can be interpreted as being in the bulge. Hence,
this diagram is, in itself, most simply explained by the blend being a bulge lens or
a binary companion of the source. However, the measurement of V − I color has
relatively large uncertainty, and it is also consistent with the blend being the lens (or
a companion to it) several kpc in front of the bulge, provided the blend is somewhat
redder than indicated by the best-fit value of its color. In the following section, we
assume the blended light seen by HST is the foreground lens star, and the HST
photometry is combined with other information to put constraints on the lens star
under this assumption.
138
4.3.6. Final Physical Constraints on the Lens and
Planet
Constraints on a Luminous Lens
In the foregoing, we have discussed two types of constraints on the host star
properties: the first class of constraints, consisting of independent measurements of
πE, θE and µ, relate the microlens parameters to the physical parameters of the
lens; the second class are HST and ground-based observations that determine the
photometric properties of the blend.
In this section, we first describe a new set of MCMC simulations taking all
these constraints into account. Similarly to what is done to include HST photometry
in the “MCMC A” (see § 4.3.5), we incorporate HST astrometry constraints by
adding χ2 penalties to the fittings. For a given set of microlens parameters, we can
derive the physical parameters, namely, M , πrel, µgeo, and so calculate ρ = θ∗/θE
(from eq. [4.1]) and the F814W − F555W centroid offset (from eq. [4.15]). Then we
assign the χ2 penalties based on the observed centroid offset from § 4.3.4. In this
way, the MCMC simulations simultaneously include all microlens constraints on the
lens properties. The posterior probability distribution of M and πrel are plotted in
Figure 4.7. The πrel determination very strongly excludes a bulge (πrel ∼< 0.05) lens.
139
Note that by incorporating HST astrometry, we implicitly assume that the blend is
the lens.
If the blend is indeed the lens itself, we can also estimate its mass and
distance from the measured color and magnitude of the blend. In doing so, we
use theoretical stellar isochrones (M. Pinsonneault 2007, private communication)
incorporating the color-temperature relation by Lejeune et al. (1997, 1998). We first
use an isochrone that has solar metal abundance, with stellar masses ranging from
0.25M⊙ − 1.0M⊙, and an age of 4 Gyrs. The variation in stellar brightness due to
stellar age is negligible for our purpose. Extinction is modeled as a function of Dl
by dAI/dDl = (0.4 kpc−1) exp(−wDl), where w is set to be 0.31 kpc−1 so that the
observed value AI(8.6 kpc) = 1.20 (as derived from CMD discussed in § 4.3.3) is
reproduced. Again, the distance to the source is assumed to be 8.6 kpc, implying
πs = 0.116 mas, and hence that the lens distance is Dl/kpc = mas/(πrel + πs). In
Figure 4.7, we show the lens mass M and relative parallax πrel derived from the
isochrone that correspond to the observed I-band magnitude I = 21.3 in black
line and a series of V − I values V − I = 1.8 (best estimate), 2.0 (0.5 σ), 2.1 (1 σ),
2.3 (1.5 σ) and 2.6 (2 σ) as black points. The observed color is in modest disagreement
< 2σ with the mass and distance of the lens at solar metallicity. We also show
analogous trajectories for [M/H] = −0.5 (red) and [M/H] = −1.0 (green). The level
of agreement changes only very weakly with metallicity.
140
We then include the isochrone information in a new set of MCMC runs
(“MCMC B”). To do so, the HST blended fluxes in I and V bands can no longer
be treated as independent MCMC parameters. Instead, based on the isochrone
with solar metallicity, the lens V − I color and I magnitude are predicted at the
lens mass and distance determined from MCMC parameters. Then the HST I-band
and V -band fluxes are fixed at the predicted values in the fitting for each MCMC
realization.
Figure 4.8 illustrates the constraints on M and πrel from the MCMC, which are
essentially the same for both wide-binary (solid contours) and close-binary (dashed
contours) solutions:
M = 0.46 ± 0.04 M⊙, πrel = 0.19 ± 0.03 mas. (4.19)
Assuming the source distance at 8.6 kpc, the πrel estimates translate to the following
lens distance measurement:
Dl = 3.2 ± 0.4 kpc. (4.20)
Furthermore, we can derive constraints on the planet mass Mp and the projected
separation between the planet and the lens star r⊥,
Mp = 3.8 ± 0.4MJupiter, r⊥ = 3.6 ± 0.2AU (wide), (4.21)
141
and
Mp = 3.4 ± 0.4MJupiter, r⊥ = 2.1 ± 0.1AU (close). (4.22)
The wide solution is slightly preferred over close solution by ∆χ2 = 2.1.
To examine possible uncertainties in extinction estimates, we reran our MCMC
with AI and AV that are 10% higher and lower than the fiducial values. These runs
result in very similar estimates as when adopting the fiducial values.
From equations (4.14) and (4.13), one can easily obtain the centroid shift
between two epochs in a given passband by ignoring ∆θs(t)1,
θc(t2) − θc(t1) = µs(t2 − t1) + µgeo[fl(t2)(t2 − t0) − fl(t1)(t1 − t0)]
+πrel[fl(t2)∆s(t2) − fl(t1)∆s(t1)] (4.23)
Because µgeo, πrel and fl in a given passband can be extracted from the MCMC
realizations (“MCMC B”), we can use the above equation to measure the source
proper motion by making use of the centroid shift in F814W between two epochs.
1 The angular separations between the source and the lens are ∼ 0.47θE and ∼ 4.4θE for the
two HST epochs, respectively. Thus the angular position offsets between the centroids of the source
images and the source are both ∼ 0.21θE and the directions of the offset relative to the source are
almost the same due to the small impact parameter u0. The difference between lens flux fractions
of the two epochs are about 7% in I band, so the offsets can be confidently ignored in deriving the
source proper motion using the relative astrometry in F814W at two different epochs.
142
The source proper motion with respect to the mean motion of stars in the HST field
is measured to be
µs = (µs,E, µs,N) = (2.0 ± 0.2, −0.5+0.2−0.7) mas yr−1. (4.24)
We obtain similar results with F555W, but with understandably larger errorbars
since the astrometry is more precise for the microlens in F814W.
Combining equations (4.3) and (4.4), the lens proper motion in the heliocentric
frame is therefore
µl = µgeo + µs +v⊕πrel
AU. (4.25)
For each MCMC realization, πrel is known, so we can convert the lens proper motion
to the velocity of the lens in the heliocentric frame vl,hel and also in the frame of
local standard of rest vl,LSR (we ignore the rotation of the galactic bulge). The
lens velocity in the LSR is estimated to be vl,LSR = 103 ± 15 km s−1. This raises
the possibility of the lens being in the thick disk, in which the stars are typically
metal-poor. As shown in Figure 4.7, the constraints we have cannot resolve the
metallicity of the lens star.
143
Planetary Orbital Motion
Wide/Close Degeneracy Binary-lens light curves in general exhibit a
well-known “close-wide” symmetry (Dominik 1999b; An 2005). Even for some
well-covered caustics-crossing events (e.g., Albrow et al. 1999a), there are quite
degenerate sets of solutions between wide and close binaries. In Paper I, we found
that the best-fit point-source wide-binary solution was preferred over close-binary
solutions by ∆χ2 ∼ 22. But this did not necessarily mean that the wide-close binary
degeneracy was broken, since the two classes of binaries may be influenced differently
by higher order effects. We find that the χ2 difference between best-fit wide and
close solutions is within 1 from “MCMC A” and 2.1 (positive u0) or 2.2 (negative
u0) from “MCMC B”.
However, orbital motion of the planet is subject to additional dynamical
constraints: the projected velocity of the planet should be no greater than the escape
velocity of the system: v⊥ ≤ vesc, where,
v⊥ =√
d2 + (ωd)2AU
πl
θE, (4.26)
vesc =
√
2GM
r≤ vesc,⊥ ≡
√
2GM
dθEDl
=
√
πl
2dπE
c, (4.27)
and where r is the instantaneous 3-dimensional planet-star physical separation. Note
that in the last step, we have used equation (4.1).
144
We then calculate the probability distribution of the ratio
v2⊥
v2esc,⊥
= 2AU2
c2
d3[(d/d)2 + ω2]
[πE + (πs/θE)]3πE
θE
(4.28)
for an ensemble of MCMC realizations for both wide and close solutions. Figure 4.9
shows probability distributions of the projected velocity r⊥γ in the units of critical
velocity vc,⊥, where r⊥γ is the instantaneous velocity of the planet on the sky, which
is further discussed in Appendix C and vc,⊥ = vesc,⊥/√
2. The dotted circle encloses
the solutions that are allowed by the escape velocity criteria, and the solutions that
are inside the solid line are consistent with circular orbital motion. We find that the
best-fit close-binary solutions are physically allowed while the best-fit wide-binary
solutions are excluded by these physical constraints at 1.6 σ. The physically excluded
best-fit wide solutions are favored by ∆χ2 = 2.1 (or 2.2) over the close solutions,
so by putting physical constraints, the degenerate solutions are statistically not
distinguishable at 1σ.
Circular Planetary Orbits and Planetary Parameters Planetary deviations
in microlensing light curves are intrinsically short, so in most cases, only the
instantaneous projected distance between the planet and the host star can be
extracted. As shown in § 4.3.6, for this event, we tentatively measure the
instantaneous projected velocity of the planet thanks to the relatively long
(∼ 4 days) duration of the planetary signal. One cannot solve for the full set of
145
orbital parameters just from the instantaneous projected position and velocity.
However, as we show in Appendix C, we can tentatively derive orbital parameters by
assuming that the planet follows a circular orbit around the host star. In Figure 4.10,
we show the probability distributions of the semimajor axis, inclination, amplitude
of radial velocity, and equilibrium temperature of the planet derived from “MCMC
B” for both wide and close solutions. The equilibrium temperature is defined to
be Teq ≡ (Lbol/Lbol,⊙)1/4(2a/R⊙)−1/2T⊙, where Lbol is the bolometric luminosity of
the host, a is the planet semimajor axis, and Lbol,⊙, R⊙, and T⊙ are the luminosity,
radius, and effective temperature of the Sun, respectively. This would give the Earth
an equilibrium temperature of Teq = 285 K. In calculating these probabilities, we
assign a flat (Opik’s Law) prior for the semimajor axis and assume that the orbits
are randomly oriented, that is, with a uniform prior on cos i.
4.3.7. Constraints on a Non-Luminous Lens
In § 4.3.5, we noted that the blended light must lie within 15 mas of the source:
otherwise the HST images would appear extended. We argued that the blended
light must be associated with the event (either the lens itself or a companion to
either the source or lens), since the chance of such an alignment by a random
field star is < 0.07%. In fact, even stronger constraints can be placed on the
blend-source separation using the arguments of § 4.3.4. These are somewhat more
complicated and depend on the blend-source relative parallax, so we do not consider
146
the general case (which would only be of interest to further reduce the already very
low probability of a random interloper) but restrict attention to companions of the
source and lens. We begin with the simpler source-companion case.
Blend As Source Companion
As we reported in § 4.3.4, there were two HST measurements of the astrometric
offset between the V and I light centroids, dating from 0.09 and 0.84 years after peak,
respectively. In that section, we examined the implications of these measurements
under the hypothesis that the blend is the lens. We therefore ignored the first
measurement because the lens source separation at that epoch is much better
constrained by the microlensing event itself than by the astrometric measurement.
However, as we now examine the hypothesis that the blend is a companion to the
source, both epochs must be considered equally. Most of the weight (86%) comes
from the second observation, partly because the astrometric errors are slightly
smaller, but mainly because the blend contributes about twice the fractional light,
which itself reduces the error on the inferred separation by a factor of 2. Under
this hypothesis, we find a best-fit source-companion separation of 5 mas, with a
companion position angle (north through east) of 280. The (isotropic) error is
3 mas. Approximating the companion-source relative motion as rectilinear, this
measurement strictly applies to an epoch 0.73 years after the event, but of course
147
the intrinsic source-companion relative motion must be very small compared to the
errors in this measurement.
There would be nothing unusual about such a source-companion projected
separation, roughly 40 ± 25 AU in physical units. Indeed, the local G-star binary
distribution function peaks close to this value (Duquennoy & Mayor 1991).
The derived separation is also marginally consistent with the companion
generating a xallarap signal that mimics the parallax signal in our dominant
interpretation. The semi-major axis of the orbit would have to be about 0.8 AU to
mimic the 1-year period of the Earth, which corresponds to a maximum angular
separation of about 100 µas, which is compatible with the astrometric measurements
at the 1.6 σ level.
Another potential constraint comes from comparing the color difference with
the magnitude difference of the source and blend. We find that the source is about
0.5 ± 0.5 mag too bright to be on the same main sequence. However, first, this is
only a 1σ difference, which is not significant. Second, both the sign and magnitude
of the difference are compatible with the source being a slightly evolved turnoff star,
which is consistent with its color.
The only present evidence against the source-companion hypothesis is that the
astrometric offset between V and I HST images changes between the two epochs,
and that the direction and amplitude of this change is consistent with other evidence
148
of the proper motion of the lens. Since this is only a P = 1.7% effect, it cannot be
regarded as conclusive. However, additional HST observations at a later epoch could
definitively confirm or rule out this hypothesis.
Blend As A Lens Companion
A similar, but somewhat more complicated line of reasoning essentially rules
out the hypothesis that the blend is a companion to the lens, at least if the lens is
luminous. The primary difference is that the event itself places very strong lower
limits on how close a companion can be to the lens.
A companion with separation (in units of θE) d ≫ 1 induces a Chang-Refsdal
(1979) caustic, which is fully characterized by the gravitational shear γ = q/d2. We
find that the light-curve distortions induced by this shear would be easily noticed
unless γ < 0.0035, that is,
γ =qc
d2c
=qcθ
2E
θ2c
< 0.0035, (4.29)
where qc = Mc/M is the ratio of the companion mass to the lens mass and dc = θc/θE
is the ratio of the lens-companion separation to the Einstein radius. Equivalently,
θc > 19(
qc
1.3
)1/2
θE. (4.30)
149
Here, we have normalized qc to the minimum mass ratio required for the companion
to dominate the light assuming that both are main-sequence stars. (We will also
consider completely dark lenses below).
We now show that equation (4.30) is inconsistent with the astrometric data. If a
lens companion is assumed to generate the blend light, then essentially the same line
of reasoning given in § 4.3.7 implies that 0.73 years after the event, this companion
lies 5 mas from the source, at position angle 280 and with an isotropic error of 3
mas. The one wrinkle is that we should now take account of the relative-parallax
term in equation (4.15), whereas this was identically zero (and so was ignored) for
the source-companion case. However, this term is only about 1.8πrel and hence is
quite small compared to the measurement errors for typical πrel ∼< 0.2mas. We will
therefore ignore this term in the interest of simplicity, except when we explicitly
consider the case of large πrel further below.
Of course, the lens itself moves during this interval. From the parallax
measurement alone (i.e. without attributing the V/I astrometric displacement to
lens motion), it is known that the lens is moving in the same general direction,
i.e., with position angle roughly 210. In assessing the amplitude of this motion we
consider only the constraints from finite-source effects (and ignore the astrometric
displacement). These constraints yield a hard lower limit on θE (from lack of
pronounced finite-source effects) of θE > 0.6 mas, which corresponds to a proper
motion µ = 3.1 mas yr−1. At this extreme value (and allowing for 2σ uncertainty
150
in the direction of lens motion as well in the measurement of the companion
position), the maximum lens-companion separation is 11.4 mas (i.e., 19 θE),
which is just ruled out by equation (4.30). At larger θE, the lens-companion
scenario is excluded more robustly. For example, in the limit of large θE, we have
θc = µ × 0.73 yr = θE(0.73 yr/tE) = 3.9θE, which is clearly ruled out by equation
(4.30).
Then we note that any scenario involving values of πrel that are large enough
that they cannot be ignored in this analysis (πrel ∼> 0.5 mas), must also have very
large θE = πrel/πE ∼> 1 mas, a regime in which the lens-companion is easily excluded.
The one major loophole to this argument is that the lens may be a stellar
remnant (white dwarf, neutron star, or black hole), in which case it could be more
massive than the companion despite the latter’s greater luminosity.
4.3.8. Xallarap Effects and Binary Source
Binary source motion can give rise to distortions of the light curve, called
“xallarap” effects. One can always find a set of xallarap parameters to perfectly
mimic parallax distortions caused by the Earth’s motion (Smith et al. 2003).
However, it is a priori unlikely for the binary source to have such parameters, so
if the parallax signal is real, one would expect the xallarap fits to converge to the
Earth parameters. For simplicity, we assume that the binary source is in circular
151
orbit. We extensively search the parameter space on a grid of 5 xallarap parameters,
namely, the period of binary motion P , the phase λ and complement of inclination
β of the binary orbit, which corresponds to the ecliptic longitude and latitude in
the parallax interpretation of the light curve, as well as (ξE,E, ξE,N), which are the
counterparts of (πE,E, πE,N) of the microlens parallax. We take advantage of the
two exact degeneracies found by Poindexter et al. (2005) to reduce the range of the
parameter search. One exact degeneracy takes λ′ = λ + π and χE′ = −χE, while
all other parameters remain the same. The other takes β′ = −β, u0′ = −u0 and
ξ′E,N = −ξE,N (the sign of α should be changed accordingly as well). Therefore we
restrict our search to solutions with positive u0 and with π ≤ λ ≤ 2π. In modeling
xallarap, planetary orbital motion is neglected. In Figure 4.11, the χ2 distribution
for best-fit xallarap solutions as a function of period is displayed in a dotted line,
and the xallarap solution with a period of 1 year has a ∆χ2 = 0.5 larger than the
best fit at 0.9 year. Figure 4.12 shows that, for the xallarap solutions with period of
1 year, the best fit has a ∆χ2 = 3.2 less than the best-fit parallax solution (displayed
as a black circle point) and its orbital parameters are close to the ecliptic coordinates
of event (λ = 268, β = −11). Therefore, the overall best-fit xallarap solution has
∆χ2 = 3.7 smaller than that of the parallax solution (whose χ2 value is displayed as
a filled dot in Fig. 4.11) for 3 extra degrees of freedom, which gives a probability
of 30%. The close proximity between the best-fit xallarap parameters and those
of the Earth can be regarded as good evidence of the parallax interpretation. The
152
slight preference of xallarap could simply be statistical fluctuation or reflect low-level
systematics in the light curve (commonly found in the analysis by Poindexter et al.
2005).
We also devise another test on the plausibility of xallarap. In § 4.3.5, we argued
that the blend is unlikely to be a random interloper unrelated to either the source or
the lens. If the source were in a binary, then the blend would naturally be explained
as the companion of the source star. Then from the blend’s position on the CMD,
its mass would be mc ∼ 0.9 M⊙. By definition, ξE is the size of the source’s orbit as
in the units of rE (the Einstein radius projected on the source plane),
ξE =as
rE
=amc
(mc + ms)rE
, (4.31)
where a is the semimajor axis of the binary orbit, and ms and mc are the masses of
the source and its companion, respectively. Then we apply Kepler’s Third Law:
(
P
yr
)2 m3c
M⊙(mc + ms)2=(
ξErE
AU
)3
. (4.32)
Once the masses of the source and companion are known, the product of ξE and
rE are determined for a given binary orbital period P . And in the present case,
rE/AU = θEDs = θ∗/ρDs = 4.5 × 10−3/ρ. By adopting ms = 1M⊙, mc = 0.9M⊙, for
each set of P and ρ, there is a uniquely determined ξE from equation (4.32). We then
apply this constraint in the xallarap fitting for a series of periods. The minimum χ2s
for each period from the fittings are shown in solid line in Figure 4.11. The best-fit
153
solution has ∆χ2 ∼ 1.0 less than the best-fit parallax solution for two extra degrees
of freedom. Although as compared to the test described in the previous paragraph,
the current test implies a higher probability that the data are explained by parallax
(rather than xallarap) effects, it still does not rule out xallarap.
4.4. Summary and Future Prospects
Our primary interpretation of the OGLE-2005-BLG-071 data assumes that the
light-curve distortions are due to parallax rather than xallarap and that the blended
light is due to the lens itself rather than a companion to the source. Under these
assumptions, the lens is fairly tightly constrained to be a foreground M dwarf, with
mass M = 0.46 ± 0.04 M⊙ and distance Dl = 3.2 ± 0.4 kpc, which has thick-disk
kinematics (vLSR ∼ 103 km s−1). As we discuss below, future observations might
help to constrain its metallicity. The microlens modeling suffers from a well-known
wide-close binary degeneracy. The best-fit wide-binary solutions are slightly favored
over the close-binary solutions, however, from dynamical constraints on planetary
orbital motion, the physically allowed solutions are not distinguishable within 1 σ.
For the wide-binary model, we obtain a planet of mass Mp = 3.8 ± 0.4 MJupiter
at projected separation r⊥ = 3.6 ± 0.2 AU. The planet then has an equilibrium
temperature of about T = 55 K, i.e. similar to Neptune. In the degenerate
154
close-binary solutions, the planet is closer to the star and so hotter, and the
estimates are: Mp = 3.4 ± 0.4 MJupiter, r⊥ = 2.1 ± 0.1 AU and T ∼ 71 K.
As we have explored in considerable detail, it is possible that one or both of
these assumptions is incorrect. However, future astrometric measurements that are
made after the lens and source have had a chance to separate, will largely resolve
both ambiguities. Moreover, such measurements will put much tighter constraints
on the metallicity of the lens (assuming that it proves to be the blended light).
First, the astrometric measurements made 0.84 yr after the event detected
motion suggests that there was still 1.7% chance that the blend did not move relative
to the source. A later measurement that detected this motion at higher confidence
would rule out the hypothesis that the blend is a companion to the source. We
argued in § 4.3.7 that the blend could not be a companion to a main-sequence
lens. Therefore, the only possibilities that would remain are that the lens is the
blend, that the lens is a remnant (e.g., white dwarf), or that the blend is a random
interloper (probability < 10−3). As we briefly summarize below, a future astrometric
measurement could strongly constrain the remnant-lens hypothesis as well.
Of course, it is also possible that future astrometry will reveal that the blend
does not moving with respect to the source, in which case the blend would be a
companion to the source. Thus, either way, these measurements would largely
resolve the nature of the blended light.
155
Second, by identifying the nature of blend, these measurements will largely,
but not entirely, resolve the issue of parallax vs. xallarap. If the blend proves not to
be associated with the source, then any xallarap-inducing companion would have to
be considerably less luminous, and so (unless it were a neutron star) less massive
than the mc = 0.9 M⊙ that we assumed in evaluating equation (4.32). Moreover,
stronger constraints on rE (rhs of eq. [4.32]) would be available from the astrometric
measurements. Hence, the xallarap option would be either excluded or very strongly
constrained by this test.
On the other hand, if the blend were confirmed to be a source companion, then
essentially all higher order constraints on the nature of the lens would disappear.
The parallax “measurement” would then very plausibly be explained by xallarap,
while the “extra information” about θE that is presently assumed to come from the
blend proper-motion measurement would likewise evaporate.
These considerations strongly argue for making a future high-precision
astrometric measurement. Recall that in the HST measurements reported in § 4.3.5,
the source and blend were not separately resolved: the relative motion was inferred
from the offset between the V and I centroids, which are displaced because the source
and blend have different colors. Due to its well-controlled PSF, HST is capable of
detecting the broadening of the PSF even if the separation of the lens and source is
a fraction of the FWHM. Assuming that the proper motion is µgeo ∼ 4.4 mas yr−1,
and based on our simulations in § 4.3.5, such broadening would be confidently
156
detectable about 5 years after the event (see also Bennett et al. 2007 for analytic
PSF broadening estimates). Ten years after the event, the net displacement would
be ∼ 40 mas. This compares to a diffraction limited FWHM of 40 mas for H band on
a ground-based 10m telescope and would therefore enable full resolution. The I −H
color of the source is extremely well determined (0.01 mag) from simultaneous I and
H data taken during the event from the CTIO/SMARTS 1.3m in Chile. Hence,
the flux allocation of the partially or fully resolved blend and source stars would
be known. The direct detection of a partially or fully resolved lens will provide
precise photometric and astrometry measurements (see Kozlowski et al. 2007 for one
such example), which will enable much tighter constraints on the mass, distance
and projected velocity of the lens. It also opens up the possibility of determining
the metallicity of the host star by taking into account both non-photometric and
photometric constraints. If, as indicated by the projected velocity measurement, it
is a thick-disk star, then it will be one of the few such stars found to harbor a planet
(Haywood 2008).
As remarked above, a definitive detection of the blend’s proper motion would
still leave open the possibility that it was a companion to the lens, and not the
lens itself. In this case, the lens would have to be a remnant. Without going into
detail, the astrometric measurement would simultaneously improve the blend color
measurement as well as giving a proper-motion estimate (albeit with large errors
because the blend-source offset at the peak of the event would then not be known).
157
It could then be asked whether the parallax, proper-motion, and photometric
data could be consistently explained by any combination of remnant lens and
main-sequence companion. This analysis would depend critically on the values of
the measurements, so we do not explore it further here. We simply note that this
scenario could also be strongly constrained by future astrometry.
4.5. Discussion
With the measurements presented here, and the precision with which these
measurements allow us to determine the properties of the planet OGLE-2005-BLG-
071Lb and its host, it is now possible to place this system in the context of similar
planetary systems discovered by radial velocity (RV) surveys. Of course, the kind
of information that can be inferred about the planetary systems discovered via RV
differs somewhat from that presented here. For example, for planets discovered via
RV, it is generally only possible to infer a lower limit to the planet mass, unless
the planets happen to transit or produce a detectable astrometric signal. Mutatis
mutandis, for planets discovered via microlensing, it is generally only possible to
measure the projected separation at the time of the event, even in the case for which
the microlensing mass degeneracy is broken as it is here (although see Gaudi et al.
2008).
158
With these caveats in mind, we can compare the properties of OGLE-2005-
BLG-071Lb and its host star with similar RV systems. It is interesting to note that
the fractional uncertainties in the host mass and distance of OGLE-2005-BLG-071Lb
are comparable to those of some of the systems listed in Table 4.6.
OGLE-2005-BLG-071Lb is one of only eight Jovian-mass (0.2MJupiter <
Mp < 13MJupiter) planets that have been detected orbiting M dwarf hosts (i.e.,
M∗ < 0.55 M⊙) (Marcy et al. 1998, 2001; Delfosse et al. 1998; Butler et al. 2006;
Johnson et al. 2007b; Bailey et al. 2008). Table 4.6 summarizes the planetary
and host-star properties of the known M dwarf/Jovian-mass planetary systems.
OGLE-2005-BLG-071Lb is likely the most massive known planet orbiting an M
dwarf.
As suggested by the small number of systems listed in Table 4.6, and shown
quantitatively by several recent studies, the frequency of relatively short-period
P ∼< 2000 days, Jupiter-mass companions to M dwarfs appears to be ∼ 3 − 5 times
lower than such companions to FGK dwarfs (Butler et al. 2006; Endl et al. 2006;
Johnson et al. 2007b; Cumming et al. 2008). This paucity, which has been shown
to be statistically significant, is expected in the core-accretion model of planet
formation, which generally predicts that Jovian companions to M dwarfs should be
rare, since for lower mass stars, the dynamical time at the sites of planet formation is
longer, whereas the amount of raw material available for planet formation is smaller
(Laughlin et al. 2004; Ida & Lin 2005; Kennedy & Kenyon 2008, but see Kornet
159
et al. 2006). Thus, these planets typically do not reach sufficient mass to accrete a
massive gaseous envelope over the lifetime of the disk. Consequently, such models
also predict that in the outer regions of their planetary systems, lower mass stars
should host a much larger population of ‘failed Jupiters,’ cores of mass ∼< 10 M⊕
(Laughlin et al. 2004; Ida & Lin 2005). Such a population was indeed identified
based on two microlensing planet discoveries (Beaulieu et al. 2006; Gould et al.
2006).
Our detection of a ∼ 4 MJupiter companion to an M dwarf may therefore present
a difficulty for the core-accretion scenario. While we do not have a constraint on the
metallicity of the host, the fact that it is likely a member of the thick disk suggests
that its metallicity may be subsolar. If so, this would pose an additional difficulty for
the core-accretion scenario, which also predicts that massive planets should be rarer
around metal-poor stars (Ida & Lin 2004), as has been demonstrated observationally
(Santos et al. 2004; Fischer & Valenti 2005). This might imply that a different
mechanism is responsible for planet formation in the OGLE-2005-BLG-071L system,
such as the gravitational instability mechanism (Boss 2002, 2006).
One way to escape these potential difficulties is if the host lens is actually a
stellar remnant, such as white dwarf. The progenitors of remnants are generally
more massive stars, which are both predicted (Ida & Lin 2005; Kennedy & Kenyon
2008) and observed (Johnson et al. 2007a,b) to have a higher incidence of massive
planets. As we discussed above, future astrometric measurements could constrain
160
both the low-metallicity and remnant-lens hypotheses. These measurements are
therefore critical.
Although it is difficult to draw robust conclusions from a single system,
there are now four published detections of Jovian-mass planetary companions
with microlensing (Bond et al. 2004; Gaudi et al. 2008), and several additional
such planets have been detected that are currently being analyzed. It is therefore
reasonable to expect several detections per year (Gould 2009), and thus that it will
soon be possible to use microlensing to constrain the frequency of massive planetary
companions. These constraints are complementary to those from RV, since the
microlensing detection method is less biased with respect to host star mass (Gould
2000a), and furthermore probes a different region of parameter space, namely cool
planets beyond the snow line with equilibrium temperatures similar to the giant
planets in our solar system (see, e.g. Gould et al. 2007 and Gould 2009).
161
RoboNet FTN R
PLANET Canopus I
MOA I
I
I
Clear
V
I
ClearOGLE V
OGLE I
3479 3480 3481 3482 348316
15.8
15.6
15.4
15.2
15
Fig. 4.1.— Main panel: all available ground-based data of the microlensing eventOGLE-2005-BLG-071. HST ACS HRC observations in F814W and F555W weretaken at two epochs, once when the source was magnified by A ∼ 2 (arrow), andagain at HJD = 2453788.2 (at baseline). Planetary models that include (solid) andexcludes (dotted) microlens parallax are shown. Zoom at bottom: triple-peak featurethat reveals the presence of the planet. Each of the three peaks corresponds to thesource passing by a cusp of the central caustic induced by the planet. Upper inset:trajectory of the source relative to the lens system in the units of angular Einsteinradius θE. The lens star is at (0, 0), and the star-planet axis is parallel to the x-axis.The best-fit angular size of the source star in units of θE is ρ ∼ 0.0006, too small tobe resolved in this figure.
162
Fig. 4.2.— Probability contours (∆χ2 = 1, 4) of microlens parallax parametersderived from MCMC simulations for wide-binary (in solid line) and close-binary (in adashed line) solutions. Fig. 2 and eq. (12) in Gould (2004) imply that πE,⊥ is definedso that πE,‖ and πE,⊥ form a right-handed coordinate system.
163
Fig. 4.3.— CMD for the OGLE-2005-BLG-071 field. Black dots are the stars withthe OGLE I-band and V -band observations. The red point and green points show thecenter of red clump and the source, respectively. The errors in their fluxes and colorsare too small to be visible on the graph. Cyan points are the stars in the ACS field,which are photometrically aligned with OGLE stars using 10 common stars. Themagenta point with error bars show the color and magnitude of the blended light.
164
Fig. 4.4.— HST ACS astrometric measurements of the target star in F814W (red)and F555W (blue) filters in 2005 (filled dots) and 2006 (open dots). The centerpositions of the big circles show mean values of the 4 dithered observations in eachfilter at each epoch while radii of the circle represent the 1σ errors.
165
finite-sourceastrometryjoint
wide
wideparallaxjoint
finite-sourceastrometryjoint
close
closeparallaxjoint
Fig. 4.5.— Upper two panels show posterior probability contours at ∆χ2 = 1 (solidline) and 4 (dotted line) for relative lens-source proper motion µgeo. The left panelis for wide-binary solutions and the right one is for close-binary. The green contoursshow the probability distributions constrained by the finite-source effects. The blackcontours are derived from HST astrometry measurements assuming πrel = 0.2 mas.The red contours show the joint probability distributions from both constraints. Thelower two panels show the posterior probability distribution of the position angle φµ
geo
of the relative lens-source proper motion for wide-binary and close-binary solutions,respectively. The histogram in red is derived from the red contours of joint probabilityfor finite source and astrometry constraints in the upper panel. The blue histogramrepresents that of the microlens parallax. They mildly disagree at 2.5σ.
166
Fig. 4.6.— Differences between OGLE V and HST F555W magnitudes for thematched stars are plotted against their V magnitudes measured by OGLE. Tocalculate the offset, we add a 0.017 mag “cosmic error” in quadrature to each point inorder to reduce χ2/dof to unity. The open circles represent the stars used to establishthe final transformation, and the filled point shows an “outlier”.
167
wide
close
V-I
1.8 (best-estimate)
2.0
2.1
2.3
2.6
Fig. 4.7.— Posterior probability distribution of lens mass M and relative lens-source parallax πrel from MCMC simulations discussed in § 4.3.6. The constraintsinclude those from parallax effects, finite-source effects and relative proper motionmeasurements from HST astrometry. The ∆χ2 = 1, 4, 9 contours are displayed insolid, dotted and dashed lines, respectively. Both wide-binary (magenta) and close-binary (blue) solutions are shown. The lines in black, red and green represent thepredicted M and πrel from the isochrones for different metal abundances: [M/H] = 0(black), −0.5 (red), −1.0 (green). The points on these lines correspond to the observedI-band magnitude I = 21.3 and various V −I values V −I = 1.8 (best estimate, filleddots), 2.0 (0.5 σ, filled triangle), 2.1 (1.0 σ, filled squares), 2.3 (1.5 σ, filled pentagons),and 2.6 (2.0 σ, filled hexagons)
168
Fig. 4.8.— Posterior probability distribution of lens mass M and relative lens-sourceparallax πrel from MCMC simulations assuming that the blended light comes fromthe lens star. The ∆χ2 = 1, 4 contours are displayed in a solid line for wide solutions,and in a dotted line for close solutions.
169
Fig. 4.9.— Probability contours of projected velocity r⊥γ (defined in Appendix C)in the units of vc,⊥ for both close-binary (upper panel) and wide-binary (lower panel)solutions. All the solutions that are outside the dotted circle are physically rejected asthe velocities exceed the escape velocity of the system. The boundary in a solid lineinside the dotted circle encloses the solutions for which circular orbits are allowed.
170
Fig. 4.10.— Probability distributions of planetary parameters (semimajor axis a,equilibrium temperature, cosine of the inclination, and amplitude of radial velocity ofthe lens star) from MCMC realizations assuming circular orbital motion. Histogramsin black and red represent the close-binary and wide-binary solutions, respectively.Dotted and dashed histograms represent the two degenerate solutions for each MCMCrealization discussed in Appendix C.
171
Fig. 4.11.— χ2 distributions for best-fit xallarap solutions at fixed binary-sourceorbital periods P . The solid and dotted lines represent xallarap fits with and withoutdynamical constraints described in § 4.3.8. The best-fit parallax solution is shown asa filled dot at period of 1 year. All of the fits shown in this figure assume no planetaryorbital motion.
172
Fig. 4.12.— Results of xallarap fits by fixing binary orbital phase λ and complementof inclination β at period P = 1 yr and u0 > 0. The plot is color-coded for solutionswith ∆χ2 within 1 (black), 4 (red), 9 (green), 16 (blue), 25 (magenta), 49 (yellow)of the best fit. The Earth parameters are indicated by black circles. Because of aperfect symmetry (u0 → −u0 and α → −α), the upper black circle represents Earthparameter (λ = 268, β = −11) for the case u0 < 0. Comparison of parallax withxallarap must be made with the better of the two, that is, the lower one.
173
Model t0 u0 tE d q α ρ πE,N πE,E ω d/d
χ2 (HJD’) (day) ×103 (deg) ×104 (yr−1) (yr−1)
Wide+ 3480.7024 0.0282 71.1 1.306 7.5 273.63 3.9 -0.30 -0.26 -1.328 -0.256
1345.0 +0.0058−0.0054
+0.0008−0.0009
+2.3−2.4
+0.002−0.004 ±0.2 +0.16
−0.15+1.8−2.7
+0.24−0.28 ±0.05 +0.274
−0.165+0.134−0.129
Wide− 3480.7028 -0.0283 70.6 1.307 7.5 86.21 3.9 -0.34 -0.26 1.117 -0.277
1345.3 +0.0054−0.0056
+0.0010−0.0008 ±2.2 +0.003
−0.004 ±0.3 +0.13−0.16
+1.8−2.6
+0.30−0.23 ±0.05 +0.130
−0.293+0.152−0.113
Close+ 3480.6789 0.0239 70.1 0.763 6.9 274.27 3.1 -0.36 -0.27 0.301 0.502
1345.8 +0.0055−0.0043
+0.0009−0.0007
+2.1−2.4
+0.004−0.006 ±0.3 +0.25
−0.36+1.7−2.5
+0.24−0.27 ±0.05 +0.486
−0.788+0.148−0.101
Close− 3480.6799 -0.0241 69.2 0.762 6.9 85.53 2.7 -0.33 -0.26 -0.405 0.528
1345.2 +0.0042−0.0051
+0.0009−0.0007
+2.3−1.9
+0.004−0.006 ±0.3 +0.39
−0.26 ±2.2 +0.28−0.26 ±0.05 +0.696
−0.622+0.127−0.118
Table 4.1. OGLE-2005-BLG-071 Light Curve Parameter Estimations From Markov chain Monte
Carlo Simulations A (Part 1).
174
Model Is Ib Vs Vb
χ2 (mag) (mag) (mag) (mag)
Wide+ 19.51 21.29 20.85 23.11
1345.0 +0.04−0.03
+0.22−0.17 ±0.04 +0.43
−0.26
Wide− 19.51 21.30 20.85 23.11
1345.3 ±0.04 ±0.19 ±0.03 +0.43−0.26
Close+ 19.52 21.28 20.85 23.13
1345.8 ±0.03 +0.21−0.16
+0.04−0.03
+0.40−0.28
Close− 19.52 21.30 20.85 23.17
1345.2 ±0.03 ±0.18 ±0.04 +0.35−0.32
Table 4.2. OGLE-2005-BLG-071 Light Curve Parameter Estimations
From Markov chain Monte Carlo Simulations A (Part 2).
175
Model t0 u0 tE d q α ρ πE,N πE,E ω d/d
χ2 (HJD’) (day) ×103 (deg) ×104 (yr−1) (yr−1)
Wide+ 3480.7015 0.0287 69.3 1.305 7.7 273.67 6.1 -0.02 -0.22 -1.242 -0.283
1353.4 +0.0050−0.0059 ±0.0007 +1.6
−1.7+0.003−0.005 ±0.2 +0.17
−0.11 ±0.4 ±0.12 ±0.03 +0.321−0.125 ±0.129
Wide− 3480.7012 -0.0287 69.2 1.305 7.7 86.29 6.0 0.02 -0.21 1.193 -0.293
1353.3 +0.0052−0.0060 ±0.0007 +1.7
−1.8+0.002−0.005 ±0.2 +0.12
−0.15+0.5−0.3
+0.10−0.13 ±0.03 +0.127
−0.342+0.131−0.121
Close+ 3480.6792 0.0245 68.3 0.763 7.0 274.38 6.0 -0.01 -0.22 0.415 0.569
1355.5 +0.0041−0.0051
+0.0005−0.0006 ±1.6 +0.003
−0.006+0.3−0.2
+0.23−0.39 ±0.4 +0.12
−0.15 ±0.02 +0.503−0.744
+0.112−0.130
Close− 3480.6793 -0.0245 68.2 0.762 7.1 85.63 6.0 0.04 -0.22 -0.179 0.561
1355.5 +0.0042−0.0051 ±0.0006 +1.8
−1.5+0.004−0.006 ±0.3 +0.46
−0.24 ±0.4 +0.09−0.16 ±0.03 +0.703
−0.722+0.126−0.112
Table 4.3. OGLE-2005-BLG-071 Light Curve Parameter Estimations From Markov chain Monte
Carlo Simulations B (Part 1).
176
Model Is Ib Vs Vb
χ2 (mag) (mag) (mag) (mag)
Wide+ 19.49 21.40 20.82 23.91
1353.4 ±0.03 ±0.19 ±0.03 +0.24−0.20
Wide− 19.49 21.40 20.82 23.97
1353.3 ±0.02 ±0.19 ±0.03 +0.19−0.24
Close+ 19.49 21.35 20.83 23.88
1355.5 ±0.03 +0.20−0.16 ±0.02 +0.24
−0.18
Close− 19.50 21.36 20.83 23.90
1355.5 ±0.02 ±0.18 ±0.02 ±0.21
Table 4.4. OGLE-2005-BLG-071 Light Curve Parameter Estimations
From Markov chain Monte Carlo Simulations B (Part 2).
177
Model M πrel Dl µN µE θE Mp r⊥
χ2 M⊙ mas kpc mas yr−1 mas yr−1 mas MJupiter AU
Wide+ 0.46 0.19 3.2 -0.4 -4.3 0.84 3.8 3.6
1353.4 ±0.04 ±0.04 ±0.4 +2.7−3.1 ±0.3 +0.06
−0.04+0.3−0.4 ±0.2
Wide− 0.46 0.19 3.2 0.3 -4.3 0.85 3.8 3.6
1353.3 ±0.04 +0.04−0.03 ±0.4 +2.3
−3.6+0.3−0.2 ±0.05 +0.3
−0.4 ±0.2
Close+ 0.46 0.19 3.1 -2.6 -4.4 0.86 3.4 2.1
1355.5 ±0.04 +0.04−0.03 ±0.4 +4.8
−1.1 ±0.3 ±0.05 +0.3−0.4 ±0.1
Close− 0.46 0.20 3.1 -0.2 -4.4 0.87 3.4 2.1
1355.5 ±0.04 ±0.04 ±0.3 +3.6−3.4 ±0.3 ±0.04 ±0.3 ±0.1
Table 4.5. OGLE-2005-BLG-071 Derived Physical Parameters
178
Name M∗ Metallicity Dist. Mp P a Ref.
(M⊙) (pc) (MJup) (days) (AU)
GJ 876c 0.32 −0.12 4.660 0.6− 30.340 0.13030 1,2,3
±0.03 ±0.12 ±0.004 0.8 ±0.013
GJ 876b – – – 1.9− 60.940 0.20783 –
2.5 ±0.013
GJ 849b 0.49 0.16 8.8 0.82/ sin i 1890 2.35 4
±0.05 ±0.2 ±0.2 ±130
GJ 317b 0.24 −0.23 9.2 1.2/ sin i 692.9 0.95 5
±0.04 ±0.2 ±1.7 ±4
GJ 832b 0.45 ∼ −0.7 ∼ 4.93 0.64/ sin i 3416 3.4 6
±0.05 /-0.3 ±131 ±0.4
OGLE-2006 0.50 ? 1490 0.71 1830 2.3 7
-BLG-109Lb ±0.05 ? ±130 ±0.08 ±370 ±0.2
OGLE-2006 – – – 0.27 5100 4.6 –
-BLG-109Lc ±0.03 ±730 ±0.5
(cont’d)
Table 4.6. Jovian-mass Companions to M Dwarfs (M∗ < 0.55 M⊙)
179
Table 4.6—Continued
Name M∗ Metallicity Dist. Mp P a Ref.
(M⊙) (pc) (MJup) (days) (AU)
OGLE-2005 0.46 Subsolar?a 3300 3.8b – 3.6b,c This
-BLG-071Lb ±0.04 ±300 ±0.4 ±0.2 Paper
aWhile the metallicity of the OGLE-2005-BLG-071Lb host star is not directly
constrained by our data, its kinematics indicate it is likely a member of the metal-
poor thick disk.
bWe give the planet mass and projected separation for the wide solution, which
is favored by ∆χ2 = 2.1. The second, close solution has Mp = 3.4 ± 0.3 MJupiter
and r⊥ = 2.1 ± 0.1 AU.cWe give the the projected separation between the host and planet at the
time of event, which is the orbital parameter most directly constrained by our
observations. However, assuming a circular orbit, we infer that the semi-major
axis is likely only ∼ 10 − 20% larger [a(wide) ∼ 4.1AU, a(close) ∼ 2.5AU].
References. — (1)Rivera et al. 2005; (2)Bean et al. 2006; (3)Benedict et al.
2002; (4)Butler et al. 2006; (5) Johnson et al. 2007b (6) Bailey et al. 2008 (7)
Gaudi et al. 2008
180
Chapter 5
First Space-Based Microlens Parallax
Measurement: Spitzer Observations of
OGLE-2005-SMC-001
5.1. Introduction
In a visionary paper written more than 40 years ago, Refsdal (1966) argued
that two important but otherwise unmeasurable parameters of microlensing events
could be determined by simultaneously observing the event from the Earth and a
satellite in solar orbit. In modern language, these are the Einstein radius projected
onto the observer plane, rE, and the direction of lens-source relative proper motion.
Since the Einstein timescale, tE, is routinely measured for all events, these parameter
determinations are equivalent to knowing the projected relative velocity, v, whose
magnitude is simply v ≡ rE/tE. Here, rE ≡ AU/πE, tE = θE/µ, and
πE =
√
πrel
κM, θE =
√
κMπrel, (5.1)
181
where πE is the microlens parallax, M is the mass of the lens, θE is the angular
Einstein radius, πrel and µ are the lens-source relative parallax and proper motion,
respectively, and κ ≡ 4G/(c2AU).
The practical importance of this suggestion became clear when the MACHO
(Alcock et al. 1993) and EROS (Aubourg et al. 1993) collaborations reported the
detection of microlensing events toward the Large Magellanic Cloud (LMC). Over
the course of time, MACHO (Alcock et al. 1997, 2000) has found about 15 such
events and argued that these imply that about 20% of the Milky Way dark halo
is composed of compact objects (“MACHOs”), while EROS (Afonso et al. 2003;
Tisserand et al. 2006) has argued that their relative lack of such detections was
consistent with all the events being due to stars in the Milky Way disk or the
Magellanic Clouds (MCs) themselves. For any given individual event, it is generally
impossible to tell (with only a measurement of tE) where along the line of sight the
lens lies, so one cannot distinguish among the three possibilities: Milky Way disk,
Milky Way halo, or “self-lensing” in which the source and lens both lie in the same
external galaxy.
However, as Boutreux & Gould (1996) argued, measurement of v might allow
one to distinguish among these populations with good confidence: disk, halo, and
MC lenses typically have v values of 50, 300, and 2000 km s−1, respectively. The high
projected speed of MC lenses derives from the long “lever arm” that multiplies their
182
small local transverse speed by the ratio of the distances from the observer and the
lens to the source.
There are serious obstacles, both practical and theoretical to measuring v. One
obvious practical problem is simply launching a spacecraft with a suitable camera
into solar orbit. But the theoretical difficulties also place significant constraints on
the characteristics of that spacecraft. To understand these properly, one should
think in terms of the “microlens parallax” πE, whose magnitude is πE ≡ AU/rE
and whose direction is the same as v. Choosing a coordinate system whose x-axis
is aligned with the Earth-satellite separation at the peak of the event, we can write
πE = (πE,τ , πE,β). Then to good approximation,
πE = (πE,τ , πE,β) =AU
d⊥
(
∆t0tE
, ∆u0
)
, (5.2)
where d⊥ is the Earth-satellite separation (projected onto the plane of the sky), ∆t0
is the difference in time of event maximum as seen from the Earth and satellite,
and ∆u0 is the difference in dimensionless impact parameter (determined from the
maximum observed magnification).
Refsdal (1966) already realized that equation (5.2) implicitly contains a
four-fold degeneracy: while ∆t0/tE is unambiguously determined, there are four
different values of ∆u0 that depend on whether the individual impact parameters are
positive or negative (on one side of the lens or the other; see Fig. 2 of Gould 1994b).
183
In fact, the situation is considerably worse than this. While t0 is usually measured
very precisely in individual microlensing events, u0 typically has much larger errors
because it is strongly correlated with three other parameters, the timescale, tE, the
source flux fs, and the blended flux, fb. For a satellite separated by d⊥ ∼ 0.2 AU, and
a projected Einstein radius rE ∼ 5 AU, errors in the impact-parameter determinations
of only σ(u0) ∼ 2% would lead to fractional errors σ(πE)/πE ∼√
2σ(u0)rE/d⊥ ∼ 70%.
However, Gould (1995) showed that if the two cameras had essentially identical
spectral responses and similar point-spread-functions, so that one knew a priori that
the blended light was virtually identical for the Earth and satellite measurements,
then the error in ∆u0 would be reduced far below the individual errors in u0, making
the parallax determination once again feasible.
Unfortunately, this trick cannot be used on Spitzer, the first general purpose
camera to be placed in solar orbit. The shortest wavelength at which Spitzer operates
is the L band (3.6 µm), implying that the camera’s sensitivity cannot be duplicated
from the ground, because of both higher background and different throughput as a
function of wavelength.
In principle, microlens parallaxes can also be measured from the ground.
As with space-based parallaxes, one component of πE can generally be measured
much more precisely than the other. For most events, tE ≪ yr, and for these the
Earth’s acceleration can be approximated as constant during the event. To the
degree that this acceleration is aligned (anti-aligned) with the lens-source relative
184
motion, it induces an asymmetry in the light curve, since the event proceeds faster
(slower) before peak than afterward (Gould et al. 1994). This is characterized
by the “asymmetry parameter” Γ ≡ πE · α = πE,‖α, where α is the apparent
acceleration of the Sun projected onto the sky and normalized to an AU, and πE,‖
is the component of πE parallel to α. Since Γ is directly measurable from the light
curve, one can directly obtain 1-D parallax information πE,‖ = Γ/α for these events,
while the orthogonal component πE,⊥ is measured extremely poorly (e.g., Ghosh et
al. 2004; Jiang et al. 2005). While there are a few exceptions (Alcock et al. 2001a;
Gould et al. 2004; Park et al. 2004), 2-D parallaxes can generally only be obtained
for relatively long events tE ∼> 90 days, and even for these, the πE error ellipse is
generally elongated in the πE,⊥ direction (Poindexter et al. 2005).
Since Spitzer is in an Earth-trailing orbit and the SMC is close to the ecliptic
pole, the πE,τ direction (defined by the Earth-satellite separation vector) is very
nearly orthogonal to the πE,‖ direction (defined by the direction of the Sun).
Recognizing this, Gould (1999) advocated combining the two essentially 1-D
parallaxes from the Earth-Spitzer comparison and the accelerating Earth alone to
produce a single 2-D measurement of πE. He noted that once the difficult problem
of measuring πE,β was jettisoned, the satellite observations could be streamlined to
a remarkable degree: essentially only 3 observations were needed, 2 at times placed
symmetrically around the peak, which are sensitive to the offset in t0 between the
Earth and satellite, and a third at late times to set the flux scale. This streamlining is
185
important from a practical point of view because Target-of-Opportunity (ToO) time
on Spitzer incurs a large penalty. Gould (1999) noted that the components of πE
measured by the two techniques were not exactly orthogonal but argued (incorrectly
as it turns out) that this had no significant consequences for the experiment. We
return to this point below.
Here we analyze Spitzer and ground-based observations of the microlensing
event OGLE-2005-SMC-001 to derive the first microlens parallax measurement using
this technique.
5.2. Observations
On 2005 July 9 (HJD′ ≡ HJD-2450000 = 3561.37), the OGLE-III Early
Warning System (EWS, Udalski 2003) alerted the astronomical community that
OGLE-2005-SMC-001 (αJ2000.0 = 0h40m28.s5, δJ2000.0 = −7344′46.1′′) was a probable
microlensing event, approximately 23 days (and seven observations) into the 2005
OGLE-III observing season for the SMC. In fact, the EWS issued an internal alert
five days earlier, when there were only three 2005 points, but the OGLE team reacted
cautiously because of the high rate of questionable alerts toward the SMC and
because the source lies projected against a background galaxy, making it a potential
supernova candidate. However, the event shows a modest, but unambiguous rise 140
days earlier, at the end of the 2004 season, which is inconsistent with a supernova,
186
and the unmagnified source sits right in the middle of the red giant clump on
the color-magnitude diagram (CMD), with (V − I, I) = (0.92, 18.4). Moreover,
the light curve is achromatic. These factors convinced us that this was genuine
microlensing, leading us to exercise our Spitzer ToO option, which consisted of
three 2-hr observations: two placed symmetrically around the peak and one at the
baseline. Once this decision was made, OGLE increased its density of coverage to
3–5 I-band observations per clear night. OGLE observations were obtained using
the 1.3 m Warsaw telescope at Las Campanas Observatory in Chile, operated by
the Carnegie Institution of Washington. The photometry was reduced using the
standard OGLE-III data pipeline (Udalski 2003) based on the image subtraction
technique, DIA (Wozniak 2000). Also many V -band observations were obtained
during the event for monitoring achromaticity. There are Spitzer observations at
four epochs (not three, as originally envisaged). These were centered at 2005 July
15 UT 20:02:40, 2005 August 25 UT 12:44:25, 2005 September 15 UT 20:13:53,
and 2005 November 29 UT 10:24:40. The first, third, and fourth observations each
lasted 2 hr and consisted of two sets of about 100 dithered exposures, each of 26.8
s. The second observation (in August) was 1 hr, consisting of one set of 99 dithered
exposures, each of 26.8 s. It was obtained with director’s discretionary time (DDT).
All four were carried out simultaneously at 3.6 and 5.8 µm. However, the third
observation (which took place at relatively high magnification) was supplemented by
30 minutes of very short exposures in all four Infrared Array Camera (IRAC) filters
187
to probe the detailed spectral energy distribution of the source. The reason for the
additional DDT observation is discussed in detail in § 5.3.
As originally conceived, the experiment was to consist only of OGLE and
Spitzer observations. However, unexpected complications led us to take additional
data from other ground-based observatories as well as the Hubble Space Telescope
(HST).
Initially, we obtained some data using the 1.3 m SMARTS (former Two Micron
All Sky Survey [2MASS]) telescope at the Cerro Tololo Inter-American Observatory
(CTIO) in Chile simply as a precaution against possible future problems with the
OGLE telescope. (In order to align different light curves, it is generally necessary
that they have some overlap; one cannot wait for the problems to arise before
beginning to take data.) However, as the event approached peak, we found that it
could not be fit with a classical Paczynski (1986) model, even when augmented by
parallax. We therefore began to intensively observe the event from both the OGLE
and SMARTS telescopes in the hopes of obtaining enough data to determine the
nature of the light curve anomaly. Similar considerations led us to begin observations
using the 0.35m Nustrini telescope at the Auckland Observatory in New Zealand,
which lies at a substantially different longitude and suffers from substantially
different weather patterns from those experienced by the two Chile telescopes.
188
Additionally, several high-dispersion spectra were obtained at Las Campanas
Observatory using 6.5 m Magellan and 2.5 m du Pont telescopes with echelle
spectrographs at different magnifications in order to check for potential radial-
velocity variations in the spectra of the magnified source.
Finally, the anomalous behavior made it prudent to get high-resolution images
using HST, both to improve the modeling of blended light in the ground-based and
Spitzer images, and to determine whether the apparent ground-based source could
actually be resolved into multiple sources. We had HST ToO time to complement
low-resolution space-based microlensing parallax observations, which was originally
to be applied to observations by Deep Impact. With the probability low that these
would be triggered as originally planned, we applied this time to obtain two orbits
of observations of OGLE-2005-SMC-001. There were two epochs of (V, I, J,H,K)
exposures, on 2005 October 1 and 2006 May 17/18, with exposure times of (300,
200, 351, 351, and 639) seconds. The infrared observations were then repeated on
2006 June 25.
5.2.1. Error Rescaling
Errors from each ground-based observatory are rescaled to force χ2 per degree
of freedom close to unity. For the OGLE data, we find by inspecting the cumulative
distribution of the normalized residuals (δ/σ)2, that the rescaling factor is not
189
uniform over the data set. Here σ is the error reported by OGLE and δ is the
deviation of the data from the model. We therefore rescale in 4 segments, which are
separated at HJD = 2453100, 2453576, and 2453609, with rescaling factors 1.4, 1.9,
3.6, and 2.0. We tested two other error-rescaling schemes, one with no rescaling and
the other with uniform rescaling of the OGLE data. We found that the solutions do
not differ qualitatively when these alternate schemes are used.
5.2.2. Spitzer Data Reduction and Error Determination
Our scientific goals critically depend on obtaining high-precision IRAC
photometry for each of the 4 epochs. See § 5.3.1. These 4 epochs are divided into
7 1-hour sub-epochs, each consisting of about 100 dithers, one sub-epoch for the
second epoch and two for each of the other three epochs. Based on photon statistics
alone, the best possible precision would be about 0.2% for the five sub-epochs near
peak, and ∼ 0.4% for the last two sub-epochs. However, there are three interrelated
problems that must be overcome to even approach this potential. First, the images
contain “stripes” produced by nearby bright stars, perhaps AGB stars, which
(because the 3 near-peak observations took place over 60 days) appear at several
different rotation angles. Indeed, we expended considerable effort repositioning each
successive image to avoid having these stripes come too close to, or actually overlap,
the microlensed target, but they inevitably did overlap some reference stars. Second,
the microlensed source is blended with a neighboring star within 1.′′3, which is
190
easily resolved in HST images and clearly resolved in OGLE images as well. Third,
this problem is significantly complicated by the well-known fact that IRAC 3.6 µm
images are undersampled.
We apply the procedures of Reach et al. (2005) to perform aperture photometry
on the Basic Calibrated Data (BCD), which includes array-location-dependent and
“pixel-phase” photometric corrections at the few percent level. We choose 7 bright
and isolated stars, which we select from the OGLE images. The HST frames are
of course even better resolved, but they are too small to contain a big enough
sample of reference stars. The centroid position of the target-star aperture on each
of the (roughly 100) BCD dithers is determined by aligning the comparison stars
with the OGLE coordinates. We determine the “internal error” for the target star
and the comparison stars at each sub-epoch from the internal scatters in their
measurements. This is typically very close to the photon limit. However, we find
that the epoch-to-epoch scatter in the comparison stars is about 0.7%. While in
principle this could be due to intrinsic variability, such variability is unlikely to be
so pervasive at this level, particularly since any star showing variability in the I
band over several years was excluded as a reference star. Hence, we attribute this
variation to unknown epoch-to-epoch systematics, and we assume that these affect
the target in the same way that they affect the reference stars. Hence, we adopt
0.7% as our photometric error for each of the 7 Spitzer sub-epochs.
191
We also attempted to do point-spread-function (rather than aperture)
photometry, making use of “Point Response Functions” available at the Spitzer
Science Center Web site. However, we found that the reference stars showed greater
scatter between sub-epochs with this approach and so adopted the results from
aperture photometry.
5.3. Complications Alter Strategy and Analysis
Figure 5.1 shows the ground-based light curve with a fit to a standard
(Paczynski 1986) model. The residuals are severe. The model does not include
parallax. However, models that include parallax are quite similar. In the period
before the peak, we were constantly refitting the light curve with every new night’s
data in order to be able to predict the time of peak and thus the time of the second
ToO observation (which was supposed to be symmetric around the peak with the
first). It became increasingly clear that the event was not standard microlensing,
and we began to consider alternate possibilities, including binary source (also called
“xallarap”), binary lens, and variable source. The last was especially alarming
because if the variability were irregular, it would be almost impossible to model at
the high precision required to carry out this experiment, particularly because the
source might vary differently at I and L. Our concern about xallarap led us to
obtain radial-velocity measurements at several epochs near the peak. These turned
192
out to be the same within less than 1 km s−1, which ruled out xallarap for all but
the most pathologically face-on orbits. But regardless of the nature of the anomaly,
it could potentially cause serious problems because much more and better data are
required to accurately model complicated light curves compared to simple ones.
Hence, as described in § 5.2, recognition of the anomaly caused us to significantly
intensify our ground-based observations. Moreover, it also caused us to think more
carefully about how we would extract parallax information from a more complicated
light curve, and this led us to recognize a complication that affects even light curves
that do not suffer from additional anomalies.
5.3.1. Need for Additional Spitzer Observation
Recall that πE,τ is derived from the different peak time t0 as seen from Earth
and Spitzer: if the two Spitzer observations are timed so that the fluxes seen at
Earth are equal to each other (one on the rising and one on the falling wing of the
light curve), then the Spitzer fluxes will nevertheless be different, the first one being
higher if the event peaks at Spitzer before the Earth. However, the two Spitzer fluxes
may differ not only because Spitzer is displaced from the Earth along the direction
of lens-source relative motion, but also if it is displaced in the orthogonal direction
by different amounts at the two epochs. Gould (1999) recognized this possibility,
but argued that the amplitude of this displacement could be determined from the
measurement of πE,‖, which is derived from the ground-based parallax measurement
193
(i.e., from the asymmetry of the light curve). Hence, he argued that it would be
possible to correct for this additional offset and still obtain a good measurement of
πE,τ . Unfortunately, while it is true that the amplitude can be so derived, the sign of
this correction is more difficult to determine.
The problem can be understood by considering the work of Smith et al. (2003),
who showed that ground-based microlensing parallaxes are subject to a two-fold
degeneracy, essentially whether the source passes the lens on the same or opposite
side of the Earth compared to the Sun. Within the geocentric formalism of Gould
(2004), this amounts to switching the sign of the impact parameter u0 (which
by convention is normally positive) and leaving all other parameters essentially
unchanged. Smith et al. (2003) derived this degeneracy under the assumption that
the Earth accelerates uniformly during the course of the event. This is a reasonable
approximation for short events, but is grossly incorrect for OGLE-2005-SMC-001,
with its timescale of tE ∼ 0.5 yr. Nevertheless, this degeneracy can hold remarkably
well even for relatively long events, particularly for |u0| ≪ 1. The sign of the
correction to πE,τ depends essentially on whether the absolute value of the impact
parameter as seen from Spitzer is higher or lower than that seen from Earth. While
the algebraic displacement of Spitzer along this direction can be predicted from the
ground-based measurement of the parallax asymmetry (just as Gould 1999 argued),
its effect on the absolute value of u0 depends on whether u0 is positive or negative.
194
If the parallax is sufficiently large, then the ground-based light curve alone can
determine the sign of u0, and if it is sufficiently small, the difference between the two
solutions is also very small and may not be significant. However, for intermediate
values of the parallax, this degeneracy can be important. To understand how
an additional Spitzer observation can help, consider an idealized set of four
observations, one at peak, one at baseline, and two symmetrically timed around
peak (as seen from Earth). Call these fluxes fP , fb, f−, and f+. Consider now
the ratio [fP − fb]/[(f− + f+)/2 − fb]. If the impact parameter seen from Spitzer
is higher than that from Earth, this ratio will be lower for the Spitzer data than
for the ground-based data. (Note that blended light, which may be different for
the two sets of observations, cancels out of this expression.) In practice, we found
from simulations that it was not necessary to have the three observations timed
so perfectly. Hence, it was possible to plan both the additional DDT observation,
as well as the second ToO observation to occur during regularly scheduled IRAC
campaigns, so there was no 6.5 hour penalty for either observation. Hence, the net
cost to Spitzer time was less than would have been the case for a single precisely
timed ToO observation.
In brief, the above considerations demonstrate that the Gould (1999) technique
requires a total of four observations, not three as originally proposed. Moreover,
these observations do not have to be so precisely timed as Gould (1999) originally
imagined. See Figure 5.2 for a visual explanation of these arguments.
195
5.3.2. Eight-fold Way
Ultimately, we found that the anomaly was caused by a binary lens. Binary
lenses are subject to their own discrete degeneracies. This means that analysis of
the event is impacted by two distinct classes of discrete degeneracies: those due to
parallax and thoese due to binarity. The discrete parallax degeneracy, as summarized
in § 5.3.1, takes the impact parameter u0 → −u0 and (because u0 ≪ 1) leaves other
parameters changed by very little (see Gould 2004).
The discrete binary degeneracy is between wide and close binaries. Here “wide”
means b ≫ 1 and “close” means b ≪ 1, where b is the angular separation between the
two binary components in units of θE, (for which we give and justify our convention
in § 5.4.1). It gives (bc, qc) → (bw, qw),
bw =1 + qc
1 − qc
b−1c , qw =
qc
(1 − qc)2(5.3)
and leaves other parameters roughly unchanged. Here q is the mass ratio of the lens,
with the convention that for qw, the component closer to the source trajectory goes
in the denominator of the ratio. In both cases, the central magnification pattern is
dominated by a 4-cusp caustic.
This degeneracy was first discovered empirically by Albrow et al. (1999b) and
theoretically by Dominik (1999b) and can be incredibly severe despite the fact that
196
the two caustics are far from identical: the solutions can remain indistinguishable
even when there are two well-observed caustic crossings (An 2005).
In the present case, the deviations from a simple lens are not caused by caustic
crossings, but rather by a close approach to a cusp, which makes this degeneracy
even more severe. In fact, the caustic is symmetric enough that the approach may
almost equally well be to either of two adjacent cusps. That is, the cusp degeneracy
would be “perfect” if the cusp were four-fold symmetric, and it is only the deviation
from this symmetry that leads to distinct solutions for different cusp approaches.
In brief, the lens geometry is subject to an 8-fold discrete degeneracy, 2-fold for
parallax, 2-fold for wide/close binary, and 2-fold for different cusp approaches.
5.4. Binary Orbital Motion
Of course all binaries are in Kepler orbits, but it is usually possible to ignore
this motion in binary-lens analyses. Stated less positively, it is rarely possible to
constrain any binary orbital parameters from microlensing light curves. In the few
known exceptions, (Albrow et al. 2000a; An et al. 2002), the light curve contained
several well-measured caustic crossings that pinned down key times in the trajectory
to O(10−5) of an Einstein crossing time. Hence, we did not expect to measure binary
rotation in the present case in which there are no such crossings.
197
We were nevertheless led to investigate rotation by the following circumstance.
When we initially analyzed the event using only ground-based data, we found that
the best fit (for all 8 discrete solutions) had negative blended light, roughly −10%
of the source light, but with large errors and thus consistent with zero at the 1.5 σ
level. This was not unexpected. As mentioned in § 5.1, the component of the
microlens parallax perpendicular to the Sun, πE,⊥, is generally poorly constrained
by ground-based data alone. The reason for this is that small changes in πE,⊥, the
Einstein timescale tE, the impact parameter u0, the source flux fs, and the blended
flux, fb, all induce distortions in the light curve that are symmetric about the
peak, and hence all these parameters are correlated. Thus, the large errors (and
consequent possible negative values) of fb are just the obverse of the large errors in
πE,⊥. Indeed, this is the reason for adding in Spitzer observations.
However, we found that when the Spitzer observations were added, the blending
errors were indeed reduced, but the actual value of the blending remained highly
negative, near −10%. This prompted us to look for other physical effects that could
induce distortions in the light curve that might masquerade as negative blending.
First among these was binary orbital motion. Before discussing this motion, we first
review microlensing by static binaries.
198
5.4.1. Static Binary Lens Parameters
Point-lens microlensing is described by three geometric parameters, the impact
parameter u0 (smallest lens-source angular separation in units of θE), the time
at which the separation reaches this minimum t0, and the Einstein crossing time
tE = θE/µ, where µ is the lens-source relative proper motion.
In binary lensing, there are three additional parameters, the projected
separation of the components (in units of θE) b, the mass ratio of the components q,
and the angle of the source-lens trajectory relative to the binary axis, φ. Moreover
the first three parameters now require more precise definition because there is no
longer a natural center to the system. One must therefore specify where the center
of the system is. Then u0 becomes the closest approach to this center and t0 the
time of this closest approach. Finally, tE is usually taken to be the time required to
cross the Einstein radius defined by the combined mass of the two components.
In fact, while computer programs generally adopt some fairly arbitrary point
(such as the midpoint between the binaries or the binary center of mass), the
symmetries of individual events can make other choices much more convenient. That
is certainly the case here. Moreover, symmetry considerations that are outlined
below will also lead us to adopt a somewhat non-standard tE for the wide-binary
case, namely the timescale associated with the mass that is closer to the source
trajectory, rather than the total mass. To be consistent with this choice, we also
199
express b as the separation between wide components in units of the Einstein radius
associated with the nearest mass (rather than the total mass).
As is clear from Figure 5.1, the light curve is only a slightly perturbed version
of standard (point-lens) microlensing, which means that it is generated either by the
source passing just outside the central caustic of a close binary (that surrounds both
components) or just outside one of the two caustics of a wide binary (each associated
with one component). In either case, the standard point-lens parameters u0 and t0
will be most closely reproduced if the lens center is placed at the so-called “center
of magnification”. For close binaries this lies at the binary center of mass. For wide
binaries, it lies q(1 + q)−1/2b−1 from the component that is closer to the trajectory.
Hence, it is separated by approximately bq/(1 + q) from the center of mass. This will
be important in deriving equation (5.4), below.
For light curves passing close to the diamond caustic of a close binary, the
standard Einstein timescale (corresponding to the total mass) will be very close to
the timescale derived from the best-fit point lens of the same total mass. However,
for wide binaries, the standard Einstein timescale is longer by a factor (1 + q)1/2,
where q is the ratio of companion (whether heavier or lighter) to the component that
is approached most closely. This is because the magnification is basically due just
to this latter component (with the companion contributing only minor deviations
via its shear), while the usual Einstein radius is based on the total mass. For wide
200
binaries, we therefore adopt an Einstein radius and Einstein timescale reduced by
this same factor.
The advantage of adopting these parameter definitions is that, being fairly well
fixed by the empirical light curve, they are only weakly correlated with various other
parameters, some of which are relatively poorly determined.
5.4.2. Binary Orbital Parameters
While close and wide binaries can be (and in the present case are) almost
perfectly degenerate in the static case, binary orbital motion has a radically different
effect on their respective light curves. Note first that while 7 parameters would be
required to fully describe the binary orbital motion, even in the best of cases it has
not proven possible to constrain more than 4 of them (Albrow et al. 2000a; An et al.
2002). Two of these have already been mentioned, i.e., b and q, from the static case.
For the two binaries for which additional parameters have been measured, these have
been taken to be a uniform rotation rate ω and a uniform binary-expansion rate b.
This choice is appropriate for close binaries because for these, the center of
mass is the same as the center of magnification. Hence the primary effects of binary
motion are rotation of the magnification pattern around the center of magnification
and the change of the magnification pattern due to changing separation. Both
201
of these changes may be (probably are) nonuniform, but as the light curves are
insensitive to such subtleties, the simplest approximation is uniform motion.
However, the situation is substantially more complicated for wide binaries,
consideration of which leads to a different parameterization. Recall that the
wide-binary center of magnification is not at the center of mass, and indeed is close
to one of the components. Hence, as the binary rotates, the center of magnification
rotates basically with that component. Nominally, the biggest effect of this rotation
is the resulting roughly linear motion of the lens center of magnification relative to
the source. However, the linear component of this motion, i.e., the first derivative of
the motion at the peak of the event, is already subsumed in the source-lens relative
motion in the static-binary fit. The first new piece of information about the binary
orbital motion is the second derivative of this motion, i.e., the acceleration. Note
that the direction of this acceleration is known: it is along the binary axis. Moreover,
for wide binaries, Kepler’s Third Law predicts that the periods will typically be
much longer than the Einstein timescale, so to a reasonable approximation, this
direction remains constant during the event. We designate the acceleration (in
Einstein radii per unit time squared) as αb.
The parameter αb is related to the distance to the lens in a relatively
straightforward way. For simplicity, assume for the moment that the center of
magnification is right at the position of the component that is closer to the source
trajectory (instead of just near it). The 3-dimensional acceleration of the component
202
is a = GM2/(brE csc i)2, where M2 is the mass of the companion to the closer
component, rE is the physical Einstein radius, and i is the angle between the binary
axis and the line of sight. Hence, αb = a sin i/rE = (q/b2)GM1 sin3 i/r3E, where M1
is the mass of the closer component. This can be simplified with the aid of the
following three identities: 1) 4GM1/c2 = rEθE, 2) rE = DLθE, 3) rE/rE = DLS/DS.
Here DL and DS are the distances to the lens and source, and DLS = DS − DL. We
then find,
αb
πE
=γc2 sin3 i
4DLAU
D2S
D2LS
, (5.4)
where γ ≡ q/b2 is the shear. Note that the shear determines the size of the caustic
and so is one of the parameters that is most robustly determined from the light curve.
If we were to take account of the offset between M1 and the center of magnification,
the r.h.s. of equation (5.4) would change fractionally by the order of b−2.
5.4.3. Summary of Parameters
Thus, the model requires a total of 10 geometrical parameters in addition
to the 8 flux parameters (fs and fb for each of the 3 ground-based observatories
plus Spitzer). These are the three standard microlensing parameters, t0, u0, tE
(the time of closest approach, separation at closest approach in units of θE, and
Einstein timescale), the three additional static-binary parameters b, q, φ (the binary
separation in units of θE, the binary mass ratio, and the angle of the source trajectory
203
relative to the binary axis), the two binary-orbit parameters, b and either ω (close)
or αb (wide), and the two parallax parameters, πE = (πE,N , πE,E), where N and E
represent the North and East directions. These must be specified for 8 different
classes of solutions.
5.5. Search for Solutions
We combine two techniques to identify all viable models of the observed
microlensing light curve: stepping through parameter space on a grid (grid-search)
and Markov Chain Monte Carlo (MCMC) (Doran & Mueller 2003).
We begin with the simplest class of binary models, i.e., without parallax or
rotation. Hence, there are six geometric parameters, t0, u0, tE, b, q, and φ. We
consider classes of models with (b, q, φ) held fixed, and vary (t0, u0, tE) to minimize
χ2. (Note that for each trial model, fs and fb can be determined algebraically from
a linear fit, so their evaluations are trivial.) This approach identifies four solutions,
i.e. (2 cusp-approaches)×(wide/close degeneracy). We then introduce parallax, and
so step over models with (b, q, φ, πE,N , πE,E) held fixed, working in the neighborhood
of the (b, q, φ) minima found previously. The introduction of parallax brings with it
the ±u0 degeneracy, and so there are now 8 classes of solutions.
Next we introduce rotation. We begin by employing grid search and find,
somewhat surprisingly, that several of the eight (close/wide, ±u0, on/off-axis cusp)
204
classes of solutions have more than one minimum in (ω, b) for close binaries or (αb, b)
for wide binaries. We then use each of these solutions as seeds for MCMC and find
several additional minima that were too close to other minima to show up in the
grid search. Altogether there are 20 separate minima, 12 for close binaries and 8 for
wide binaries.
We use MCMC to localize our solutions accurate to about 1σ and to determine
the covariance matrix of the parameters. In MCMC, one moves randomly from one
point in parameter space to another. If the χ2 is lower, the new point is added to
the “chain”. If not, one draws a random number and adds the new point only if
this number is lower than relative probability (exp(−∆χ2/2)). If the parameters
are highly correlated (as they are in microlensing) and the random trial points are
chosen without reference (or without proper reference) to these covariances, then the
overwhelming majority of trial points are rejected. We therefore sample parameter
space based on the covariance matrix drawn from the previous “links” in the chain.
During the initial ”burning in” stage of the MCMC, we frequently evaluate the
covariance matrix (every 100 “links”) until it stabilizes. Then we hold the covariance
matrix fixed in the simulation (Doran & Mueller 2003). From the standpoint of
finding the best χ2, one can combine linear fits for the flux parameters fs and fb with
MCMC for the remaining parameters. However, since part of our MCMC objective
is to find the covariances, we treat (fs, fb)OGLE as MCMC parameters while fitting
for the remaining four flux parameters analytically.
205
In order to reduce the correlations among search parameters, we introduce
the following parameter combinations into the search: teff ≡ u0tE, fbase = fs + fb,
fmax ≡ fs/u0, and γ ≡ q/b2 (wide) or Q ≡ b2q/(1 + q)2 (close). Because these are
directly related to features in the light curve, they are less prone to variation than
the naive model parameters. The effective timescale teff is 1/√
12 of the full-width
at half-maximum, fbase is just the flux at baseline, and fmax is the flux at maximum.
The scale of the Chang-Refsdal (1979, 1984) distortion (which governs the binary
perturbation) is given by the shear γ for wide binaries and by the quadrupole Q for
close binaries.
The MCMC “chain” automatically samples points in the neighborhood of the
minimum with probability density proportional to their likelihoods, exp(−χ2/2).
Somewhat paradoxically, this means that for higher-dimensional problems, it does
not actually get very close to the minimum. Specifically, for a chain of length
N sampling an m-dimensional space, there will be only one point for which ∆χ2
(relative to the minimum) obeys ∆χ2 < Y , where P [χ2(m dof) < Y ] = N−1. Hence,
for m ≫ 1 one requires a chain of length N ∼ e1/2(m/2)!2m/2 to reach 1σ above
the true minimum, or N ∼ 104.9 for m = 12. Further improvements scale only
as ∼ N1/m. Hence, to find the true minimum, we construct chains in which the
rejection criterion is calculated based on exp(−25χ2/2) rather than exp(−χ2/2).
However, when calculating error bars and covariances, or when integrating over the
MCMC, we use the exp(−χ2/2) chain.
206
5.5.1. Convergence
A general problem in MCMC fitting is to determine how well the solution
“converges”, that is, how precisely the “best fit” solution is reproduced when the
initial seed solution is changed. In our case, the problem is the opposite: MCMC
clearly does not converge to a single minimum, and the challenge is to find all the
local minima. We described our procedure for meeting this challenge above, first
by identifying 8 distinct regions of parameter space semi-analytically, and then
exploring these with different MCMC seeds. This procedure led to well-defined
minima (albeit a plethora of them), whose individual structures were examined by
putting boundaries into the MCMC code that prevented the chain from “drifting”
into other minima. We halted our subdivision of parameter space when the structure
on the χ2 surface fell to of order ∆χ2 ∼ 1, regarding the ∆χ2 ∼< 1 region as the zone
of convergence. As mentioned above, we located the final minimum by artificially
decreasing the errors by a factor of 5, again making certain that the resulting
(exaggerated) χ2 surface was well behaved.
5.6. Solution Triage
Table 1 gives parameter values and errors for a total of 20 different discrete
solutions, which are labeled by (C/W) for close/wide binary, (+/−) for the sign
of u0, (‖/⊥) for solutions that are approximately parallel or perpendicular to the
207
binary axis, and then by alphabetical sequential for different viable combinations of
rotation parameters. In Table 1, we allow a free fit to blending.
Note that some solutions have severe negative blending. While it is possible in
principle that these are due to systematic errors, the fact that other solutions have
near-zero blending and low χ2 implies that the negative-blending solutions probably
have lens geometries that do not correspond to the actual lens. Other solutions have
relatively high χ2 and so are also unlikely. Solutions with χ2 that is less than 9 above
the minimum have their χ2 displayed in boldface, while the remaining solutions are
shown in normal type.
There are several reasons to believe that the blending is close to zero. First, the
source appears isolated on our K-band NICMOS HST images, implying that if it is
blended, this blended light must be within of order 100 mas of the source. As the
density of sources in the HST images is low, this is a priori very unlikely unless the
blended light comes from a companion to the source or the lens. Moreover, all the
near neighbors of the source on the HST image are separately resolved by the OGLE
photometry, so if there is blended light in the OGLE photometry then it must also
be blended in the HST images. Second, the V − I color of the source, which can be
derived by a model-independent regression of V flux on I flux, is identical within
measurement error to the color of the baseline light from the combined source and
(possible) blend. This implies that either 1) the source is unblended, 2) it is blended
by another star of nearly the same color as the source, or 3) it is blended by a star
208
that is so faint that it hardly contributes to the color of the blend. The second
possibility is strongly circumscribed by the following argument. The source is a
clump star. On the SMC CMD, there are first ascent giants of color similar to the
source from the clump itself down to the subgiant branch about 2.5 mag below the
clump. Thus, in principle, the blended light could lie in the range 0.1 ∼< fb/fs ∼< 1
without causing the baseline color to deviate from the source color. However, there
are no solutions in Table 1 with blending this high. There is one low-χ2 solution
with fb/fs ∼ 0.06, but this would be 3 mag below the clump and so below the
subgiant branch. If we restrict the blend to turnoff colors, i.e., V − I = 0.6, then
the color constraint implies fb/fs = 0.01 ± 0.01, which is negligibly small from our
perspective. Thus, while it is possible in principle that the source is blended with
a reddish subgiant, the low stellar density in the HST image, the low frequency of
such subgiants on the CMD, and the difficulty of matching color constraints even
with such a star, combine to make this a very unlikely possibility. We therefore
conduct the primary analysis assuming zero blending, as listed in Table 2. Again,
the ∆χ2 < 9 solutions are marked in boldface. Note that most solutions in Table
1 are reasonably consistent with zero blending, which is the expected behavior for
the true solution provided it is not corrupted by systematic errors. For these, the χ2
changes only modestly from Table 1 to Table 2. However, several solutions simply
disappear from Table 2. This is because, in some cases, forcing the blending to zero
has the effect of merging two previously distinct binary-rotation minima.
209
The main parameters of interests are πE and the closely related quantity v.
However, for reasons that will be explained below, v can be reliably calculated only
for the close solutions, but not the wide solutions. Hence, we focus first on πE.
Figure 5.3 shows error ellipses for all solutions, color-coded by according to ∆χ2
relative to the global minimum. The right-hand panels show the solutions presented
in Tables 1 and 2, which include the Spitzer data. The left-hand panels exclude these
data. The upper panels are based on a free fit for blending whereas the lower panels
are constrained to zero blending for the OGLE dataset. Comparing the two upper
panels, it is clear that when blending is a fitted parameter, the Spitzer data reduce
the errors in the πE,⊥ direction by about a factor of 3. However, once the blending is
fixed (lower panels), the Spitzer data have only a modest additional effect. This is
expected since πE,⊥ is correlated with blending and one can simultaneously constrain
both parameters either by constraining πE,⊥ with Spitzer data or just by fixing the
blending by hand. Figure 5.4 shows the best overall zero-blending fit to the data.
5.6.1. Wide-Binary Solutions
All eight wide solutions are effectively excluded. When a free fit to blending is
allowed, their χ2 values are already significantly above the minimum. When zero
blending is imposed, only five independent solutions survive and those that have
negative blending are driven still higher. In all five cases to ∆χ2 > 16.
210
5.6.2. Close-Binary Solutions
Eight Of the 11 close solutions survive the imposition of zero blending. Of
these, only one has ∆χ2 < 4, and only another two have ∆χ2 < 8.7 relative to
the best zero-blending solution. We focus primarily on these three, which all have
best-fit parallaxes in the range 0.030 < πE < 0.047 and projected velocities in the
range 210 km s−1 < v < 330 km s−1. These projected velocities are of the order
expected for halo lenses but are about 1 order of magnitude smaller than those
expected for SMC self lensing. Note that because there are multiple solutions, the
errors in v are highly non-Gaussian and are best judged directly from Figures 5.5
and 5.6 (below) rather than quoting a formal error bar.
5.7. Lens Location
When Alcock et al. (1995) made the first measurement of microlensing parallax,
they developed a purely kinematic method of estimating the lens distance (and thus
mass) based on comparison of the measured value of v with the expected kinematic
properties of the underlying lens population. Starting from this same approach,
Assef et al. (2006) devised a test that uses the microlens parallax measurement to
assign relative probabilities to different lens populations (e.g., SMC, Galactic halo,
Galactic disk) based solely on the kinematic characteristics of these populations, and
without making prior assumptions about either the mass function or the density
211
normalization of any population. This is especially useful because, while a plausible
guess can be given for the mass function and normalization of SMC lenses, nothing is
securely known about a putative Galactic halo population. In the present case, the
high projected velocity v immediately rules out Galactic-disk lenses, so we restrict
consideration to the other two possibilities.
We begin by recapitulating the Assef et al. (2006) test in somewhat more
general form. The differential rate of microlensing events of fixed mass M (per
steradian) is
d(6)Γ(M) ≡ d6Γ(M)
d2vL d2vSdDLdDS
= fL(vL)fS(vS)D2SνS(DS)νL(DL)2vrE
D2LS
D2S
, (5.5)
where fL(vL) and fS(vS) are the two-dimensional normalized velocity distributions
of the lenses and sources, νL and νS are the density distributions of the lenses and
sources, v is an implicit function of (DL, DS,vL,vS), and rE is an implicit function
of (DL, DS,M). The method is simply to evaluate the likelihood
L =
∫
d2vL d2vSdDLdDS d(6)Γ(M) exp[−∆χ2(v)/2]∫
d2vL d2vSdDLdDS d(6)Γ(M)(5.6)
for each population separately, and then take the ratio of likelihoods for the two
populations: Lratio ≡ Lhalo/LSMC. Here, ∆χ2(v) is the difference of χ2 relative to
the global minimum that is derived from the microlensing light curve. Note that all
212
dependence on M disappears from L. Equation (5.6) can be simplified in different
ways for each population. In both cases, we express the result in terms of Λ,
Λ ≡ v
v2=
πEtEAU
, (5.7)
rather than v, because it is better behaved in the neighborhood of Λ = 0, just as
trigonometric parallax is better behaved near zero than its inverse, distance.
5.7.1. Halo Lenses
For halo lenses, the depth of the SMC is small compared to DLS and the internal
dispersion of SMC sources is small compared to the bulk motion of the SMC. Hence,
one can essentially drop the three integrations over SMC sources, implying
v =DS
DLS
vL − v⊕ − DL
DLS
vSMC, (5.8)
where vL, v⊕, and vSMC are the velocities of the lens, the “geocentric frame”, and the
SMC (all in the Galactic frame) projected on the plane of the sky. The “geocentric
frame” is the frame of the Earth at the time of the peak of the event. It is the most
convenient frame for analyzing microlensing parallax (Gould 2004) and for this event
is offset from the heliocentric frame by
(v⊙,N, v⊙,E) − (v⊕,N, v⊕,E) = (−24.9,−15.5) km s−1. (5.9)
213
We assume an isotropic Gaussian velocity dispersion for the lenses with
σhalo = vrot/√
2, where vrot = 220 km s−1. After some manipulations (and dropping
constants that would cancel out between the numerator and denominator) we obtain
Lhalo =
∫
exp[−∆χ2(Λ)/2]ghalo(Λ, DL)dDLdΛNorthdΛEast∫
ghalo(Λ, DL)dDLdΛNorthdΛEast
, (5.10)
where
ghalo(Λ, DL) = exp(−v2L/2σ2
halo)νhalo(DL)Λ−5D7/2LS D
1/2L , (5.11)
and where vL is an implicit function of Λ through equations (5.7) and (5.8). We adopt
νhalo(r) = const/(a2halo + r2), where ahalo = 5 kpc and the Galactocentric distance is
R0 = 7.6 kpc. We adopt v⊙ = (10.1, 224, 6.7) km s−1 in Galactic coordinates, which
leads to a 2-dimensional projected velocity of (v⊙,N, v⊙,E) = (126,−126) km s−1
toward the SMC source. From our assumed distances DSMC = 60 kpc and the SMC’s
measured proper motion of (−1.17± 0.18, +1.16± 0.18) (Kallivayalil et al. 2006), we
obtain,
(vSMC,N, vSMC,E) − (v⊙,N, v⊙,E) = (−333, 330) km s−1. (5.12)
214
5.7.2. SMC Lenses
For the SMC, we begin by writing
v =vL/DL − vS/DS
D−1L − D−1
S
= vS +DS
DLS
∆v → vSMC − v⊕ +DSMC
DLS
∆v, (5.13)
where vL and vS are now measured in the geocentric frame and ∆v ≡ vL − vS.
The last step in equation (5.13) is an appropriate approximation because the SMC
velocity dispersion is small compared to its bulk velocity and DLS ≪ DS.
We now assume that the sources and lenses are drawn from the same population,
which implies that the dispersion of ∆v is larger than those of vL and vS by 21/2.
We assume that this is isotropic with Gaussian dispersion σSMC. Again making
the approximation DS → DSMC, we can factor the integrals in equation (5.6) by
evaluating the density integral
η(DLS) =∫
dDLν(DL)ν(DL + DLS), (5.14)
where ν = νL = νS. We then obtain
LSMC =
∫
exp[−∆χ2(Λ)/2]gSMC(Λ, DLS)dDLSdΛNorthdΛEast∫
gSMC(Λ, DLS)dDLSdΛNorthdΛEast
, (5.15)
where
gSMC(Λ, DL) = exp[−(∆v)2/4σ2SMC]η(DLS)D
7/2LS Λ−5, (5.16)
215
and where ∆v is an implicit function of Λ through equations (5.7) and (5.13).
5.7.3. SMC Structure
In order to evaluate equation (5.16) one must estimate σSMC as well as the
SMC density ν along the line of sight, which is required to compute η(DLS). This
requires an investigation of the structure of the SMC.
In sharp contrast to its classic “Magellanic irregular” appearance in blue light,
the SMC is essentially a dwarf elliptical galaxy whose old population is quite regular
in both its density (Zaritsky et al. 2000) and velocity (Harris & Zaritsky 2006)
distributions. Harris & Zaritsky (2006) find that after removing an overall gradient
(more below), the observed radial velocity distribution is well fit by a Gaussian with
σ = 27.5 km s−1, which does not vary significantly over their 4 × 2 (RA,Dec) field.
We adopt the following SMC parameters for the 1-dimensional dispersion
σSMC, the tidal radius rt, and the density profile (along the line of sight),
ν(r) = const/(a2SMC + r2)n/2,
σSMC = 25.5 km s−1, rt = 6.8 kpc, aSMC = 1 kpc, n = 3.3, (5.17)
as we now justify.
The σ = 27.5 km s−1 dispersion reported by Harris & Zaritsky (2006) includes
measurement errors. When the reported errors (typically 10 km s−1 per star) are
216
included in the fit, this is reduced to σSMC = 24.5 km s−1. However, these reported
errors may well be too generous: the statistical errors (provided by D. Zaritsky 2006,
private communication) are typically only 2–3 km s−1, the reported errors being
augmented to account for systematic errors. If the statistical errors are used, we
find σSMC = 26.5 km s−1. D. Zaritsky (2006, private communication) advocates an
intermediate value for this purpose, which leads to σSMC = 25.5 km s−1.
The old stellar population in the SMC is rotating at most very slowly. Harris
& Zaritsky (2006) report a gradient across the SMC of 8.3 km s−1 deg−1, which
they note is a combination of the traverse velocity of the SMC and the solid-body
component of internal bulk motion. As the SMC proper motion was poorly
determined at the time, Harris & Zaritsky (2006) did not attempt to disentangle
these two. However, Kallivayalil et al. (2006) have now measured the SMC proper
motion to be (µN, µE) = (−1.16 ± 0.18, 1.17 ± 0.18) mas yr−1. We refit the Harris
& Zaritsky (2006) data and find ∇vr = (−10.5 ± 1.4, 5.0 ± 0.7) km s−1 deg−1.
Subtracting these two measurements (including errors and covariances) we obtain a
net internal rotation of 5.2 ± 1.6 km s−1 deg−1 with a position angle of 183 ± 30.
Since this rotation is due north-south (within errors), a direction for which the data
have a baseline of only ∼ ±1, and since solid-body rotation is unlikely to extend
much beyond the core, it appears that the amplitude of rotational motion is only
about 5 km s−1, which is very small compared to the dispersion, σSMC. Hence, we
ignore it. In addition, we note that this rotation is misaligned with the HI rotation
217
axis (Stanimirovic et al. 2004) by about 120, so its modest statistical significance
may indicate that it is not real.
As we describe below, our likelihood estimates are fairly sensitive to the tidal
radius rt of the SMC. Proper determination of the tidal radius is a complex problem.
Early studies, made before dark matter was commonly accepted, were carried out for
Kepler potentials and in analogy with stellar and solar-system problems (e.g. King
1962). Read et al. (2006) have calculated tidal radii for a range of potentials and also
for an orbital parameter α that ranges from −1 for retrograde to +1 for prograde.
We choose α = 0 as representative and evaluate their expression for an isothermal
potential and for the satellite being close to pericenter (as is appropriate for the
SMC):
rt
D=
σsat
σhost
(
1 +2 ln ξ
1 − ξ−2
)−1/2
. (5.18)
Here, ξ is the ratio of the apocenter to the pericenter of the satellite orbit, D is the
pericenter distance, and σsat and σhost are the respective halo velocity dispersions.
Because we adopt an n = 3.3 profile, the SMC halo velocity dispersion is larger than
its stellar dispersion by (3.3/2)1/2, implying that σsat/σhost = 3.31/2σSMC/vrot = 0.21.
We adopt ξ = 3 based on typical orbits found by Kallivayalil et al. (2006), which
yields rt = 0.107 DSMC = 6.8 kpc. Note, moreover, that at ξ = 3, d ln rt/d ln ξ ∼ 0.24,
so the tidal radius is not very sensitive to the assumed properties of the orbit.
218
The most critical input to the likelihood calculation is the stellar density along
the line of sight. Of course, images of the SMC give direct information only about
its surface density as a function of position. One important clue to how the two
are related comes from the HI velocity map of Stanimirovic et al. (2004), which
shows an inclined rotating disk with the receding side at a position angle of ∼ 60
(north through east). This is similar to the ∼ 48 position angle of the old-star
optical profile found by Harris & Zaritsky (2006) based on data from Zaritsky et
al. (2000). Hence, the spheroidal old-stellar population is closely aligned to the HI
disk, although (as argued above) the stars in the SMC are pressure- rather than
rotationally-supported. Our best clue to the line-of-sight profile is the major-axis
profile exterior to the position of the source, which lies about 1 kpc to the southwest
of the Galaxy center, roughly along the apparent major axis. The projected surface
density is falling roughly as r−2.3 over the ∼ 1 kpc beyond the source position, from
which we derive a deprojected exponent of n = 3.3. This is similar to the exponent
for Milky Way halo stars. While there is a clear core in the star counts, this may be
affected by crowding, and the core seems to have little impact on the counts beyond
the source position (which is what is relevant to the density profile along the line of
sight). Hence, we adopt aSMC = 1 kpc.
The likelihood ratio is most sensitive to the assumptions made about the
stellar density in the outskirts of the SMC, hence to the power law and tidal radius
adopted. This seems strange at first sight because the densities in these outlying
219
regions are certainly extremely small, whatever their exact values. This apparent
paradox can be understood as follows. For DLS ∼< rt, the leading term in η is
η(DLS) ∼ [D2LS + 4(b2 + a2
SMC)]−n/2, where b = 1.0 kpc is the impact parameter.
In the outskirts, this implies η(DLS) ∼ D−nLS . The integrand in the numerator of
equation (5.15) then scales as exp−[∆v(Λ, DLS)]2/4σ2
SMCD7/2−nLS . For fixed Λ, ∆v
is a rising function of DLS, and so for sufficiently large DLS, the exponential will
eventually cut off the integral. However, for the measured value of Λ (corresponding
to ∆v ≡ |Λ/Λ2 − (vSMC − v⊙)| ∼ 300 km s−1, the cutoff does not occur until
DLS ∼ (2σSMC/v)DSMC ∼ 10 kpc. Thus, as long as the density exponent remains
n ∼< 3.5 and as long as the density is not actually cut off by rt, the integral keeps
growing despite the very low density. On the other hand, a parallel analysis shows
that the denominator in equation (5.15) is quite insensitive to assumptions about
the outer parts of the SMC.
5.7.4. Likelihood Ratios
Using our adopted SMC parameters (eq. [5.17]), we find Lratio ≡ Lhalo/LSMC =
27.4. As discussed in § 5.7.3, this result is most sensitive to the outer SMC density
profile, set by the exponent n and the cutoff rt. If the tidal radius is increased from
rt = 6.8 kpc to rt = 10 kpc and other parameters are held fixed, Lratio = 30.3. If the
exponent is reduced from n = 3.3 to n = 2.7, then Lratio = 29.1. Hence, halo lensing
220
is strongly favored in any case, but by an amount that would vary noticeably if any
of our key model parameters were markedly off.
In carrying out these evaluations, we integrated equations (5.10) and (5.15) over
all solutions with ∆χ2 < 9. These solutions are grouped around three close-binary
minima shown in Table 2. We carried out the integration in two different ways:
over the discrete ensemble of solutions found by the Markov chains and uniformly
over the three error ellipses that were fit to these chains. The results do not differ
significantly. Figures 5.5 and 5.6 show Λ for the Markov-chain solutions superposed
on likelihood contours for SMC and halo lenses respectively.
5.7.5. Kepler Constraints for Close Binaries
Binaries move in Kepler orbits. In principle, if we could measure all the Kepler
parameters then these, together with the measured microlens parallax πE (and
the approximately known source distance DS), would fix the mass and distance to
the lens. While the two orbital parameters that we measure are not sufficient to
determine the lens mass and distance, they do permit us to put constraints on these
quantities.
Consider first the special case of a face-on binary in a circular orbit. Kepler’s
Third Law implies that GM/(brE)3 = ω2 where M is the mass of the lens, b is the
221
binary separation in units of the local Einstein radius rE, and ω is the measured
rotation parameter. Since rE = (DLS/DS)rE, this implies,
c2
4rEb3
D2S
DLD2LS
= ω2 (face − on circular) (5.19)
If one considers other orientations but remains restricted to face-on orbits, then
this equation becomes a (“greater than”) inequality because at fixed projected
separation, the apparent angular speed can only decrease.
Further relaxing to the case of non-circular orbits but with b/b = 0 (i.e., the
event takes place at pericenter), the rhs of equation (5.19) is halved, ω2 → ω2/2,
because escape speed (appropriate for near-parabolic orbits) is√
2 larger than
circular speed. Finally, after some algebra, and again working in the parabolic limit,
one finds that including non-zero radial motion leads to ω2/2 → [ω2 + (b/b)2/4]/2,
and thus to
DLD2LS
D2S
≤ c2
2rEb3
[
ω2 +(b/b)2
4
]−1
. (5.20)
If the rhs of this equation is sufficiently small, then only lenses that are near
the Sun (DL ≪ DS) or near the SMC (DLS ≪ DS) will satisfy it. In practice, we
find that this places no constraint on SMC lenses, but does restrict halo lenses to
be relatively close to the Sun, with the limit varying from ∼ 2 kpc to ∼ 10 kpc
depending on the particular solution being probed and the MCMC realization of
222
that solution. Integrating over the entire Markov chain, we find that Lratio is reduced
by a factor three, from 27.4 to 11.2.
5.7.6. Kepler Constraints for Wide Binaries
As discussed in § 5.6.1, all wide solutions are ruled out by their high χ2.
Nevertheless, for completeness it is instructive to ask what sort of constraints could
be put on the lens from measurement of the acceleration parameter αb if these
solutions had been accepted as viable. We rewrite equation (5.4) in terms of the
parameter combination T ≡ (πEγ/αb)1/2, which has units of time,
√
DL
60 kpc
DLS
DS
=cT
2
√
√
√
√
sin3 i
60 kpc AU< 0.28
T
yr. (5.21)
The five wide-binary zero-blending solutions listed in Table 2 have, respectively,
T = (0.039, 0.032, 0.035, 0.017, 0.015) yr. Thus, if these solutions had been viable,
the lens would have been firmly located in the SMC (or else, improbably, within 6
pc of the Sun). This demonstrates the power of this constraint for wide binaries,
which derives from light-curve features that arise from the motion of the center of
magnification relative to the center of mass. These obviously do not apply to close
binaries for which the center of magnification is identical to the center of mass.
However, this same relative motion makes it more difficult to constrain the
nature of the lens from measurements of the projected velocity, v. This is because
223
the parallax measurement directly yields only the projected velocity of the center of
magnification, whereas models of the lens populations constrain the motion of the
center of mass. If b is measured, it is straight forward to determine the component
of the difference between these two that is parallel to the binary axis, namely
∆v‖ =b
1 + q
AU
πE
. (5.22)
On the other hand, the measurement of αb indicates that there is motion in the
transverse direction, but specifies neither the amplitude nor sign. For example, for
circular motion with the line of nodes perpendicular to the binary axis,
∆v⊥ = ±√
αbb
1 + q
AU
πE
csc i, (5.23)
and hence,
∆v⊥v
= ±√
αbb
1 + qtE csc i, (5.24)
If this ratio (modulo the csc i term) is small, then the internal motion can be ignored
(assuming the inclination is not unluckily low). In the present case, the quantity
[αbb/(1 + q)]1/2tE for the five respective wide-binary solutions shown in Table 2 is
(0.23,0.36,0.25,1.04,0.77), implying that constraints arising from measurement of v
would be significantly weakened.
Again, however, since the wide solutions are in fact ruled out, all of these
results are of interest only for purposes of illustration.
224
5.7.7. Constraints From (Lack of) Finite-Source Effects
Finite source effects are parameterized by ρ ≡ θ∗/θE, where θ∗ is the angular
size of the source. Equivalently, ρ = (DL/DLS)(r∗/rE), where r∗ = DSθ∗ is the
physical source radius. Since the source is a clump giant, with r∗ ∼ 0.05 AU, this
implies
ρ = 0.021DL
60 kpc
5 kpc
DLS
πE
0.035(5.25)
All models described above assume ρ = 0. Since the impact parameter is u0 ∼ 0.08
and the semi-diameter of the caustic is 2Q ∼ 0.02, finite source effects should be
pronounced for ρ ∼> u0 − 2Q ∼ 0.06. In fact, we find that zero-blending models with
ρ ∼< 0.05 are contraindicated by ∆χ2 ∼ (ρ/0.0196)4. This further militates against
SMC lenses, which predict acceptable values of ρ only for relatively large source-lens
separations. If we penalize solutions with finite source effects by ∆χ2 = (ρ/0.0196)2,
we find that Lratio rises from 11.2 to 20.3.
5.8. Discussion
By combining ground-based and Spitzer data, we have measured the
microlensing parallax accurate to 0.003 units, by far the best parallax measurement
yet for an event seen toward the Magellanic Clouds. Our analysis significantly favors
225
a halo location for the lens over SMC self lensing. It excludes altogether lensing by
Galactic disk stars. Of course, with only one event analyzed using this technique, an
SMC location cannot be absolutely excluded based on the ∼ 5% probability that we
have derived. The technique must be applied to more events before firm conclusions
can be drawn. Spitzer itself could be applied to this task. Even better would be
observations by the Space Interferometry Mission (SIM) (Gould & Salim 1999), for
which time is already allocated for 5 Magellanic Cloud events. SIM would measure
both πE and θE and so determine (rather than statistically constrain) the position
of the lens.
Assuming that the lens is in the halo, what are its likely properties? The mass
is
M = 10.0 M⊙
(
πE
0.047
)−2 πrel
180 µas, (5.26)
where the fiducial πE is the best-fit value and the fiducial πrel is for a “typical”
halo lens, which (after taking account of the constraint developed in § 5.7.5) would
lie at about 5 kpc. In the best-fit model, the mass ratio is q = 2.77, which would
imply primary and secondary masses of 7.3 and 2.7 M⊙ respectively. The projected
separation would be b = 0.22 Einstein radii, i.e., 4.7 AU(DLS/DS). Note that at
these relatively close distances, main-sequence stars in this mass range would shine
far too brightly to be compatible with the strict constraints on blended light. Hence,
the lenses would have to be black holes.
226
We must emphasize that the test carried out here uses only kinematic attributes
of the lens populations and assumes no prior information about their mass functions.
Note, in particular, that if we were to adopt as a priori the lens mass distribution
inferred by the MACHO experiment (Alcock et al. 2000), then the MACHO
hypothesis would be strongly excluded. Recall from § 5.7.5 (eq. [5.20]), that the
lens is either very close to the SMC or within 10 kpc of the Sun. However, from
equation (5.26), for DL < 10 kpc, we have M ∼> 4.6 M⊙, and hence a primary mass
Mq/(1 + q) ∼> 3.4 M⊙. From Figures 12–14 of Alcock et al. (2000), such masses are
strongly excluded as the generators of the microlensing events they observed toward
the LMC.
Hence, if the MACHO hypothesis favored by this single event is correct, the
MACHO population must have substantially different characteristics from those
inferred by Alcock et al. (2000), in particular, as mentioned above, a mass scale of
order 10M⊙ or perhaps more. Alcock et al. (2001b) limit the halo fraction of such
objects to < 30% at M = 10 M⊙ and to < 100% at M = 30 M⊙. At higher masses,
the sensitivity of microlensing surveys deteriorates drastically. However, Yoo et al.
(2004c) derived important limits in this mass range from the distribution of wide
binaries in the stellar halo, putting an upper limit at 100% for M = 40 M⊙ and at
20% for M > 200 M⊙. Thus, if the MACHO hypothesis is ultimately confirmed,
this would be a new population in the mass “window” identified in Figure 7 of Yoo
et al. (2004c) between the limits set by microlensing and wide-binary surveys. This
227
again argues for the importance of obtaining space-based parallaxes on additional
microlensing events.
As we discussed in some detail in § 5.7.3 (particularly the last paragraph) our
conclusion regarding the relatively low probability of the lens being in the SMC
rests critically on the assumption that the SMC lens population falls off relatively
rapidly along the line of sight. (We adopted an n = 3.3 power law.) This is because
equation (5.16) scales ∝ D7/2−nLS . If the SMC had a halo lens population with a
much shallower falloff than we have assumed, this term could dominate the integral
in equation (5.15) even if the overall normalization of the halo were relatively low.
Hence, Magellanic Cloud halo lensing could provide an alternate explanation both
for this SMC event and the events seen by MACHO toward the LMC (Calchi
Novati et al. 2006). As mentioned above, SIM could easily distinguish between this
conjecture and the halo-lens hypothesis.
Finally, we remark that analysis of this event was extraordinarily difficult
because it was a “weak” (i.e., non-caustic-crossing) binary. If it had been either
a single-mass lens or a caustic-crossing binary, it would have been much easier to
analyze and the inferences regarding its location (in the halo or the SMC) would
have been much more transparent. We therefore look forward to applying this same
technique to more typical events.
228
Model t0 u0 tE q Q : γ φ πE,N πE,E ω : α1/2b b/b Fb/Fbase
χ2 (day) (×100) (day) (×100) (deg) (yr−1) (yr−1)
C-‖a 3593.751 −8.729 174.17 2.77 0.980 21.70 −0.0342 0.0319 0.073 5.60 0.003
1455.40 0.040 0.532 8.13 0.33 0.066 1.10 0.0076 0.0047 0.047 0.46 0.061
C+‖a 3593.687 7.763 190.03 2.15 0.850 19.34 −0.0120 0.0236 −0.141 5.83 0.115
1475.74 0.033 0.351 6.58 0.26 0.046 1.26 0.0039 0.0035 0.027 0.48 0.040
C-‖b 3593.560 −9.567 160.61 0.98 0.999 9.23 −0.0069 0.0298 0.331 0.40 −0.101
1462.89 0.036 0.509 6.69 0.25 0.063 1.89 0.0068 0.0049 0.032 0.61 0.060
C+‖b 3593.643 9.321 163.41 1.11 1.002 11.24 −0.0068 0.0199 −0.167 1.33 −0.071
1466.84 0.040 0.346 4.52 0.15 0.046 1.12 0.0036 0.0032 0.029 0.26 0.040
(cont’d)
Table 5.1. OGLE-2005-SMC-001 Light Curve Models: Free Blending.
229
Table 5.1—Continued
Model t0 u0 tE q Q : γ φ πE,N πE,E ω : α1/2b b/b Fb/Fbase
χ2 (day) (×100) (day) (×100) (deg) (yr−1) (yr−1)
C-‖c 3593.473 −9.020 171.06 0.29 0.837 −3.34 0.0070 0.0069 0.746 −3.87 −0.030
1463.52 0.042 0.352 5.89 0.04 0.041 1.31 0.0051 0.0039 0.069 0.44 0.043
C+‖c 3593.460 9.207 167.46 0.29 0.853 −3.78 0.0038 0.0029 −0.743 −3.92 −0.053
1465.15 0.042 0.276 4.15 0.03 0.035 1.24 0.0032 0.0031 0.070 0.41 0.033
C-‖d 3593.639 −9.831 157.29 0.90 1.052 8.53 −0.0121 0.0166 0.146 −0.04 −0.131
1464.13 0.058 0.480 6.18 0.15 0.062 1.18 0.0067 0.0062 0.096 0.27 0.057
C+‖d 3593.676 9.406 162.74 0.79 1.001 7.86 −0.0042 0.0135 −0.144 −0.12 −0.080
1467.70 0.056 0.333 4.42 0.14 0.043 1.47 0.0036 0.0043 0.048 0.50 0.040
(cont’d)
230
Table 5.1—Continued
Model t0 u0 tE q Q : γ φ πE,N πE,E ω : α1/2b b/b Fb/Fbase
χ2 (day) (×100) (day) (×100) (deg) (yr−1) (yr−1)
C+‖e 3593.655 9.161 165.60 1.10 0.983 11.36 −0.0055 0.0197 −0.162 1.50 −0.051
1474.32 0.047 0.498 7.24 0.20 0.060 1.42 0.0038 0.0045 0.038 0.74 0.058
C+‖f 3593.769 7.516 194.33 1.86 0.826 19.21 −0.0078 0.0170 −0.018 5.94 0.140
1474.40 0.038 0.372 7.10 0.33 0.048 1.46 0.0042 0.0038 0.041 0.72 0.042
C-⊥a 3593.251 −8.026 176.25 1.74 0.719 280.21 −0.0101 0.0309 0.758 1.10 0.061
1457.55 0.030 0.370 6.94 0.20 0.041 0.53 0.0054 0.0043 0.049 0.34 0.046
C+⊥a 3593.272 7.900 177.32 1.40 0.752 280.03 −0.0079 0.0198 −0.599 1.45 0.073
1470.74 0.035 0.298 5.33 0.25 0.052 0.58 0.0035 0.0038 0.084 0.32 0.037
(cont’d)
231
Table 5.1—Continued
Model t0 u0 tE q Q : γ φ πE,N πE,E ω : α1/2b b/b Fb/Fbase
χ2 (day) (×100) (day) (×100) (deg) (yr−1) (yr−1)
W-‖a 3593.708 −8.470 170.22 2.49 0.936 6.38 0.0073 0.0041 0.241 0.49 −0.004
1471.32 0.037 0.306 5.70 0.40 0.036 0.51 0.0056 0.0033 0.175 0.02 0.038
W+‖a 3593.703 8.568 167.72 2.48 0.950 6.37 0.0050 0.0017 0.273 0.49 −0.016
1471.29 0.035 0.273 4.70 0.40 0.034 0.49 0.0033 0.0029 0.174 0.02 0.033
W-⊥a 3593.562 −8.666 174.41 2.28 1.154 279.74 −0.0039 0.0180 0.841 −0.75 0.009
1474.79 0.067 0.361 6.31 1.03 0.048 0.61 0.0057 0.0054 0.385 0.06 0.043
W+⊥a 3593.641 9.281 165.12 7.39 1.158 279.74 −0.0036 0.0148 0.804 −0.23 −0.060
1480.79 0.062 0.293 4.04 5.71 0.048 0.60 0.0036 0.0042 0.339 0.18 0.034
(cont’d)
232
Table 5.1—Continued
Model t0 u0 tE q Q : γ φ πE,N πE,E ω : α1/2b b/b Fb/Fbase
χ2 (day) (×100) (day) (×100) (deg) (yr−1) (yr−1)
W-⊥b 3593.570 −9.209 166.04 3.21 1.207 279.24 −0.0074 0.0204 0.909 −0.16 −0.054
1475.50 0.062 0.385 5.83 1.92 0.053 0.58 0.0056 0.0051 0.360 0.18 0.046
W+⊥b 3593.659 8.897 171.08 5.28 1.125 280.32 −0.0023 0.0112 0.826 −0.77 −0.016
1480.16 0.068 0.255 3.89 3.53 0.042 0.61 0.0035 0.0046 0.305 0.06 0.030
W-⊥c 3593.492 −9.443 163.30 2.20 1.291 278.45 −0.0108 0.0195 1.094 0.49 −0.086
1474.90 0.046 0.329 5.24 0.51 0.052 0.48 0.0053 0.0050 0.244 0.05 0.040
W+⊥c 3593.534 9.492 162.68 2.96 1.267 278.50 −0.0050 0.0155 1.054 0.44 −0.091
1486.02 0.049 0.249 3.70 0.69 0.043 0.49 0.0035 0.0051 0.252 0.05 0.030
233
Model t0 u0 tE q Q : γ φ πE,N πE,E ω : α1/2b b/b
χ2 (day) (×100) (day) (×100) (deg) (yr−1) (yr−1)
C-‖a 3593.751 −8.755 173.72 2.77 0.984 21.69 −0.0347 0.0316 0.075 5.59
1455.38 0.039 0.028 0.88 0.32 0.018 1.15 0.0027 0.0048 0.047 0.29
C+‖a 3593.648 8.728 171.75 1.42 0.940 14.45 −0.0063 0.0245 −0.191 3.21
1474.91 0.041 0.025 0.58 0.24 0.019 1.51 0.0037 0.0042 0.039 0.59
C-‖b 3593.612 −8.708 173.51 0.85 0.898 9.17 0.0026 0.0300 0.368 1.04
1463.48 0.028 0.020 0.48 0.18 0.017 1.68 0.0021 0.0046 0.030 0.59
(cont’d)
Table 5.2. OGLE-2005-SMC-001 Light Curve Models: Zero Blending.
234
Table 5.2—Continued
Model t0 u0 tE q Q : γ φ πE,N πE,E ω : α1/2b b/b
χ2 (day) (×100) (day) (×100) (deg) (yr−1) (yr−1)
C+‖b 3593.687 8.711 172.11 0.93 0.924 10.57 −0.0008 0.0180 −0.172 1.39
1469.51 0.034 0.021 0.45 0.11 0.016 1.08 0.0016 0.0030 0.028 0.29
C-‖c 3593.488 −8.769 175.52 0.28 0.813 −3.31 0.0103 0.0074 0.753 −3.89
1464.10 0.038 0.023 0.63 0.03 0.017 1.34 0.0019 0.0039 0.062 0.47
C+‖c 3593.493 8.770 174.27 0.27 0.809 −3.67 0.0084 0.0040 −0.735 −3.88
1467.54 0.038 0.023 0.67 0.03 0.017 1.22 0.0015 0.0030 0.065 0.43
(cont’d)
235
Table 5.2—Continued
Model t0 u0 tE q Q : γ φ πE,N πE,E ω : α1/2b b/b
χ2 (day) (×100) (day) (×100) (deg) (yr−1) (yr−1)
C+‖b 3593.687 8.711 172.11 0.93 0.924 10.57 −0.0008 0.0180 −0.172 1.39
1469.51 0.034 0.021 0.45 0.11 0.016 1.08 0.0016 0.0030 0.028 0.29
C+‖d 3593.751 8.725 172.80 0.65 0.919 7.20 0.0025 0.0115 −0.137 −0.11
1470.98 0.051 0.027 0.50 0.10 0.015 1.47 0.0015 0.0041 0.047 0.55
C-⊥a 3593.233 −8.524 166.86 1.77 0.762 279.76 −0.0168 0.0310 0.746 0.73
1460.01 0.031 0.019 0.49 0.23 0.030 0.48 0.0020 0.0049 0.055 0.25
(cont’d)
236
Table 5.2—Continued
Model t0 u0 tE q Q : γ φ πE,N πE,E ω : α1/2b b/b
χ2 (day) (×100) (day) (×100) (deg) (yr−1) (yr−1)
C+⊥a 3593.269 8.524 166.68 1.56 0.799 279.87 −0.0119 0.0211 −0.597 0.73
1470.98 0.032 0.019 0.50 0.25 0.034 0.49 0.0017 0.0039 0.058 0.22
W+‖a 3593.590 8.522 167.60 8.33 0.953 8.38 0.0062 0.0092 0.267 −0.21
1479.54 0.059 0.020 0.46 6.18 0.016 0.78 0.0021 0.0035 0.161 0.20
W-‖a 3593.718 −8.449 170.56 2.40 0.937 6.51 0.0088 0.0033 0.296 0.49
1471.40 0.033 0.040 0.71 0.38 0.014 0.44 0.0042 0.0032 0.175 0.02
(cont’d)
237
Table 5.2—Continued
Model t0 u0 tE q Q : γ φ πE,N πE,E ω : α1/2b b/b
χ2 (day) (×100) (day) (×100) (deg) (yr−1) (yr−1)
W+‖b 3593.709 8.427 170.01 2.41 0.931 6.54 0.0055 0.0021 0.210 0.50
1471.49 0.034 0.037 0.90 0.37 0.014 0.44 0.0030 0.0028 0.163 0.02
W-⊥a 3593.552 −8.742 173.11 2.20 1.169 279.61 −0.0051 0.0179 0.880 −0.73
1474.83 0.067 0.028 1.11 0.97 0.038 0.59 0.0030 0.0055 0.398 0.25
W+⊥a 3593.642 8.759 172.88 4.51 1.115 280.27 −0.0010 0.0126 0.769 −0.76
1480.41 0.069 0.025 1.15 2.76 0.036 0.62 0.0025 0.0051 0.359 0.21
238
OGLE-2005-SMC-001
OGLE I
OGLE V
CTIO I
AUCKLAND
CTIO V
SPITZER
3560 3580 3600 3620 364017
16.5
16
15.5
15
3560 3580 3600 3620 36400.06
0.04
0.02
0
-0.02
-0.04
-0.06
Fig. 5.1.— Standard (Paczynski 1986) microlensing fit to the light curve of OGLE-2005-SMC-001, with data from OGLE I and V in Chile, µFUN I and V in Chile,Auckland clear-filter in New Zealand, and the Spitzer satellite 3.6 µm at ∼ 0.2 AUfrom Earth. The data are binned by the day. All data are photometrically alignedwith the (approximately calibrated) OGLE data. The residuals are severe indicatingthat substantial physical effects are not being modeled. The models do not includeparallax, but when parallax is included, the resulting figure is essentially identical.
239
−1 −.5 0 .5 1−1
−.5
0
.5
1(B)
Time (days)2.
5 lo
g(A
)
−40 −20 0 20 400
.5
1
1.5
2
(A)Earth
Time (days)
2.5
log(
Fs*
A+
Fb)
−40 −20 0 20 400
.5
1
1.5
2
(C)Spitzeru0 < 0
Time (days)−40 −20 0 20 400
.5
1
1.5
2
(D)Spitzeru0 > 0
Fig. 5.2.— Why 4 (not 3) Spitzer observations are needed to measure πE =(πE,‖, πE,⊥). Panel A shows Earth-based light curve of hypothetical event (black curve)with πE = (0.4,−0.2), u0 = −0.2, and tE = 40 days, together with the corresponding(red) lightcurve with zero parallax. From the asymmetry of the lightcurve, one canmeasure πE,‖ = 0.4 and |u0| = 0.2, but no information can be extracted aboutπE,⊥ or the sign of u0. Indeed, 9 other curves are shown with various values ofthese parameters, and all are degenerate with the black curve. Panel (B) shows thetrajectories of all ten models in the geocentric frame (Gould 2004) that generatethese degenerate curves. Solid and dashed curves indicate positive and negative u0,respectively, with πE,⊥ = −0.4,−0.2, 0, +0.2, +0.4 (green, black, magenta, cyan, blue).Motion is toward positive x, while the Sun lies directly toward negative x. Dotsindicate 5 day intervals. Panel C shows full light curves as would be seen by Spitzer,located 0.2 AU from the Earth at a projected angle 60 from the Sun, for the 5 u0 < 0trajectories in Panel B. The source flux Fs and blended flux Fb are fit from the twofilled circles and a third point at baseline (not shown) as advocated by Gould (1999).Note that these two points (plus baseline) pick out the “true” (black) trajectory, fromamong other solutions that are consistent with the ground-based data with u0 < 0, butPanel D shows that these points alone would pick out the magenta trajectory amongu0 > 0 solutions, which has a different πE,⊥ from the “true” solution. However, afourth measurement open circle would rule out this magenta u0 > 0 curve and soconfirm the black u0 < 0 curve.
240
Fig. 5.3.— Parallax πE = (πE,N , πE,E) 1 σ error ellipses for all discrete solutions forOGLE-2005-SMC-001. The left-hand panels show fits excluding the Spitzer data,while the right-hand panels include these data. The upper panels show fits withblending as a free parameter whereas the lower panels fix the OGLE blending at zero.The ellipses are coded by ∆χ2 (relative to each global minimum), with ∆χ2 < 1 (red),1 < ∆χ2 < 4 (green), 4 < ∆χ2 < 9 (cyan), 9 < ∆χ2 < 16 (magenta), ∆χ2 > 16(blue). Close- and wide-binary solutions are represented by bold and dashed curves,respectively. Most of the “free-blend, no-Spitzer” solutions are highly degeneratealong the πE,⊥ direction (33 north through east), as predicted from theory, becauseonly the orthogonal (πE,‖) direction is well constrained from ground-based data. Atseen from the two upper panels, the Spitzer observations reduce the errors in the πE,⊥
direction by a factor ∼ 3 when the blending is a free parameter. However, fixing theblending (lower panels) already removes this freedom, so Spitzer observations thenhave only a modest additional effect.
241
OGLE-2005-SMC-001
OGLE I
OGLE V
CTIO I
AUCKLAND
CTIO V
SPITZER
1.5
2
2.5
3
3560 3580 3600 3620 364017
16.5
16
15.5
15
3560 3580 3600 3620 36400.06
0.04
0.02
0
-0.02
-0.04
-0.06
Fig. 5.4.— Best-fit binary microlensing model for OGLE-2005-SMC-001 togetherwith the same data shown in Fig. 5.1. The model includes microlens parallax (twoparameters) and binary rotation (two parameters). The models for ground-basedand Spitzer observations are plotted in blue and red, respectively. All data arein the units of 2.5 log(A), where A is the magnification. Ground-based data arealso photometrically aligned with the (approximately calibrated) OGLE data. Theresiduals show no major systematic trends.
242
6 4 2 0 -2 -4 -6-6
-4
-2
0
2
4
6
Fig. 5.5.— Likelihood contours of the inverse projected velocity Λ ≡ v/v2 for SMClenses together with Λ values for light-curve solutions found by MCMC. The latterare color-coded for solutions with ∆χ2 within 1, 4, and 9 of the global minimum. Thelikelihood contours are spaced by factors of 5.
243
6 4 2 0 -2 -4 -6-6
-4
-2
0
2
4
6
Fig. 5.6.— Likelihood contours of the inverse projected velocity Λ ≡ v/v2 for halolenses together with Λ values for light-curve solutions found by MCMC. Similar toFigure 5.5 except in this case the contours are color coded, with black, red, yellow,green, cyan, blue, magenta, going from highest to lowest.
244
Chapter 6
Probing ∼ 100AU Intergalactic Mg II Absorbing
“Cloudlets” with Quasar Microlensing
6.1. Introduction
Mg II absorbers toward quasar sight lines have been systematically studied since
Lanzetta et al. (1987) (for more recent studies, see Zibetti et al. 2005 and references
therein). Similar Mg II absorbers were subsequently seen in gamma-ray burst (GRB)
spectra. Prochter et al. (2006) compared strong Mg II absorbers toward GRB and
quasar sight lines, and found a significantly higher incidence toward the former.
They proposed three possible effects to explain this discrepancy: (1) obscuration
of faint quasars by dust associated with the absorbers; (2) Mg II absorbers being
intrinsic to GRBs; (3) gravitational lensing of the GRB by the absorbers. However,
they concluded that none of these effects provide a satisfactory explanation.
Frank et al. (2006) proposed a simple geometric solution to this puzzle. They
argued that if Mg II absorption systems are fragmented on scales ∼< 1016cm, similar
to the beam sizes of GRBs, then the observed difference in the frequency of Mg II
245
absorbers would simply reflect the difference in the average beam sizes between
GRBs and quasars, with quasars being several times larger. This explanation
predicts that absorption features due to intervening Mg II cloud fragments should
evolve as the size of GRB afterglow changes, an effect that has now been observed
by Hao et al. (2006). However, the structures of Mg II absorbers down to scales of
∼ 1016cm cannot be directly inferred from their spectral features. Rauch et al. (2002)
put the strongest upper limits on Mg II absorber sizes to date. They observed the
spectra of three images of Q2237+0305 (Huchra et al. 1985) and found that each line
of sight contained individual Mg II absorbers at approximately the same redshift,
but with distinct spectral features. Thus these absorbers are part of a complex that
extends at least ∼ 500pc, but the sizes of the individual “cloudlets” must be smaller
than 200 − 300pc, corresponding to the separations of the macro-images.
Some, if not all, strongly lensed quasars are also gravitationally microlensed
by compact stellar-mass objects in the lensing galaxy (Wambsganss 2006), and
Q2237+0305 was the first lensed quasar found to exhibit significant microlensing
variability (Corrigan et al. 1991; Wozniak et al. 2000). The macro-image of a
microlensed quasar is split into many micro-images, and when the source moves over
the caustic networks induced by the microlenses, those micro-images expand, shrink,
appear, disappear and experience drastic astrometric shifts over timescales of months
or years (Treyer & Wambsganss 2004). The angular sizes of major micro-images are
usually of the same order as those of the quasars, and during the shape and position
246
changes of these images, absorption structures of similar scale along their sight lines
will likely imprint significant variations on the spectrum.
Brewer & Lewis (2005) pioneered the theoretical investigation of quasar
microlensing as a probe of the sub-parsec structure of intergalactic absorption
systems. They concluded that variation in the strength of the absorption lines over
timescales of years or decades caused by microlensing can be used to probe the
structures of Lyman α clouds and associated metal-line absorption systems on scales
∼< 0.1pc. However, as I will show, they significantly underestimated the relevant
timescales for spectral variability given the sizes of the systems they considered.
Thus, they substantially overestimated the scales of absorption structures that
microlensing can effectively probe.
In § 6.2, I lay out the basic theoretical framework of the method. Then in
§ 6.3, I present a numerical simulation of the microlensing of Q2237+0305. I show
that micro-images of this quasar can be used to probe structures of Mg II and other
metal-line absorbing clouds on scales of ∼ 1014 − 1016cm by monitoring the spectral
variations of absorption lines over months or years. Finally in § 6.4, I summarize the
results and discuss their implications.
247
6.2. Varying Microlensed Quasar Image as A “Ruler”
I begin with a brief summary of notation. Subscripts “l”, “s”, “o” and “c”
refer to the lens, source, observer and absorption cloud planes, respectively. The
superscript “ray” is used to refer to the light ray on the cloud plane to distinguish it
from the cloud. The angular diameter distance between object x and y is denoted
Dxy and is always positive regardless of which is closer; in particular, Dx refers to the
angular diameter distance between the observer and object x. The vector angular
position of object x is denoted θx, while its redshift is denoted zx.
Consider an absorbing cloud that is confronted with a “bundle of light rays”
making their way from the source to the lens to the observer. Let rrayc be the position
vector of a point on the plane of the cloud. The line depth 〈Aλ〉 of an absorption
line centered at wavelength λ is given by:
〈Aλ〉 =∫
σλ(rrayc )Aλ(r
rayc )d2rray
c
/∫
σλ(rrayc )d2rray
c (6.1)
where σλ(rrayc ) is the surface density of the “ray bundles” on the plane of the
absorption cloud with the rays weighted by the surface brightness profile of the
quasar and Aλ(rrayc ) is the absorption fraction at rray
c for light of wavelength λ
(Brewer & Lewis 2005).
248
6.2.1. Basic Geometric Configurations and Motions
The absorption cloud can be located either between the lens and observer or
between the lens and source. In the former case, the angular position of the light ray
at the cloud plane is θrayc = θi, so rray
c = θiDc. The projected light rays on the cloud
plane maintain the exact shapes of the quasar images, and their physical extents are
proportional to the distance to the observer.
If the cloud is between the lens and the source, it can be easily shown that θrayc
is given by (Brewer & Lewis 2005):
θrayc =
(
1 − DlcDs
DlsDc
)
θi +DlcDs
DlsDc
θs, (6.2)
so
rrayc =
(
Dc −Dlc
Dls
Ds
)
θi +Dlc
Dls
rs. (6.3)
The lens-source relative proper motion, with time as measured by the observer, is
given (in other notation) by Kayser et al. (1986),
µls =1
1 + zs
vs
Ds
− 1
1 + zl
vl
Dl
+1
1 + zl
voDls
DlDs
(6.4)
where vs, vl, vo are the transverse velocities of the source, lens and observer, relative
to the cosmic microwave background (CMB). In particular,
vo = vCMB − (vCMB · z)z, (6.5)
249
where z is the unit vector in the direction of the lens and vCMB is the heliocentric
CMB dipole velocity (Kochanek 2004).
The formulae in this subject can be greatly simplified by proper choice of
notation. To this end, I defined the “absolute” proper motion of an object x moving
at transverse velocity vx to be:
µabs,x =1
1 + zx
vx
Dx
, (6.6)
I also define the “reflex proper motion” of an object x relative to the observer-lens
axis to be:
µo,l,x = sgn(zx − zl)1
1 + zl
voDlx
DlDx
, (6.7)
Then equation (6.4) can then be simplified,
µls = µabs,s − µabs,l + µo,l,s. (6.8)
Similarly, the lens-cloud relative proper motion is given by:
µlc = µabs,c − µabs,l + µo,l,c (6.9)
Note that the last term has a different sign depending on whether the cloud is farther
or closer than the lens (see eq. [6.7]).
250
6.2.2. Bulk Motion of the Un-microlensed “Ray
Bundles”
Equations (6.2), (6.8) and (6.9) are the key formulae needed to carry out the
simulation in § 6.3. In this section, I discuss the bulk motion and relevant timescales
of the macro-image and its associated “ray bundles” on the cloud plane for the
underlying case that the image is not perturbed by microlenses.
When the source moves at µls, the angular positions of the rays that compose
the images are also changing with respect to the observer-lens axis. If the
macro-image is unperturbed by the microlenses, the relative proper motion µli is
simply given by (Kochanek et al. 1996),
µli = M · µls, (6.10)
where M is the magnification tensor. At the same time, the intersection of “ray
bundles” with the cloud plane also change its angular position relative to the lens.
If the cloud plane is between the lens and the source, then from equation (6.2), the
“ray bundle”-lens relative proper motion µraylc is a linear combination of µls and µli
weighted by distances,
µraylc =
(
1 − DlcDs
DlsDc
)
µli +DlcDs
DlsDc
µls. (6.11)
251
Substituting equation (6.10) into equation (6.11) yields the bulk proper motion of
the un-microlensed “ray bundle” relative to the lens,
µraylc,bulk =
[(
1 − DlcDs
DlsDc
)
M +DlcDs
DlsDc
I]
· µls. (6.12)
where I is the unit tensor.
When the “ray bundles” are between the source and the lens, their relative bulk
proper motion is simply,
µraylc,bulk = µli = M · µls. (6.13)
There are two major effects that may induce variability of absorption lines
toward a lensed quasar. One is the creation, destruction, distortion and astrometric
shifts of micro-images, which probe structures similar to the size of the micro-images.
The other is the bulk motion of the “ray bundles” relative to the cloud. On the
cloud plane, this motion has an angular speed of ∆µlc,bulk = µraylc,bulk − µlc, and the
time tcc required for the “ray bundles” to cross a cloud of transverse size Rc is,
tcc =Rc
Dc|∆µbulk,lc|=
Rc
Dc|µraybulk,lc − µlc|
. (6.14)
6.3. Application to Q2237+0305
In October 1998, Rauch et al. (2002) obtained high-resolution Keck spectra of
images A, B and C of Q2237+0305. They found Mg II absorption lines at redshifts
252
of z = 0.5656 and z = 0.827 in the spectra of all three images, but absorption profiles
of the individual sight lines differed (e.g., Figs. 6 and 10 of their paper). Therefore,
they concluded that the Mg II complexes giving rise to these absorption features
must be larger than ∼ 0.5kpc, while the individual Mg II components must be
smaller than ∼ 200 − 300h−150 pc. Q2237+0305 is also one of the most observed and
best studied lensed quasars with obvious microlensing features, and the properties
of the system are well known. These factors make it an ideal object to investigate.
The comprehensive statistical study of this lens by Kochanek (2004) showed
that the size of the quasar is ∼ 1015h−1cm − 1016h−1cm. Mortonson et al. (2005)
demonstrated that the source size has a much more significant effect on microlensing
models than the source brightness profile. For simplicity, in the simulation, I model
the source as uniform disks with four different sizes: 1015h−1cm, 3 × 1015h−1cm,
5 × 1015h−1cm and 1016h−1cm. Uniform grids of rays are traced from the image
plane to the source plane (Wambsganss 1990). Because structures of interest have
similar sizes as the source, finite-source effects must be taken into account. The
grid size used has an angular scale 1/10 of the smallest source. Kochanek (2004)
demonstrated that a Salpeter mass function cannot be distinguished from a uniform
mass distribution and found the mean stellar mass to be 〈M 〉 ∼ 0.037h−1M⊙. For
simplicity, I assign all stars in the simulation the same mass of 0.04h−1M⊙. Rays
are shot from a region extending 47〈θE〉 on each side, where 〈θE〉 is the Einstein
radius of a 0.04h−1M⊙ star. I adopt a convergence and shear for image A of
253
(κ, γ) = (0.394, 0.395) from Kochanek (2004), and set the stellar surface density
κ∗ = κ. Four different trajectories, oriented at 0, 30, 60 and 90 degrees with respect
to the direction of the shear are studied. Positions on both the image plane and the
source plane are recorded once a ray falls within a distance of 2 times the largest
source size from any trajectory on the source plane. A total length of 5〈θE〉 along
each trajectory is considered.
Throughout the paper, I adopt a ΛCDM cosmology with Ωm = 0.3, ΩΛ = 0.7
and H0 = 100h km s−1 Mpc−1. The lens and source are at redshifts
zl = 0.0394 and zs = 1.695 (Huchra et al. 1985). These imply (Ds, Dl, Dls) =
(1223, 113, 1180)h−1 Mpc. Based on Kochanek (2004), I adopt transverse velocities
of the lens, source and observer of (vl, vs, vo) = (300, 140, 62)km s−1. The lens
and source absolute proper motions and the source reflex proper motion are
(µabs,l, µabs,s, µo,l,s) = (0.54, 0.009, 0.11) h µas yr−1. So the lens absolute proper
motion dominates the lens-source relative proper motion. It can easily be shown
that the absolute and reflex proper motions of the cloud are much smaller than the
absolute proper motion of the lens unless the cloud redshift is close to or smaller
than the lens redshift. For Q2237+0305, the lens redshift is very small compared
to the source, so most likely zc ≫ zl. Thus, in following analysis, I focus on clouds
with redshift zc > zl, and ignore the absolute and reflex proper motions of the source
254
and the cloud. Under these assumptions, the source and the cloud share the same
relative proper motion,
µlc = µls = −µabs,l (6.15)
In the simulation, as a practical matter, I hold the positions of the observer,
lens galaxy (as well as its microlensing star field) fixed, and allow the source to move
through the source plane at µls. By equation (6.15), the cloud then has the same
relative proper motion as the source. At any given time, the angular positions of
the “ray bundles” are calculated using equation (6.2). Then by simply subtracting
the angular position of the source at that time, the “ray bundles” positions are
transformed to the reference frame of the cloud.
During short timescales, microlensing causes centroid shifts of the macro-image
(Lewis & Ibata 1998; Treyer & Wambsganss 2004) with respect to the steady bulk
motion of “ray bundles” relative to the cloud, which is described in § 6.2.2. If the
direction of relative lens-source proper motion is the same as the lens shear, then by
subtracting µls from equation (6.12), one finds that the “ray bundles” have their
maximum bulk proper motion relative to the cloud,
∆µbulk,lc,max =(
1 − DlcDs
DlsDc
)
(
1
1 − κ − γ− 1
)
µls
∼ 2(
1 − 1.037Dlc
Dc
)
h µas yr−1. (6.16)
255
If the source moves perpendicular relative to the lens shear, ∆µlc =
[1 − DLCDS/(DLSDC)]|1/(1 − κ + γ) − 1|µls, which is approximately 0 for
the (κ, γ) of image A.
Figures 6.1 and 6.2 show the results for a source trajectory that is parallel
to the shear direction. The bottom panel of Figure 6.1 shows the magnification
pattern on the source plane, with a series of 4 concentric circles centered at 5 source
positions; and the top panels show the images at these positions relative to the
source (which has the same proper motion as the cloud). Different colors represent
different source sizes. The middle panel of Figure 6.1 shows the light curves for the
4 source sizes with the blue dashed lines used to mark the times for the five source
positions. Figure 6.2 shows the “ray bundle” positions in the cloud frame at redshifts
1.69, 0.83, 0.57, 0.1. The 5 different columns show the 5 positions corresponding to
those in Figure 6.1.
In the top row of Figure 6.2, one can see that the “ray bundles” at zc = 1.69,
which is very close to the quasar, have almost exactly the same size and shape
as the source and that the bundles show almost no bulk motion. The density of
“ray bundles” clearly have the imprints from the magnification pattern shown in
the bottom panel of Figure 6.1. So if an absorption cloudlet has a size similar to
or somewhat smaller than a source that is sitting directly behind it, the depth of
its corresponding absorption line will change dramatically as the source crosses the
caustics. The magnification close to a fold caustic is proportional to the inverse
256
square root of the distance from it, so for a cloud with angular size θc < θs, the
fractional change in absorption line depth caused by the caustics scales as (θc/θs)3/4.
Hence, for a cloud close to the quasar redshift, structures on scale ∼ 1014−1016h−1cm
(depending on the source size) will be probed over few-month to few-year timescales
(i.e., the timescale of typical caustics crossings).
The fourth row of panels of Figure 6.2 shows “ray bundles” for zc = 0.1, which
is close to the lens redshift. A distinct difference between these “ray bundles”
and the ones at zc = 1.69 is that they are split into many groups of bundles,
which correspond to the micro-images in the top panel of Figure 6.1. I dub these
groups of “ray bundles” as micro-images in the following discussions. Most of these
micro-images are stretched one-dimensionally, and most rays are concentrated in
a few major micro-images. The micro-images in different columns have drastically
different morphologies and positions. If there are cloudlets of similar size as these
micro-images distributed on the cloud plane, then the absorption spectra will show
multi-component absorption features at any given time. These components will
experience drastic changes in line depth, with some components disappearing and
other new components appearing during the course of months or years, as the source
crosses the microlensing caustics. Another important characteristic is that the bulk
of these micro-images are moving in the same direction as the source. This motion
is described by equation (6.16), which yields ∼ 0.8 h µas yr. So from equation (6.14),
structures as large as ∼ 3.0n × 1016 h−1 cm will be crossed in n × 10 h−1 yr by the
257
bulk motion of the “ray bundles”. Hence, the effects caused by the micro-images
and the bulk motion of the bundles together probe scales of ∼ 1014 − 1016h−1cm on
timescales of months to years.
The second and third row of panels in Figure 6.2 refer to clouds at intermediate
redshifts between the lens and the source. Their redshifts, zc = 0.83 and zc = 0.57,
are close to the Mg II absorption systems observed by Rauch et al. (2002). As
expected, the characteristics of these “ray bundles” are intermediate between those
shown in rows 1 and 4. The overall shapes of the micro-images are close to that
of the source, with magnification patterns imprinted on them. And they also
clearly show multiple components, which appear and disappear as the source crosses
caustics. The angular sizes of the images are close to that of the (unmagnified)
quasar. These micro-images could probe clouds with angular size from a factor of
few smaller to a factor of few larger than the source size, which corresponds to scales
of ∼ 1014 − 1016h−1cm. From equation (6.14) and (6.16), the bulk motion of the
bundles will probe cloud structures ∼ 0.8n × 1016 h−1 cm and ∼ 1.3n × 1016 h−1 cm
during n × 10 h−1 yr for zc = 0.83 and zc = 0.57, respectively.
For a source trajectory that is perpendicular to the shear, the “ray bundles”
will exhibit almost no bulk motion relative to the cloud, while the magnitude
of bulk motion for intermediate trajectories is a fraction that is of the parallel
case, depending on the angles of motion relative to the shear direction. And
the micro-images of these trajectories share similar properties with those of the
258
trajectory that is parallel to the shear. Figure 6.2 shows that micro-image motions
are on the same order of magnitude as the bulk motion of the macro-image over the
timescales considered. Therefore, trajectory direction only has a modest impact on
the scales at which cloud can be probed.
These results are contrast significantly with those of Brewer & Lewis (2005),
who claimed that quasar microlensing for image A of Q2237+0305 can induce
considerable variability of absorption lines associated with structures as large as
0.1pc during the course of years to decades. According to their analysis, the effect is
largest when the cloud is very close to the source, and for example, the timescale of
line strength variation for a 0.1pc cloud very close to the source is given as ∼ 16.2yr.
However, in their analysis, they effectively assumed the relative lens-cloud proper
motion µlc = 0. This would lead to a timescale tcc ∼ RcDl/(vlDs) ∼ 16(Rc/0.1pc)yr
for an absorption cloud near the source redshift (they adopted vl = 600km s−1),
which is in agreement with column 4 of their Table 1. In fact, I showed that, when
peculiar velocities of the source and cloud are ignored, µlc = µls (eq. [6.15]), which
is not negligible. Even considering realistic peculiar motions of the source and the
cloud, it still leads to time scales that are more than one order of magnitude slower
than those predicted for µlc = 0. In addition, when the cloud is close to the source,
the angular sizes of the clouds they considered are orders of magnitudes larger than
their source size, so effects of changes in magnification pattern on the source plane
259
alone have very little impact as well. Therefore, Brewer & Lewis (2005) significantly
overestimated the cloud size to which microlensing is effectively sensitive.
6.4. Discussion and Conclusion
I have shown that there are two effects that might induce variation of
absorption lines along the sight lines to lensed quasars. One effect is caused by
the drastic morphological and positional changes of micro-images when the source
crosses the caustic network. The other effect is due to the bulk motion of the
“ray bundles” relative to the absorption clouds. I have laid out a basic framework
in studying these effects for microlensed quasars in general. And in particular, I
performed numerical simulations to apply the method to image A of Q2237+0305.
I demonstrated that the combinations of these two effects probe 1014 − 1016 h−1 cm
absorption cloudlets between the lens and the source over timescales of months to
years. The existence of these cloudlets will be revealed by either changes in line
depths or appearances/disappearances of multi-absorption components. Spectra
should preferably be taken during the course of caustic crossings, which can be
inferred from photometric monitoring programs of lensed quasars. In fact, the Mg II
lines observed by Rauch et al. (2002) about 8 years ago already show different
multi-components along sight lines to three different macro-images, implying they
might be caused by fragmented cloudlets with similar sizes as the micro-images. A
260
similar high-resolution spectrum taken in the near future would provide a definitive
test of the existence for structures of Mg II or other metal-line absorbers at the scales
of 1014 − 1016 h−1 cm. If the spectral variations are indeed observed, a statistical
study similar to Kochanek (2004) will be required to infer the properties of the
cloudlets. Moreover, a time series of spectra may provide additional constraints to
quasar models that are currently based only on photometric-monitoring data.
261
Fig. 6.1.— Caustics network (bottom panel), light curve (middle panel) and imagesrelative to the source position (top panel) for a source trajectory that is parallel to thelens shear (the direction of x-axis). Different colors represent the 4 different sourcesizes.
262
Fig. 6.2.— Physical positions of “ray bundles” in the frame of cloud at redshifts 0.1,0.57, 0.83 and 1.69. The 5 different columns correspond to the source positions shownin Figure 6.1. x-axis is the direction of the lens shear.
263
Appendix A
Two New Finite-Source Algorithms
To model planetary light curves, we develop two new binary-lens finite-source
algorithms. The first algorithm, called “map-making”, is the main work horse. For
a fixed (b, q) geometry, it can successfully evaluate the finite-source magnification of
almost all data points on the light curve and can robustly identify those points for
which it fails. The second algorithm, called “loop-linking”, is much less efficient than
map-making but is entirely robust. We use loop-linking whenever the map-making
routine decides it cannot robustly evaluate the magnification of a point. In addition,
at the present time, map-making does not work for resonant lensing geometries, i.e.,
geometries for which the caustic has six cusps. For planetary mass ratios, resonant
lensing occurs when the planet is very close to the Einstein radius, b ∼ 1. We use
loop-linking in these cases also.
A.1. Map-Making
Map-making has two components: a core function that evaluates the
magnification and a set of auxiliary functions that test whether the measurement
264
is being made accurately. If a light-curve point fails these tests, it is sent to
loop-linking.
Finite-source effects are important when the source passes over or close to
a caustic. Otherwise, the magnification can be evaluated using the point-source
approximation, which is many orders of magnitude faster than finite-source
evaluations. Hence, the main control issue is to ensure that any point that falls
close to a caustic is evaluated using a finite-source algorithm or at least is tested to
determine whether this is necessary. For very high magnification events, the peak
points will always pass close to the central caustic. Hence, the core function of
map-making is to “map” an Einstein-ring annulus in the image plane that covers
essentially all of the possible images of sources that come close to the central caustic.
The method must also take account of the planetary caustic(s), but we address that
problem further below.
We begin by inverse ray-shooting an annulus defined by APSPL > Amin, where
APSPL is the Paczynski (1986) magnification due to a point source by a point lens
and Amin is a suitably chosen threshold. For OGLE-2000-BLG-343, we find that
Amin = 75 covers the caustic-approaching points in essentially all cases. The choice
of the density of the ray-shooting map is described below. Each such “shot” results
in a four-element vector (xi, yi, xs, ys). We divide the portion of the source plane
covered by this map into a rectangular grid with k = 1, . . . , Ng elements. We choose
the size of each element to be equal to the smallest source radius being evaluated by
265
the map. Hence, each “shot” is assigned to some definite grid element k(xs, ys). We
then sort the “shots” by k. For each light-curve point to be evaluated, we first find
the grid elements that overlap the source. We then read sequentially through the
sorted file1 from the beginning of the element’s “shots” to the end. For each “shot”,
we ask whether its (xs, ys) lies within the source. If so, we weight that point by the
limb-darkened profile of the source at that radius. Note that a source with arbitrary
shape and surface brightness profile would be done just as easily.
For each light-curve point, we first determine whether at least one of the images
of the center of the source lies in the annulus. In practice, for our case, the points
satisfying this condition are just those on the night of the peak, but for other events
this would have to be determined on a point-by-point basis. We divide those points
with images outside the annulus into two classes, depending on whether they lie
inside or outside two or three rectangles that we “draw”, one around each caustic.
Each rectangle is larger than the maximum extent of the caustics by a factor of 1.5 in
each direction. If the source center lies outside all of these rectangles, we assume that
the point-source approximation applies and evaluate the magnification accordingly.
If it lies inside one of the rectangles (and so either near or inside one of the caustics),
we perform the following test to see whether the point-source approximation holds.
1Whether this “file” should actually be an external disk file or an array in internal memory
depends on both the size of the available internal memory and the total number of points evaluated
in each lens geometry. In our case, we used internal arrays.
266
We evaluate the point-source magnifications at five positions, namely, the source
center A(0, 0), two positions along the source x-axis A(±λρ∗, 0), and two positions
along the source y-axis A(0,±λρ∗), where λ ≤ 1 is a parameter. We demand that
∣
∣
∣
∣
A(λρ∗, 0) + A(−λρ∗, 0)
2A(0, 0)− 1
∣
∣
∣
∣
+∣
∣
∣
∣
A(0, λρ∗) + A(0,−λρ∗)
2A(0, 0)− 1
∣
∣
∣
∣
< 4σ, (A.1)
where σ is the maximum permitted error (defined in § A.3). We use a minimum of
five values in order to ensure that the magnification pattern interior to the source is
reasonably well sampled; the precise value is chosen empirically and is a compromise
between computing speed and accuracy. We require the number of values to be at
least 2[ρ∗/√
q] in order to ensure that small, well-localized perturbations interior to
the source caused by low mass ratio companions are not missed.
If a point passes this test, the magnification pattern in the neighborhood of the
point is adequately represented by a gradient and so the point-source approximation
holds. Points failing this test are sent to loop linking.
The remaining points, those with at least one image center lying in the annulus,
are almost all evaluated using the sorted grid as described above. However, we must
ensure that the annulus really covers all of the images. We conduct several tests to
this end.
First, we demand that no more than one of the three or five images of the
source center lie outside the annulus. In a binary lens, there is usually one image
267
that is associated with the companion and that is highly demagnified. Hence, it can
generally be ignored, so the fact that it falls outside the annulus does not present a
problem. If more than one image center lies outside the annulus, the point is sent to
loop-linking. Second, it is possible that an image of the center of the source could
lie inside the annulus, but the corresponding image of another point on the source
lies outside. In this case, there would be some intermediate point that lay directly
on the boundary. To guard against this possibility, we mark the “shots” lying within
one grid step of the boundaries of the annulus, and if any of these boundary “shots”
fall in the source, we send the point to loop-linking. Finally, it is possible that
even though the center of the source lies outside the caustic (and so has only three
images), there are other parts of the source that lie inside the caustic and so have
two additional images. If these images lay entirely outside the annulus, the previous
checks would fail. However, of necessity, some of these source points lie directly on
the caustic, and so their images lie directly on the critical curve. Hence, as long as
the critical curve is entirely covered by the annulus, at least some of each of these
two new images will lie inside the annulus and the “boundary test” just mentioned
can robustly determine whether any of these images extend outside the annulus. For
each (b, q) geometry, we directly check whether the annulus covers the critical curve
associated with the central caustic by evaluating the critical curve locus using the
algorithm of Witt (1990).
268
A.2. Loop-Linking
Loop-linking is a hybrid of two methods: inverse ray-shooting and Stokes’s
theorem. In the first method (which was also used above in “map-making”), one
finds the source location corresponding to each point in the image plane. Those
that fall inside the source are counted (and weighted according to the local surface
brightness), while those that land outside the source are not. The main shortcoming
of inverse ray-shooting is that one must ensure that the ensemble of “shots” actually
covers the entire image of the source without covering so much additional “blank
space” that the method becomes computationally unwieldy.
In the Stokes’s theorem approach, one maps the boundary of (a polygon-
approximation of) the source into the image plane, which for a binary lens yields
either three or five closed polygons. These image polygons form the (interior or
exterior) boundaries of one to five images. If one assumes uniform surface brightness,
the ratio of the combined areas of these images (which can be evaluated using
Stokes’s theorem) to the area of the source polygon is the magnification.
There are two principal problems with the Stokes’s theorem approach. First,
sources generally cannot be approximated as having uniform surface brightness.
This problem can be resolved simply by breaking the source into a set of annuli, each
of which is reasonably approximated as having uniform surface brightness. However,
this multiplies the computation time by the number of annuli. Second, there can
269
be numerical problems of several types if the source boundary passes over or close
to a cusp. First, the lens solver, which returns the image positions given the source
position, can simply fail in these regions. This at least has the advantage that one
can recognize that there is a problem and perhaps try some neighboring points. The
second problem is that even though the boundary of the source passes directly over
a cusp, it is possible that none of the vertices of its polygon approximation lie within
the caustic. The polygonal image boundary will then fail to surround the two new
images of the source that arise inside the caustic, so these will not be included in
the area of the image. Various steps can be taken to mitigate this problem, but the
problem is most severe for very low-mass planets (which are of the greatest interest
in the present context), so complete elimination of this problem is really an uphill
battle.
The basic idea of loop-linking is to map a polygon that is slightly larger than
the source onto the image plane, and then to inverse ray shoot the interior regions
of the resulting image-plane polygons. This minimizes the image-plane region
to be shot compared to other inverse-ray shooting techniques. It is, of course,
more time-consuming than the standard Stokes’s theorem technique, but it can
accommodate arbitrary surface-brightness profiles and is more robust. As we detail
below, loop-linking can fail at any of several steps. However, these failures are
always recognizable, and recovery from them is always possible simply by repeating
the procedure with a slightly larger source polygon.
270
Following Gould & Gaucherel (1997), the vertices of the source polygon are
each mapped to an array of three or five image positions, each with an associated
parity. If the lens solver fails to return three or five image positions, the evaluation is
repeated beginning with a larger source polygon. For each successive pair of arrays,
we “link” the closest pair of images that has the same parity and repeat this process
until all images in these two arrays are linked. An exception occurs when one array
has three images and the other has five images, in which case two images are left
unmatched. Repeating this procedure for all successive pairs of arrays produces a set
of linked “strands”, each with either positive or negative parity. The first element of
positive-parity strands and the last element of negative-element strands are labeled
“beginnings” and the others are labeled “ends”. Then the closest “beginning” and
“end” images are linked and this process is repeated until all “beginnings” and
“ends” are exhausted. The result is a set of two to five linked loops. As with the
standard Stokes’s theorem approach, it is possible that a source-polygon edge crosses
a cusp without either vertex being inside the caustic. Then the corresponding
image-polygon edge would pass inside the image of the source, which would cause
us to underestimate the magnification. We check for this possibility by inverse ray
shooting the image-polygon boundary (sampled with the same linear density as we
later sample the images) back into the source plane. If any of these points land in
the source, we restart the calculation with a larger source polygon.
271
We then use these looped links to efficiently locate the region in the image
plane to do inverse ray shooting. We first examine all of the links to find the largest
difference, ∆ymax, between the y-coordinates of the two vertices of any link. Next, we
sort the m = 1, . . . , n links by the lower y-coordinate of their two vertices, y−m. One
then knows that the upper vertex obeys y+m ≤ y−
m + ∆ymax. Hence, for each y-value
of the inverse ray shooting grid, we know that only links with y−m ≤ y ≤ y−
m + ∆ymax
can intersect this value. These links can quickly be identified by reading through
the sorted list from y−m = y − ∆ymax to y−
m = y. The x value of each of these
crossing links is easily evaluated. Successive pairs of x’s then bracket the regions (at
this value of y) where inverse rays must be shot. As a check, we demand that the
first of each of these bracketing links is an upward-going link and the second is a
downward-going link.
A.3. Algorithm Parameters
Before implementing the two algorithms described above, one must first specify
values for certain parameters. Both algorithms involve inverse ray shooting and
hence require that a sampling density by specified. Let g be the grid size in
units of the Einstein radius. For magnification A ≫ 1, the image can be crudely
approximated as two long strands whose total length is ℓ = 4Aρ∗ and hence of mean
width (πρ2∗A)/ℓ = (π/4)ρ∗. If, for simplicity, we assume that the strand is aligned
272
with the grid, then there will be a total of ℓ/g grid tracks running across the strand.
Each will have two edges, and on each edge there will be an “error” of 12−1/2 in the
“proper” number of grid points due to the fact that this number must be an integer,
whereas the actual distance across the strand is a real number. Hence, the total
number of grid points will be in error by [(2ℓ/g)/12]1/2, while the total number itself
is π(ρ∗/g)2A. This implies a fractional error σ,
σ−2 =[π(ρ∗/g)2A]2
(2ℓ/g)/12=
3π2
2A(ρ∗/g)3. (A.2)
In fact, the error will be slightly smaller than given by equation (A.2) in part
because the strand is not aligned with the grid, so the total number of tracks across
the strands will be lower than 2ℓ/g, and in part because the “discretization errors”
at the boundary take place on limb-darkened parts of the star, which have lower
surface brightness, so fluctuations here have lower impact. Hence, an upper limit to
the grid size required to achieve a fractional error σ is
g
ρ∗
=(
3π2
2
)1/3
σ2/3A1/3. (A.3)
For each loop-linking point, we know the approximate magnification A because
we know the weighted parameters of the single-lens model. We generally set σ at
1/3 of the measurement error, so that the (squared) numerical noise is an order of
magnitude smaller than that due to observational error. Using equation (A.3) we
can then determine the grid size.
273
For the map-making method, the situation is slightly more complicated. Instead
of evaluating one particular point as in the loop-linking method, all of the points on
the source plane with the same (b, q, u0, ρ) are evaluated within one map. Therefore,
the grid size for this map is the minimum value from equation (A.3) to achieve the
required accuracy for all of these points. We derive the following equation from
equation (A.3) to determine the grid size in the map-making method:
g =[
3π2
2
Q(u0)
u0
]1/3
ρ∗ (A.4)
where
Q(u0) = minAi>75
F (ti) − Fb(u0)
Fmax(ti) − Fb(u0)σ2
i
(A.5)
We find that Q(u0) = 2.65×10−7 is independent of u0 for both the observational
and simulated data of OGLE 2004-BLG-343. In principle, one could determine a
minimum g for all (u0, ρ) combinations and generate only one map for a given (b, q)
geometry, but this would render the calculation unnecessarily long for most (u0, ρ)
combinations. Instead we evaluate g for each (u0, ρ) pair and create several maps,
one for each ensemble of (u0, ρ) pairs with similar g’s. The sizes of the ensembles
should be set to minimize the total time spent generating, loading and employing
maps. Hence, they will vary depending on the application.
274
Appendix B
MOA-2003-BLG-32/OGLE-2003-BLG-219
MOA-2003-BLG-32/OGLE-2003-BLG-219, with a peak magnification
Amax = 525 ± 75 is most sensitive to low-mass planets to date (Abe et al. 2004).
However, instead of fitting the simulated binary-lens light curves to single-lens
models, Abe et al. (2004) obtain their ∆χ2 by directly subtracting the χ2 of a
simulated binary-lens light curve from that of the light curve that is the best fit
to the data. Since the source star of this event could reside in the Sagittarius
dwarf galaxy, which makes the Galactic modeling rather complicated, we do not
attempt to apply our entire method to this event. We calculate planet exclusion
regions with the same ∆χ2 thresholds (60 for q = 10−3 and 40 for the q < 10−3) as
Abe et al. (2004) but using our method of obtaining ∆χ2 by fitting the simulated
binary-lens light curves to single-lens models. Figure B.1 shows our results for
the exclusion regions at planet-star mass ratios q = 10−5, 10−4, and 10−3 for
MOA-2003-BLG-32/OGLE-2003-BLG-219. The exclusion region we have obtained
at q = 10−3 is about 1/4 in vertical direction and 1/9 in horizontal direction relative
to the corresponding region in Abe et al. (2004), and the size of our exclusion
region at 10−4 is about 60% in each dimension relative to that in Abe et al. (2004).
275
Although according to our analysis, Abe et al. (2004) overestimate the sensitivity of
MOA-2003-BLG-32/OGLE-2003-BLG-219 to both Jupiter-mass and Neptune-mass
planets, their estimates of sensitivity to Earth-mass planets are basically consistent
with our results and MOA-2003-BLG-32/OGLE-2003-BLG-219 nevertheless retains
the best sensitivity to planets to date.
276
Fig. B.1.— Planetary exclusion regions for microlensing event MOA-2003-32/OGLE-2003-BLG-219 as a function of projected coordinate bx and by at planet-star massratio q = 10−5 (green), 10−4 (red) and 10−3 (blue). The source size (normalized to θE)ρ∗ is equal to 0.0007. The black circle is the Einstein ring, i.e., b = 1.
277
Appendix C
Extracting Orbital Parameters for Circular
Planetary Orbit
OGLE-2005-BLG-071 is the first planetary microlensing event for which the
effects of planetary orbital motion in the light curve have been fully analyzed. The
distortions of the light curve due to the orbital motion are modeled by ω and b as
discussed in § 4.3.2. In addition, the lens mass M and distance Dl are determined,
so we can directly convert the microlens light-curve parameters that are normalized
to the Einstein radius to physical parameters. In this section, we show that under
the assumption of a circular planetary orbit, the planetary orbital parameters can
be deduced from the light-curve parameters. Let r⊥ = DlθEd be the projected
star-planet separation and let r⊥γ be the instantaneous planet velocity in the
plane of the sky, i.e. r⊥γ⊥ = r⊥ω is the velocity perpendicular to this axis and
r⊥γ‖ = r⊥d/d is the velocity parallel to this axis. Let a be the semi-major axis and
define the ı, , k directions as the instantaneous star-planet-axis on the sky plane,
278
the direction into the sky, and k = ı × . Then the instantaneous velocity of the
planet is
v =
√
GM
a[cos θk + sin θ(cos φı − sin φ)], (C.1)
where φ is the angle between star-planet-observer (i.e., r⊥ = a sin φ) and θ is the
angle of the velocity relative to the k direction on the plane that is perpendicular to
the planet-star-axis. We thus obtain
γ⊥ =
√
GM
a3
cos θ
sin φ, γ‖ =
√
GM
a3sin θ cot φ. (C.2)
To facilitate the derivation, we define
A ≡ γ‖
γ⊥
= − tan θ cos φ, B ≡ r3⊥γ2
⊥
GM= cos2 θ sin φ, (C.3)
which yield as an equation for sin φ:
B = F (sin φ); F (x) =x(1 − x2)
A2 + 1 − x2. (C.4)
Note that F ′(sin φ) = 0 when sin2 φ∗ = (3/2)A2 + 1 − |A|√
(9/4)A2 + 2 . So
equation (C.4) has two degenerate solutions when B < F (sin φ∗) and has no
solutions when B > F (sin φ∗). Subsequently, one obtains,
a =r⊥
sin φ, cos i = − sin φ cos θ, K =
√
GM
aq sin i, (C.5)
279
where i is the inclination and K is the amplitude of radial velocity.
The Jacobian matrix used to transform from P (r⊥, γ⊥, γ‖) to P (a, φ, θ) is given
below,
∂(r⊥, γ⊥, γ‖)
∂(a, φ, θ)=
GM
a3
∣
∣
∣
∣
∣
sin φ a cos φ 0
− 32a
cos θsin φ
− cos θ cos φsin2 φ
− sin θsin φ
− 32a
sin θ cot φ − sin θsin2 φ
cos θ cot φ
∣
∣
∣
∣
∣
=GM
a3cot2 φ
(
1
2− sin2 θ tan2 φ
)
. (C.6)
Then for an arbitrary function H(a),
∂(r⊥, γ⊥, γ‖)
∂(H(a), cos φ, θ)=
∂(r⊥, γ⊥, γ‖)
∂(a, φ, θ)× 1
sin φH ′(a), (C.7)
which, for the special case of a flat distribution, H(a) = ln a, yields,
∂(r⊥, γ⊥, γ‖)
∂(ln(a), cos φ, θ)=
GM
r2⊥
cos2 φ
sin φ.(
1
2− sin2 θ tan2 φ
)
(C.8)
280
Appendix D
Failure of Elliptical-Source Models for
MOA-2007-BLG-400
Because MOA-2007-BLG-400 is the first microlensing event with a completely
buried caustic, it is important to rule out other potential causes of the deviations
seen in the light curve (apart from a planetary companion to the lens). The
principal features of these deviations are the twin “spikes” in the residuals, which are
approximately centered on the times when the lens enters and exits the source. In
the model, these crossings occur at about HJD’ = 4354.53 and HJD’ = 4354.63, i.e.,
very close to the spikes in Figure 2.1. In principle, one might be able to induce such
spikes by displacing the model source crossing times from the true times. The only
real way to achieve this (while still optimizing the overall fit parameters) would be if
the source were actually elliptical, but were modeled as a circle (which, of course, is
the norm).
One argument against this hypothesis is the similarity of the I and H residuals
(§ 2.3). If the source were an ellipsoidal variable, then one would expect color
gradients due to “gravity darkening”.
281
Nevertheless, we carried out two types of investigation of this possibility. First,
we modeled the light curve as an elliptical source magnified by a point (non-binary)
lens. In addition to the linear flux parameters (source flux plus blended flux for each
observatory) there are 6 model parameters, the three standard point-lens parameters
(t0, u0, tE), plus the source semimajor and semiminor axes (ρa, ρb) and the angle of
the source trajectory relative to the source major axis, α. We find that the elliptical
source reduces χ2 by about 200, but it does not remove the “spikes” from the
residuals, which was the primary motivation for introducing it. Instead, essentially
all of the χ2 improvement comes from eliminating the asymmetries from the rest
of the light curve. Recall, however, that the planetary model removes both these
asymmetries and the “spikes”. Moreover, the best-fit axis ratio is quite extreme,
ρb/ρa = 0.7, which would produce very noticeable ellipsoidal variations unless the
binary were being viewed pole on.
Next we looked for sinusoidal variations in the baseline light curve. The
individual OGLE errorbars at baseline are smaller than for MOA, and since
ellipsoidal variations are strictly periodic, the longer OGLE baseline (about
T = 2000 days versus T = 800 days for MOA) does a better job of isolating this
signal from various possible systematics. Therefore, for this purpose, the OGLE
data are more suitable than MOA. The OGLE data are essentially all baseline (only
two magnified points out of 452). Their periodogram shows several spikes at the
0.01 mag level, and a maximum ∆χ2 = 20, which are consistent with noise. The
282
width of the spikes is extremely narrow, consistent with the theoretical expectation
for uniformly sampled data of σ(P )/P 2 ∼√
24/∆χ2/(2πT ) ∼ 10−4 day−1, indicating
that the data set is behaving normally.
In brief, our investigation finds no convincing evidence for ellipticity of the
source, certainly not for the several tens of percent deviation from circular that
would be needed to significantly ameliorate the deviations seen near peak in the
light curve. Moreover, even arbitrary source ellipticities cannot reproduce the light
curve’s most striking features: the two “spikes” in the residuals that occur when the
lens crosses the source boundary.
283
BIBLIOGRAPHY
Abe, F., et al. 2004, Science, 305, 1264
Afonso, C., et al. 2000, ApJ, 532, 340
Afonso, C. et al. 2003, A&A, 400, 951
Albrow, M. D., et al. 1998, ApJ, 509, 687
Albrow, M. D., et al. 1999, ApJ, 512, 672
Albrow, M.D. et al. 1999, ApJ, 522, 1022
Albrow, M.D. et al. 2000, ApJ, 534, 894
Albrow, M. D., et al. 2000, ApJ, 535, 176
Albrow, M. D., et al. 2001, ApJ, 556, L113
Albrow, M. D., et al. 2002, ApJ, 572, 1031
Alcock, C. et al. 1993, Nature, 365, 621
Alcock, C. et al. 1995, ApJ, 454, L125
Alcock, C. et al. 1997, ApJ, 491, 436
Alcock, C. et al. 2000, ApJ, 542, 281
Alcock, C., et al. 2001, Nature, 414, 617
Alcock, C. et al. 2001, ApJ, 550, L169
An, J. H. 2005, MNRAS, 356, 1409
An, J. H. & Han, C. 2002, ApJ, 573, 351
An, J. H., et al. 2002, ApJ, 572, 521
284
Anderson, J. & King, I. R. 2004, Hubble Space Telescope Advanced Camera forSurveys Instrument Science Report 04-15
Assef, R. J. et al. 2006, ApJ, 649, 954
Aubourg, E. et al. 1993, Nature, 365, 623
Bailey, J., et al. 2009, ApJ, 690, 743
Bean, J. L., Benedict, G. F., & Endl, M. 2006, ApJ, 653, L65
Beaulieu, J.-P., et al. 2006, Nature, 439, 437
Benedict, G. F., et al. 2002, ApJ, 581, L115
Bennett, D. P., & Rhie, S. H. 1996, ApJ, 472, 660
Bennett, D. P., & Rhie, S. H. 2002, ApJ, 574, 985
Bennett, D. P., et al. 2006, ApJ, 647, L171
Bennett, D. P., Anderson, J., & Gaudi, B. S. 2007, ApJ, 660, 781
Bennett, D. P., et al. 1999, Nature, 402, 57
Bennett, D.P. et al., 2008, ApJ, 684, 663
Bessell, M. S., & Brett, J. M. 1988, PASP, 100, 1134
Bond, I. A., et al. 2001, MNRAS, 327, 868
Bond, I. A., et al. 2002, MNRAS, 333, 71
Bond, I. A., et al. 2004, ApJ, 606, L155
Boss, A. P. 2002, ApJ, 567, L149
Boss, A. P. 2006, ApJ, 643, 501
Boutreux, T. & Gould, A. 1996, ApJ, 462, 705
Brewer, B. J., & Lewis, G. F. 2005, MNRAS, 356, 703
Burrows, A., Hubbard, W. B., Saumon, D., & Lunine, J. I. 1993, ApJ, 406, 158
Burrows, A., et al. 1997, ApJ, 491, 856
Butler, R. P., et al. 2006, PASP, 118, 1685
Carpenter, J. 2001, AJ, 121, 2851
285
Calchi Novati, S., de Luca, F., Jetzer, P., & Scarpetta, G. 2006, A&A, 459, 407‘
Chang, K., & Refsdal, S. 1979, Nature, 282, 561
Chang, K. & Refsdal, S. 1984, A&A, 130, 157
Chung, S.-J., et al. 2005, ApJ, 630, 535
Claret, A. 2000, A&A, 363, 1081
Corrigan, R. T., et al. 1991, AJ, 102, 34
Cox, A. N. 2000, Allen’s Astrophysical Quantities (4th Ed.; New York; Springer)
Cumming, A., et al. 2008, PASP, 120, 531
Delfosse, X., et al. 1998, A&A, 338, L67
Demarque, P., Woo, J.-H., Kim, Y.-C., & Yi, S. K. 2004, ApJS, 155, 667
Dominik, M. 1999a, A&A, 341, 943
Dominik, M. 1999b, A&A, 349, 108
Dominik, M. 2006, MNRAS, 367, 669
Dong, S., et al. 2006, ApJ, 642, 842
Dong, S., et al. 2009, ApJ, 695, 970
Dong, S., et al. 2009, ApJ, 698, 1826
Doran, M., & Mueller, C. M. 2004, Journal of Cosmology and Particle Astrophysics,9, 3
Drake, A. J., Cook, K. H., & Keller, S. C. 2004, ApJ, 607, L29
Duquennoy, A., & Mayor, M. 1991, A&A, 248, 485
Einstein, A. 1936, Science, 84, 506
Eisenhauer, F., et al. 2005, ApJ, submitted (astro-ph/0502129)
Endl, M., et al. 2006, ApJ, 649, 436
European Space Agency, 1997, The Hipparcos and Tycho Catalogues (SP-1200;Noordwijk: ESA)
Fischer, D. A., & Valenti, J. 2005, ApJ, 622, 1102
286
Frank, S. et al. 2007, Astrophysics and Space Science, 312, 3
Gaudi, B. S., & Gould, A. 1997, ApJ, 486, 85
Gaudi, B. S., & Sackett, P. D. 2000, ApJ, 528, 56
Gaudi, B. S., et al. 2002, ApJ, 566, 463
Gaudi, B. S. 2008, Astronomical Society of the Pacific Conference Series, 398, 479,ed. D. Fischer, F. Rasio, S. Thorsett, & A. Wolszczan, Extreme Solar Systems(arXiv:0711.1614)
Gaudi, B. S., Naber, R. M., & Sackett, P. D. 1998, ApJ, 502, L33
Gaudi, B. S., et al. 2008, Science, 319, 927
Ghosh, H., et al. 2004, ApJ, 615, 450
Gould, A. 1992, ApJ, 392, 442
Gould, A. 1994a, ApJ, 421, L71
Gould, A. 1994b, ApJ, 421, L75
Gould, A. 1995, ApJ, 441, L21
Gould, A. 1999, ApJ, 514, 869
Gould, A. 2000a, ApJ, 535, 928
Gould, A. 2000b, ApJ, 542, 785
Gould, A. 2004, ApJ, 606, 319
Gould, A. 2008, ApJ, 681, 1593
Gould, A. 2009, Astronomical Society of the Pacific Conference Series, 403, 86,ed. K.Z. Stanek, The Variable Universe: A Celebration of Bohdan Paczynski(arXiv:0803.4324)
Gould, A. & An, J.H. 2002, ApJ, 565, 1381
Gould, A. & Gaucherel, C. 1997, ApJ, 477, 580
Gould, A., & Loeb, A. 1992, ApJ, 396, 104
Gould, A. & Salim, S. 1999 ApJ, 524, 794
Gould, A., Miralda-Escude, J., & Bahcall, J. N. 1994, ApJ, 423, L105
287
Gould, A., Stutz, A., & Frogel, J. A. 2001, ApJ, 547, 590
Gould, A., Bennett, D. P., & Alves, D. R., 2004, ApJ, 614, 404
Gould, A. et al. 2006, ApJ, 644, L37
Gould, A., Gaudi, B. S., & Bennett, D. P. 2007, NASA/NSF Exoplanet Task ForceWhite Paper (arXiv:0704.0767)
Gould, A., et al. 2009, ApJ, 698, L147
Griest, K. & Safizadeh, N. 1998, ApJ, 500, 37
Han, C. 2007, ApJ, 661, 1202
Han, C., & Gould, A. 1995, ApJ, 447, 53
Han, C. & Gould, A. 1996, ApJ, 467, 540
Han, C. & Gould, A. 2003, ApJ, 592, 172
Han, C., & Gaudi, B. S. 2008, ApJ, 689, 53
Hao, H., et al. 2007, ApJ, 659, 99
Harris, J. & Zaritsky, D. 2006, AJ, 131, 2514
Hauschildt, P. H., Allard, F., & Baron, E. 1999, ApJ, 512, 377
Haywood, M. 2008, A&A, 482, 673
Huchra, J. et al. 1985, AJ, 90, 691
Ida, S., & Lin, D. N. C. 2004, ApJ, 616, 567
Ida, S., & Lin, D. N. C. 2005, ApJ, 626, 1045
Jiang, G. et al. 2005, ApJ, 617, 1307
Johnson, J. A., et al. 2007a, ApJ, 665, 785
Johnson, J. A., et al. 2007b, ApJ, 670, 833
Kallivayalil, N., van der Marel, R. P., & Alcock, C. 2006, ApJ, 652, 1213
Kayser, R. et al. 1986, A&A, 166, 36
Kennedy, G. M., Kenyon, S. J., & Bromley, B. C. 2006, ApJ, 650, L139
Kennedy, G. M., & Kenyon, S. J. 2008, ApJ, 673, 502
288
Kervella, P., et al. 2004, A&A, 426, 297
King, I. 1962, AJ, 67, 471
Kiraga, M., & Paczynski, B. 1994, ApJ, 430, L101
Kochanek, C. S. 1996, ApJ, 473, 610
Kochanek, C. S. 2004, ApJ, 605, 58
Kornet, K., Wolf, S., & Rozyczka, M. 2006, A&A, 458, 661
Kozlowski, S., Wozniak, P. R., Mao, S., & Wood, A. 2007, ApJ, 671, 420
Kubas, D., et al. 2008, A&A, 483, 317
Lanzetta, K. M. et al. 1987, ApJ, 322, 739
Laughlin, G., Bodenheimer, P., & Adams, F. C. 2004, ApJ, 612, L73
Lecar, M., Podolak, M., Sasselov, D., & Chiang, E. 2006, ApJ, 640, 1115
Lejeune, T., Cuisinier, F., & Buser, R. 1997, A&AS, 125, 229
Lejeune, T., Cuisinier, F., & Buser, R. 1998, A&AS, 130, 65
Lewis, G. F., & Ibata, R. A. 1998, ApJ, 501, 478
Liebes, S., Jr. 1964, Phys.Rev., 133, 835
Lin, D. N. C., Bodenheimer, P., & Richardson, D. C. 1996, Nature, 380, 606
Lissauer, J. J. 1987, Icarus, 69, 249
Manfroid, J. 1995, A&AS, 113, 587
Mao, S., & Paczynski, B. 1991, ApJ, 374, L37
Marcy, G. W., et al 1998, ApJ, 505, L147
Marcy, G. W., et al 2001, ApJ, 556, 296
Mao, S. & Paczynski, B. 1991, ApJ, 374, L37
Mortonson, M. J. et al. 2005, ApJ, 628, 594
Nishiyama, S., et al. 2005, ApJ, 621, L105
Paczynski, B. 1986, ApJ, 304, 1
289
Park, B.-G. et al. 2004, ApJ, 609, 166
Park, B.-G., Jeon, Y.-B., Lee, C.-U., & Han, C. 2006, ApJ, 643, 1233
Pejcha, O., & Heyrovsky, D. 2009, ApJ, 690, 1772
Pollack, J. B., Hubickyj, O., Bodenheimer, P., Lissauer, J. J., Podolak, M., &Greenzweig, Y. 1996, Icarus, 124, 62
Poindexter, S. et al. 2005, ApJ, 633, 914
Prochter, G. E., et al. 2006, ApJ, 648, L93
Rasio, F. A., & Ford, E. B. 1996, Science, 274, 954
Rattenbury, N. J., Bond, I. A., Skuljan, J., & Yock, P. C. M. 2002, MNRAS, 335,159
Rauch, M. et al. 2002, ApJ, 576, 45
Reach, W. T., et al. 2005, PASP, 117, 978
Read, J.I., Wilkinson, M.I., Evans, N.W., Gilmore, G., Kleyna, J.T. 2006, MNRAS,366, 429
Refsdal, S. 1966, MNRAS, 134, 315
Reid, N. 1991, AJ, 102, 1428
Rhie, S. H., et al. 2000, ApJ, 533, 378
Rivera, E. J., et al. 2005, ApJ, 634, 625
Sackett, P. D. 1997, ESO Document: SPG-VLTI-97/002 (astro-ph/9709269)
Sahu, K. C., et al. 2001, Nature, 411, 1022
Sahu, K. C., Anderson, J., & King, I. R. 2002, ApJ, 565, L21
Santos, N. C., Israelian, G., & Mayor, M. 2004, A&A, 415, 1153
Schechter, P.L., Mateo, M. & Saha, A. 1993, PASP, 105 1342
Smith, M. C., Mao, S., & Paczynski, B. 2003, MNRAS, 339, 925
Stanek, K. Z., et al. 1997, ApJ, 477, 163
Stanimirovic, S. Staveley-Smith, L., & Jones, P.A. ApJ, 604, 176
Sumi, T. 2004, MNRAS, 349, 193
290
Tisserand, P. et al. 2007, A&A, 469, 387
Treyer, M., & Wambsganss, J. 2004, A&A, 416, 19
Udalski, A. 2003, Acta Astron., 53, 291
Udalski, A., et al. 1993, Acta Astronomica, 43, 289
Udalski, A., et al. 2005, ApJ, 628, L109
Walker, M. A. 1995, ApJ, 453, 37
Wambsganss, J. 1990, Ph.D. Thesis
Wambsganss, J. 2006, Saas-Fee Advanced Course 33: Gravitational Lensing: Strong,Weak and Micro, 453
Ward, W. R. 1997, Icarus, 126, 261
Witt, H. J. 1990, A&A, 236, 311
Wolszczan, A., & Frail, D. A. 1992, Nature, 355, 145
Wozniak, P. R. 2000, Acta Astron., 50, 421
Wozniak, et al. 2000, ApJ, 529, 88
Yoo, J., et al. 2004a, ApJ, 603, 139
Yoo, J., et al. 2004b, ApJ, 616, 1204
Yoo, J., Chaname, J., & Gould, A. 2004, ApJ, 601, 311
Zaritsky, D., Harris, J., Grebel, E.K., & Thompson, I.B. 2000, ApJ, 534, L53
Zibetti, S., et al. 2005, ApJ, 631, L105
291