Modern Machine Learning: New Challenges and Connections
Maria-Florina Balcan
Machine Learning is Changing the World
“A breakthrough in machine learning would be worth ten Microsofts” (Bill Gates, Microsoft)
“Machine learning is the hot new thing” (John Hennessy, President, Stanford)
“Web rankings today are mostly a matter of machine learning” (Prabhakar Raghavan, VP Engineering at Google)
The World is Changing Machine Learning
New applications; explosion of data.
Modern applications: massive amounts of raw data; only a tiny fraction can be annotated by human experts.
E.g., billions of webpages, images, protein sequences.
Modern ML: New Learning Approaches
Modern applications: massive amounts of raw data.
• Semi-supervised Learning, (Inter)active Learning: techniques that best utilize data, minimizing the need for expert/human intervention.
Modern ML: New Learning Approaches
Modern applications: massive amounts of data distributed across multiple locations. E.g.,
• scientific data
• video data
Key new resource: communication.
The World is Changing Machine Learning
Resource constraints. E.g.,
• Computational efficiency (noise tolerant algos)
• Communication
• Human labeling effort
• Statistical efficiency
• Privacy/Incentives
New approaches. E.g.,
• Semi-supervised learning
• Distributed learning
• Interactive learning
• Multi-task/transfer learning
• Never ending learning
• Community detection
The World is Changing Machine Learning
My research: foundations, robust algorithms.
Applications: vision, comp. biology, social networks.
Machine Learning is Changing the World
My research: Machine Learning Theory as a way to model and tackle challenges in other fields, e.g., Game Theory, Approximation Algorithms, Matroid Theory, Discrete Optimization, Mechanism Design, Control Theory.
Outline of the talk
• Modern Learning Paradigms
• Interactive Learning
• Submodularity: implications for Matroid Theory, Algorithmic Game Theory, Optimization
• Machine Learning Lenses in Other Areas
• Discussion, Other Exciting Directions
Supervised Learning
• E.g., which emails are spam and which are important (spam vs. not spam).
• E.g., classify objects as chairs vs. non-chairs.
Statistical / PAC Learning Model
Data Source: distribution D on X. Expert / Oracle labels examples with target c*: X → {0,1}.
• Learning algorithm sees labeled examples (x_1, c*(x_1)), …, (x_m, c*(x_m)), with x_i drawn i.i.d. from D.
• It does optimization over the sample S and finds a hypothesis h: X → {0,1} in C.
• Goal: h has small error, err(h) = Pr_{x∼D}[h(x) ≠ c*(x)].
• If c* ∈ C: realizable case; otherwise agnostic.
Two Main Aspects in Classic Machine Learning
Algorithm Design. How to optimize? Automatically generate rules that do well on observed data. E.g., Boosting, SVM, etc.
Generalization Guarantees, Sample Complexity. Confidence for rule effectiveness on future data; e.g., m = O((1/ε)(VCdim(C) log(1/ε) + log(1/δ))) labeled examples suffice. (A numeric reading of this bound follows.)
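For concreteness, a minimal sketch (in Python) that plugs numbers into this sample-complexity formula. The leading constant `c` is an assumption, since the theorem hides it inside the O(·); the function name is illustrative.

```python
# Numeric reading of the classic PAC sample-complexity bound
# m = O((1/eps) * (VCdim(C) * log(1/eps) + log(1/delta))).
# The leading constant c is hidden by the O(.); we take c = 1 here
# just to see how m scales (an assumption, not part of the theorem).
import math

def pac_sample_bound(vcdim: int, eps: float, delta: float, c: float = 1.0) -> int:
    """Samples sufficient (up to constants) to learn a class of the given
    VC dimension to error eps with confidence 1 - delta."""
    return math.ceil(c / eps * (vcdim * math.log(1 / eps) + math.log(1 / delta)))

# Linear separators in R^d have VCdim d + 1; e.g., d = 100, eps = 0.01:
print(pac_sample_bound(vcdim=101, eps=0.01, delta=0.05))  # ~46800
```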
Interactive Machine Learning
• Active Learning
• Learning with more general queries; connections
Active Learning
[Figure: raw data → Classifier, which queries the Expert Labeler for labels of selected examples (face vs. not face).]
Active Learning in Practice
• Text classification: active SVM (Tong & Koller, ICML 2000); e.g., request the label of the example closest to the current separator.
• Video segmentation (Fathi-Balcan-Ren-Rehg, BMVC 2011).
Canonical theoretical example [CAL92, Dasgupta04]: learning a threshold on the line.
• Passive supervised: Ω(1/ε) labels to find an ε-accurate threshold.
• Active algorithm: sample 1/ε unlabeled examples; do binary search. Only O(log(1/ε)) labels.
Exponential improvement. (A runnable sketch follows.)
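A minimal runnable sketch of this canonical example, assuming the unlabeled distribution is uniform on [0, 1]; the pool size 2/ε and the helper name `active_learn_threshold` are illustrative choices, not part of the cited results.

```python
# Learning a threshold on [0, 1] (label = 1 iff x >= c*). With ~1/eps
# unlabeled draws, binary search over the sorted sample finds an
# eps-accurate threshold with only O(log(1/eps)) label requests, versus
# Omega(1/eps) labels for passive learning.
import random

def active_learn_threshold(c_star: float, eps: float, seed: int = 0):
    rng = random.Random(seed)
    xs = sorted(rng.random() for _ in range(int(2 / eps)))  # unlabeled pool
    labels_used = 0
    lo, hi = 0, len(xs)  # invariant: the threshold lies within xs[lo:hi]
    while lo < hi:       # binary search: one label query per halving
        mid = (lo + hi) // 2
        labels_used += 1
        if xs[mid] >= c_star:   # query the label of xs[mid]
            hi = mid
        else:
            lo = mid + 1
    threshold = xs[lo] if lo < len(xs) else 1.0
    return threshold, labels_used

threshold, labels_used = active_learn_threshold(c_star=0.613, eps=0.001)
print(threshold, labels_used)  # ~0.613, using only ~11 label queries
```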
Provable Guarantees, Active Learning
My work:
• First noise-tolerant algorithm [Balcan, Beygelzimer, Langford, ICML'06]: generic (any class), adversarial label noise.
Substantial follow-up work; lots of exciting activity in recent years:
[Hanneke'07, Dasgupta-Hsu-Monteleoni'07, Wang'09, Fridman'09, Koltchinskii'10, BHW'08, Beygelzimer-Hsu-Langford-Zhang'10, Hsu'10, Minsker'12, Ailon'12, …]
Provable Guarantees, Active Learning
• First computationally efficient algorithms [Balcan-Long COLT'13] [Awasthi-Balcan-Long STOC'14]

Margin Based Active Learning [Balcan-Long COLT'13] [Awasthi-Balcan-Long STOC'14]
Learning linear separators, when D is log-concave in R^d.
• Realizable: exponential improvement in label complexity over passive [only O(d log(1/ε)) labels to find w with error ε].
• Agnostic: poly-time active learning algorithm outputs w with err(w) = O(η), where η = err(best linear separator).
• Unexpected implications for passive learning!
• Computational complexity: improves significantly on the best guarantees known for passive learning [KKMS'05], [KLS'09].
• Sample complexity: resolves an open question about ERM.
Margin Based Active-Learning, Realizable Case
Draw m_1 unlabeled examples, label them, add them to W(1).
Iterate k = 2, …, s:
• find a hypothesis w_{k-1} consistent with W(k-1);
• set W(k) = W(k-1);
• sample m_k unlabeled examples x satisfying |w_{k-1} · x| ≤ γ_{k-1};
• label them and add them to W(k).
(A runnable sketch of this loop follows.)
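A hedged sketch of the loop above, under assumptions: D is a standard Gaussian in R^d (log-concave), a perceptron run to consistency stands in for the "find a consistent hypothesis" step, and the band schedule γ_k ∝ 2^{-k} is illustrative; the theory prescribes specific constants that are not tuned here.

```python
# Margin-based active learning, realizable case (illustrative sketch).
import numpy as np

rng = np.random.default_rng(0)
d, rounds, m_per_round = 5, 8, 200
w_star = rng.normal(size=d)
w_star /= np.linalg.norm(w_star)           # hidden target separator
label = lambda X: np.sign(X @ w_star)      # the "expert"

def consistent_separator(X, y, epochs=500):
    """Perceptron epochs; stops early once w is consistent with (X, y),
    otherwise returns the current iterate (good enough for a sketch)."""
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        updated = False
        for xi, yi in zip(X, y):
            if yi * (xi @ w) <= 0:
                w += yi * xi
                updated = True
        if not updated:
            break
    return w / np.linalg.norm(w)

X = rng.normal(size=(m_per_round, d))
W_X, W_y = X, label(X)                     # round 1: random labeled draws
for k in range(2, rounds + 1):
    w = consistent_separator(W_X, W_y)
    gamma = 2.0 ** -(k - 1)                # shrinking margin band (illustrative)
    band = []
    while len(band) < m_per_round:         # rejection-sample inside the band
        x = rng.normal(size=d)
        if abs(w @ x) <= gamma:
            band.append(x)
    Xk = np.asarray(band)
    W_X = np.vstack([W_X, Xk])
    W_y = np.concatenate([W_y, label(Xk)]) # label only the band examples

w = consistent_separator(W_X, W_y)
print("angle to w*:", float(np.arccos(np.clip(w @ w_star, -1.0, 1.0))))
```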
Log-concave distributions: log of the density function is concave.
• Wide class: uniform distribution over any convex set, Gaussian, etc.

Theorem: If D is log-concave in R^d, then after s = O(log(1/ε)) rounds, using O(d) label requests per round and poly(d, 1/ε) unlabeled examples, err(w_s) ≤ ε.
• Active learning: O(d log(1/ε)) label requests in total.
• Passive learning: Ω(d/ε) label requests.
Analysis: Aggressive Localization
Induction: all w consistent with W(k) have err(w) ≤ 1/2^k.
• Outside the band |w_{k-1} · x| ≤ γ_{k-1}, any such w disagrees with w* on probability mass ≤ 1/2^{k+1} (taking the band too wide would be suboptimal).
• Enough to ensure small error inside the band; need only O(d) labels in round k.
Localization in concept space.
Margin Based Active-Learning, Agnostic Case
Draw m_1 unlabeled examples, label them, add them to W.
Iterate k = 2, …, s:
• find w_{k-1} in the ball B(w_{k-2}, r_{k-1}) with small τ_{k-1}-hinge loss w.r.t. W;
• clear the working set W;
• sample m_k unlabeled examples x satisfying |w_{k-1} · x| ≤ γ_{k-1};
• label them and add them to W.
end iterate
Localization in instance space.
• Analysis: exploit localization and a variance analysis to control the gap between hinge loss and 0/1 loss in each round. (A sketch of the localized hinge-loss step follows.)
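A sketch of the key localized step, under stated assumptions: projected subgradient descent stands in for the convex solver, and the values of r, τ, the step size, and the helper name `localized_hinge_step` are all illustrative rather than the constants from the analysis.

```python
# One localized step: approximately minimize the tau-scaled hinge loss
# over the current band sample, constrained to a ball B(w_prev, r)
# around the previous iterate (the "localization in concept space").
import numpy as np

def localized_hinge_step(X, y, w_prev, r=0.5, tau=0.1, steps=500, lr=0.05):
    """Approximately minimize mean(max(0, 1 - y*(X@w)/tau)) over w in B(w_prev, r)."""
    w = w_prev.copy()
    for _ in range(steps):
        margins = y * (X @ w) / tau
        active = margins < 1                       # points with nonzero hinge loss
        if active.any():                           # subgradient of the mean loss
            grad = -(y[active, None] * X[active]).sum(axis=0) / (tau * len(y))
            w = w - lr * grad
        diff = w - w_prev                          # project back onto B(w_prev, r)
        norm = np.linalg.norm(diff)
        if norm > r:
            w = w_prev + (r / norm) * diff
    return w

# Toy usage on synthetic band data (illustrative only):
rng = np.random.default_rng(1)
X = rng.normal(size=(500, 5))
y = np.sign(X @ np.array([1.0, 0.5, 0.0, 0.0, 0.0]))
w0 = np.ones(5) / np.sqrt(5)
print(localized_hinge_step(X, y, w0))
```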
Improves over Passive Learning too!

            Passive learning,                   Passive learning,         Active learning, our work
            prior work                          our work                  [agnostic/malicious]
Malicious   err(w) = O(η^{1/3} log^{2/3}(d/η))  err(w) = O(η log²(1/η))   err(w) = O(η log²(1/η))
            [KLS'09]
Agnostic    err(w) = O(η^{1/3} log^{1/3}(1/η))  err(w) = O(η log²(1/η))   (prior active work: NA)
            [KLS'09]
Improves over Passive Learning too!
Slightly better results for the uniform distribution case:

            Passive learning,                   Passive learning,          Active learning, our work
            prior work                          our work                   [agnostic/malicious]
Malicious   err(w) = O(η d^{1/4}) [KLS'09],     err(w) = O(η)              err(w) = O(η)
            err(w) = O(η log(d/η)) [KKMS'05]    (info-theoretically opt.)  (info-theoretically opt.)
Agnostic    err(w) = O(η log(1/η)) [KKMS'05]    err(w) = O(η)              (prior active work: NA)
                                                (info-theoretically opt.)
Useful for active and passive learning!
Localization is both an algorithmic and an analysis tool!

Important direction: richer interactions with the expert.
[Figure: raw data → Classifier ↔ Expert Labeler; goals: fewer queries, natural interaction, better accuracy.]
New Types of Interaction
[Figure: raw data → Classifier ↔ Expert Labeler via class conditional queries and mistake queries; example classes: dog, cat, penguin, wolf.]
Class Conditional & Mistake Queries
• Used in practice, e.g., Faces in iPhoto.
• Lack of theoretical understanding.
• Realizable case (folklore): far fewer queries than label requests.
Our work [Balcan-Hanneke COLT'12]: tight bounds on the number of CCQs needed to learn in the presence of noise (agnostic and bounded noise).
Interaction for Faster Algorithms
E.g., clustering protein sequences by function; here the expert is an expensive computer program.
Voevodski-Balcan-Roglin-Teng-Xia [UAI'10, SIMBAD'11, JMLR'12]
• BLAST computes the similarity of a sequence to all other objects in the dataset (one-vs-all queries).
• Strong guarantees under a natural stability condition: produce accurate clusterings with only k one-vs-all queries (sketched below).
• Better accuracy than state-of-the-art techniques using limited information on standard datasets (Pfam and SCOP).
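A minimal sketch of the flavor of this approach: pick k landmarks, issue one one-vs-all similarity query per landmark, and assign each object to its most similar landmark. This is not the exact algorithm of Voevodski et al.; their landmark-based method and its stability-based guarantees are more refined, and the names below are illustrative.

```python
# Clustering with k one-vs-all similarity queries (illustrative flavor).
import random

def one_vs_all_cluster(objects, similarity, k, seed=0):
    """similarity(a) returns {b: sim(a, b)} for all b -- one query,
    as BLAST does when comparing one sequence against a whole dataset."""
    rng = random.Random(seed)
    landmarks = rng.sample(objects, k)
    answers = {l: similarity(l) for l in landmarks}     # k queries total
    return {
        x: max(landmarks, key=lambda l: answers[l][x])  # most similar landmark
        for x in objects
    }

# Toy usage with a made-up similarity oracle (closer = more similar):
objs = list(range(12))
sim = lambda a: {b: -abs(a - b) for b in objs}
print(one_vs_all_cluster(objs, sim, k=3))
```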
Interactive Learning
Summary:
• First noise-tolerant, poly-time, label-efficient algorithms for high-dimensional cases. [BL'13] [ABL'14]
• Learning with more general queries. [BH'12]
• Active & differentially private learning. [Balcan-Feldman, NIPS'13]
Cool implications:
• Sample & computational complexity of passive learning.
• Communication complexity, distributed learning. [Balcan-Blum-Fine-Mansour, COLT'12] (Runner-up, Best Paper)
Learning Theory Lenses on Submodularity
Submodular functions: discrete functions that model laws of diminishing returns. Studied in:
• Optimization, operations research [Lehmann-Lehmann-Nisan'01], …
• Algorithmic Game Theory
• Machine Learning [Bilmes'03], [Guestrin-Krause'07], …
• Social Networks [Kempe-Kleinberg-Tardos'03]
New Lenses on Submodularity [Balcan & Harvey, STOC 2011 & Nectar Track, ECML-PKDD 2012]
Novel structural results connecting Learning Submodular Functions with Algorithmic Game Theory / Economics, Matroid Theory, and Discrete Optimization.
Applications: Economics, Social Networks, …
Submodular functions
• V = {1, 2, …, n}; a set function f: 2^V → R is submodular if
for all T ⊆ S and x ∉ S: f(T ∪ {x}) − f(T) ≥ f(S ∪ {x}) − f(S).
E.g., adding x to the smaller set T gives a large improvement; adding it to the larger set S gives only a small improvement (diminishing returns).
Submodular functions
More examples:
• Concave functions: let h: R → R be concave; for each S ⊆ V, let f(S) = h(|S|).
• Vector spaces: let V = {v_1, …, v_n}, each v_i ∈ R^n; for each S ⊆ V, let f(S) = rank(V[S]).
• Coverage functions: let A_1, …, A_n be sets; for each S ⊆ V, let f(S) = |∪_{j∈S} A_j| (see the sketch below).
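A small self-contained check using the coverage function from the last bullet: brute-force verification of the diminishing-returns inequality on a toy instance (the sets A_j below are made up for illustration).

```python
# Coverage function f(S) = |union of A_j for j in S|, plus a brute-force
# check of f(T + x) - f(T) >= f(S + x) - f(S) for all T subset of S, x not in S.
from itertools import chain, combinations

A = {1: {"a", "b"}, 2: {"b", "c"}, 3: {"c", "d", "e"}}   # ground sets A_j

def f(S):
    return len(set().union(*(A[j] for j in S)))          # coverage value

def subsets(U):
    U = list(U)
    return chain.from_iterable(combinations(U, k) for k in range(len(U) + 1))

V = set(A)
assert all(
    f(set(T) | {x}) - f(set(T)) >= f(set(S) | {x}) - f(set(S))
    for S in subsets(V)
    for T in subsets(S)
    for x in V - set(S)
)
print("diminishing returns holds on this instance")
```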
Learning submodular functions
• Influence function in social networks: V = set of nodes; f(S) = expected influence when S is the originating set of nodes in the diffusion.
• Valuation functions in economics: V = all the items you sell; f(S) = valuation on a set of items S.
PMAC model for learning real-valued functions
Data Source: distribution D on 2^[n]. Expert / Oracle provides values of a target f: 2^[n] → R_+.
• Learning algorithm sees (S_1, f(S_1)), …, (S_m, f(S_m)), with S_i drawn i.i.d. from D, and outputs g: 2^[n] → R_+.
Probably Mostly Approximately Correct:
• With probability ≥ 1 − δ, we have Pr_{S∼D}[g(S) ≤ f(S) ≤ α·g(S)] ≥ 1 − ε. (An empirical reading of this criterion is sketched below.)
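A small empirical reading of the PMAC criterion, with toy stand-ins for f and g (both invented purely for illustration): draw fresh S ∼ D and estimate Pr_S[g(S) ≤ f(S) ≤ α·g(S)].

```python
# Estimate the "mostly approximately correct" probability in the PMAC
# definition by Monte Carlo; PMAC-learning asks that this be >= 1 - eps,
# with probability >= 1 - delta over the draw of the training sample.
import math
import random

n = 20
f = lambda S: math.sqrt(len(S))              # toy submodular target: h(|S|), h concave
g = lambda S: math.sqrt(max(1, len(S))) / 2  # toy "learned" hypothesis

def pmac_success_rate(alpha, n_samples=100_000, seed=0):
    rng = random.Random(seed)
    hits = sum(
        g(S) <= f(S) <= alpha * g(S)
        for S in ({j for j in range(n) if rng.random() < 0.5}  # D = uniform on 2^[n]
                  for _ in range(n_samples))
    )
    return hits / n_samples

print(pmac_success_rate(alpha=4.0))  # fraction of mass where g alpha-approximates f
```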
Learning submodular functions [BH'11]
Theorem (general lower bound): No algorithm can PMAC-learn the class of submodular functions with an approximation factor of Õ(n^{1/3}).
Corollary: Matroid rank functions do not have a concise, approximate representation.
(A surprising answer to an open question in economics of Noam Nisan and Paul Milgrom.)
Theorem (general upper bound): There is a poly-time algorithm for PMAC-learning (w.r.t. an arbitrary distribution) with an approximation factor α = O(n^{1/2}).
• The lower bound holds even if value queries are allowed, and even for rank functions of matroids.
A General Lower Bound
Theorem: No algorithm can PMAC-learn the class of non-negative, monotone, submodular functions with an approximation factor of Õ(n^{1/3}).
Construction: a hard family of matroid rank functions. Take L = n^{log log n} sets A_1, A_2, …, A_L; each can independently take rank value Low = log²(n) or High = n^{1/3}.
A vast generalization of partition matroids.
Exploit Additional Properties
• E.g., product distributions: BH'11 (Lipschitz submodular), FV'13 (general submodular).
• Learning valuation functions from Algorithmic Game Theory and Economics.
• Learning influence functions in social networks.
Learning Valuation Functions
Target-dependent learnability for classes from Algorithmic Game Theory and Economics:
Additive ⊆ OXS ⊆ GS ⊆ Submodular ⊆ XOS ⊆ Subadditive
[Figure: an XOS valuation written as an XOR of two OR trees (labeled Romania and Switzerland) over priced items: ({1}, $2), ({2}, $5) and ({2}, $6), ({1}, $10), ({3}, $5).]
[Balcan-Constantin-Iwata-Wang, COLT 2012]
Theorem (poly number of XOR trees): O(n^ε) approximation in time O(n^{1/ε}).
Learning the Influence Function
Influence function in social networks: a coverage function. [Du, Liang, Balcan, Song 2014]
• Maximum likelihood approach; promising empirical performance.
• MemeTracker dataset, blog data cascades: "apple and jobs", "tsunami earthquake", "william kate marriage".
New Lenses on Submodularity [Balcan & Harvey, STOC 2011 & Nectar Track, ECML-PKDD 2012]
Novel structural results connecting Learning Submodular Functions with Algorithmic Game Theory / Economics, Matroid Theory, and Discrete Optimization.
Follow-up work: Algorithmic Game Theory, Machine Learning, Discrete Optimization, Privacy.
Discussion. Other Exciting Directions

Themes of My Research
• Foundations for Modern ML and Applications
• Learning Lens on Other Fields, Connections: Game Theory, Approximation Algorithms, Matroid Theory, Discrete Optimization, Mechanism Design, Control Theory
Foundations for Modern Paradigms
Resource constraints. E.g.,
• Computational efficiency (noise tolerant algos)
• Communication
• Human labeling effort
• Statistical efficiency
• Privacy/Incentives
New approaches. E.g.,
• Semi-supervised learning
• Distributed learning
• Interactive learning
• Never ending learning
• Community detection
• Multi-task/transfer learning
Analysis Beyond the Worst Case
• Identify ways to capture the structure of real-world instances; analysis beyond the worst case.
• E.g., circumvent NP-hardness barriers for objective-based clustering (e.g., k-means, k-median).
• Perturbation resilience: small changes to input distances shouldn't move the optimal solution by much.
• Approximation stability: near-optimal solutions shouldn't look too different from the optimal solution.
[BBG'09] [BB'09] [BRT'10] [BL'12]
Analysis Beyond the Worst Case
Major direction:
• Identify ways to capture the structure of real-world instances, analysis beyond the worst case.
• Understand characteristics of practical tasks that make them easier to learn.
• Develop learning tools that take advantage of such characteristics.
Understanding & Influencing Multi-agent Systems
What do we expect to happen?
• Traditional: analyze equilibria (a static concept).
• More realistic: view agents as adapting, learning entities.
Major question: Can we "guide" or "nudge" dynamics to high-quality states quickly [using minimal intervention]?
Themes of My Research
• Foundations for Modern ML and Applications
• Learning Lens on Other Fields, Connections: Game Theory, Approximation Algorithms, Matroid Theory, Discrete Optimization, Mechanism Design, Control Theory