The Cross Language Information Access (CLIA) Projectfire/2012/slides/clia_mandar_fire2012.pdf ·...
Transcript of The Cross Language Information Access (CLIA) Projectfire/2012/slides/clia_mandar_fire2012.pdf ·...
The Cross Language Information Access (CLIA)Project
Mandar Mitra
Indian Statistical Institute
M.Mitra (ISI) CLIA project 1 / 9
Sandhan: a search engine
http://tdil-dc.in/sandhan
M.Mitra (ISI) CLIA project 2 / 9
Goal
Assigned task: Create a portal where1 a user will be able to give a query in one Indian language;
2 s/he will be able to access documents available in the language ofthe query, Hindi (if the query language is not Hindi), and English,
3 all presented to the user in the language of the query.
Languages:Bangla Hindi MarathiPunjabi Tamil TeluguAssamese Gujarati Odia
M.Mitra (ISI) CLIA project 3 / 9
Goal
Assigned task: Create a portal where1 a user will be able to give a query in one Indian language;
2 s/he will be able to access documents available in the language ofthe query, Hindi (if the query language is not Hindi), and English,
3 all presented to the user in the language of the query.
Languages:Bangla Hindi MarathiPunjabi Tamil TeluguAssamese Gujarati Odia
M.Mitra (ISI) CLIA project 3 / 9
Goal
Assigned task: Create a portal where1 a user will be able to give a query in one Indian language;
2 s/he will be able to access documents available in the language ofthe query, Hindi (if the query language is not Hindi), and English,
3 all presented to the user in the language of the query.
Languages:Bangla Hindi MarathiPunjabi Tamil Telugu
Assamese Gujarati Odia
M.Mitra (ISI) CLIA project 3 / 9
Goal
Assigned task: Create a portal where1 a user will be able to give a query in one Indian language;
2 s/he will be able to access documents available in the language ofthe query, Hindi (if the query language is not Hindi), and English,
3 all presented to the user in the language of the query.
Languages:Bangla Hindi MarathiPunjabi Tamil TeluguAssamese Gujarati Odia
M.Mitra (ISI) CLIA project 3 / 9
Sandhan: a search engine
http://tdil-dc.in/sandhan
M.Mitra (ISI) CLIA project 4 / 9
History
Sponsored by the TDIL group, Dept. of Information Technology(DIT), Govt. of India
Sanctioned in August 2006
Work started in early 2007
Phase I ended in 2010 (tourism)
Officially launched by DIT in September, 2012
Mirrored at IIT Bombay
Phase II started in 2011 (general purpose)
M.Mitra (ISI) CLIA project 5 / 9
Consortium members
Anna University – College of Engg., Guindy
Anna University – KBC centre
CDAC – Noida
CDAC – Pune
IIIT Hyderabad
IIT Bombay (coordinating instt.)
IIT Kharagpur
Indian Statistical Institute
Jadavpur University
Utkal University
+
Guwahati University
DAIICT Gandhinagar
IIIT Bhubaneswar
M.Mitra (ISI) CLIA project 6 / 9
Consortium members
Anna University – College of Engg., Guindy
Anna University – KBC centre
CDAC – Noida
CDAC – Pune
IIIT Hyderabad
IIT Bombay (coordinating instt.)
IIT Kharagpur
Indian Statistical Institute
Jadavpur University
Utkal University
+
Guwahati University
DAIICT Gandhinagar
IIIT Bhubaneswar
M.Mitra (ISI) CLIA project 6 / 9
Where does FIRE fit in?
Two major components: system development + evaluation
Evaluation component ⇒ FIRE(general domain from the outset)
Institutes: ISI + DAIICT
M.Mitra (ISI) CLIA project 8 / 9