Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct...
![Page 1: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/1.jpg)
Carlos Maltzahn, UC Santa Cruz
Presented at BCS Open Source Specialist Group, June 11, 2020
Some lessons learned from creating and using the Ceph open source storage system
![Page 2: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/2.jpg)
About me
Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz
Founder & Director, Center for Research in Open Source Software (cross.ucsc.edu)
Co-Founder & Director, UCSC Systems Research Laboratory (SRL)
1999-2004: Performance Engineer, Netapp
Advising 5 Ph.D. students
Graduated 9 Ph.D. students, 9 M.S. students
people.ucsc.edu/carlosm
Current Research:
● Programmable Storage Systems (programmability.us)
● Big Data Storage & Processing
● Scalable Data Management
● Distributed Systems Performance Management
● Practical Falsifiable Research (falsifiable.us)
Past Research:
● Team processes in repositories [IJICIS’92]
● Network Intermediaries [SIGMETRICS’97]
● Automatic Behavioral Modeling of HDDs [MSST’14]
● Data Management Games [GamifIR’14]
![Page 3: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/3.jpg)
Current Funding
NSF/OAC: Institute for Research and Innovation in Software for High Energy Physics (IRIS-HEP)(2018-2023)
NSF/CNS: Declarative Programmable Storage (with Peter Alvaro, 2018-2021)
NSF/CNS: Data Center Scale Programmable Storage (with Dirk Grunwald, CU Boulder, 2017-2020)
DOE/SSIO: SIRIUS: Science-driven Data Management for Multi-tiered Storage (with ORNL, Sandia, Rutgers U, 2016-2019)
NSF/OAC: Big Weather Web: A Common and Sustainable Big Data Infrastructure in Support of Weather Prediction Research and Education in Universities (with 7 other Universities, NCAR, and one startup, 2015-2019)
CROSS: Skyhook: Programmable Storage for Databases (2016-)
CROSS: Eusocial Storage Devices (2019-)
CROSS: Black Swan: The Popper Reproducibility Platform (2019-)
CROSS: Mapping Datasets to Object Storage (2019-)
![Page 4: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/4.jpg)
Ph.D. students, research faculty, and staff
Graduated PhD in 2019
Dr. Jeff LeFevre
Jianshen Liu
Yiming Zhang
Graduated PhDs in previous years
Research Staff
Alexander Ames (2011, LLNL)
Joe Buck (2014, Okera)
Adam Crume (2015, Google)
Latchesar Ionkov (2018, LANL)
Michael Sevilla (2018, TidalScale)
Andrew Shewmaker (2016, OpenEye)
Dimitrios Skourtis (2014, IBM Almaden)
Noah Watkins (2018, vectorized.io)
3rd Year
4th Year
5th Year
Ivo Jimenez (UCSC)
Dr. Ivo Jimenez
Dr. Kate Compton
1st Year
Saheed Adepoju
Esmaeil Mirvakili
Research Faculty
Aaron Chu
![Page 5: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/5.jpg)
Sage Weil & the beginnings of Ceph
• 2004: Sage’s summer internship at LLNL, SC’04 paper
• 2005: “Ceph” summer project -> thesis topic
• 2006: OSDI’06 and SC’06 papers
• Published source code licensed under LGPL-2.1
• 2007: Sage’s Ph.D. defense
• 2008: Sage continues to work on Ceph
• 2010: Ceph Linux kernel client in 2.6.34
• 2011: Inktank Storage startup
• 2014: Red Hat acquires Inktank for $175m
![Page 6: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/6.jpg)
Users & Contributors in Industry
A wide range of companies are making long-term strategic bets on Ceph, and that number is growing!
Here are a few users to give you an idea of the breadth of use that Ceph is seeing in the ecosystem today.
Source: ceph.io
![Page 7: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/7.jpg)
A Transformative Gift by Alumnus Sage Weil
❑ $500K Gift + $500K UC Matching for The Sage Weil Presidential Chair in Open Source Software
❑ $2M gift to establish CROSS and support open source software research
![Page 8: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/8.jpg)
Sage Weil & the beginnings of Ceph
• 2004: Sage’s summer internship at LLNL, SC’04 paper
• 2005: “Ceph” summer project -> thesis topic
• 2006: OSDI’06 and SC’06 papers
• Published source code licensed under LGPL
• 2007: Sage’s Ph.D. defense
• 2008: Sage continues to work on Ceph
• 2010: Ceph Linux kernel client in 2.6.34
• 2011: Inktank Storage startup
• 2014: Red Hat acquires Inktank for $175m
Gap!
![Page 9: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/9.jpg)
• Bridges gap between student research & open source projects
• Funded by Sage Weil endowment & corporate memberships
Education: Teach students how to productively engage in open source communities
Research: Fund high-impact research with plausible path to successful open source projects
Incubator: Incubate developer communities around research prototypes
UC Santa Cruz Faculty and Students
Members of Industry & Gov’t
Talent, Projects, Technologies
![Page 10: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/10.jpg)
Governance
Carlos Maltzahn, Director
Stephanie Lieggi, Assistant Director
Lavinia Preston, Admin Assistant
Industry Advisory Board
Advisory Committee
Doug Cutting (Chief Architect, Cloudera): Founder of numerous groundbreaking open-source projects, including Lucene, Nutch, Avro, Hadoop
Karen Sandler (Executive Director, Software Freedom Conservancy): Former executive director of GNOME Foundation, former general counsel of Software Freedom Law Center
Nissa Strottman (VP, Technology, IP, Innovation, Strategic Partnerships, VISA): Years of experience in legal and M&A in the open-source software space
Sage Weil (Chief Architect of Ceph, Red Hat): Creator of WebRing, founder of DreamHost, founder and CTO of Inktank, recipient of O’Reilly Open Source Award
James Davis (Professor of Computer Science and Engineering): Founding Director of the UCSC Center for Entrepreneurship; sits on advisory councils for startups and nonprofits. Interested in research applying technology to address global social issues, and has developed an award-winning course around this theme.
![Page 11: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/11.jpg)
![Page 12: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/12.jpg)
Advisory Committee
Industry Advisory Board
Why these companies? Leverage Ceph to create and shape new markets!
$100,000 / year / member
![Page 13: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/13.jpg)
Operational Model
• Modelled after NSF’s I/UCRCs
• Adds open-source software focus
• Sustained through membership fees
• Call for research and incubator proposals twice a year
• Topics of interest informed by industry members
• Review of projects twice a year
• Project selection for funding informed by industry members
Research Incubator
Open-Source Software Education & Practice
University Research
Undergrad, Graduate, Postdoc
Open-Source Software Experts
Industry Advisory Board Review
| Spring | Fall | Phase |
| --- | --- | --- |
| Early January | May | Topics of interest input |
| Mid January | June | Call for proposals |
| Mid February | Begin September | Proposals due |
| End February | Mid September | Greenlighting |
| Mid March | Begin October | IAB meeting |
![Page 14: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/14.jpg)
Incubator
Postdocs building dev communities for their research prototypes
Seeded by students working on term, senior, master, or PhD projects
Requirements
• Graduated with Ph.D. and is well-published expert
• Starts out with a significant code base from Ph.D. project
• Leverages at least one well-established OSS community
• Wants to become an OSS leader
Incubator Fellows
• Jeff LeFevre: SkyhookDM – Programmable Storage for Databases (skyhookdm.com)
• Ivo Jimenez: Black Swan – The Practical Reproducibility Platform (falsifiable.us)
• Kate Compton: Tracery 2 & Chancery – Getting poets to program AI (tracery.io)
Community seeding via “Research Experience” Programs:
• Tap into pool of students who need project topics
• CROSS as Google Summer of Code Mentor Organization
• Great community management training
• Great driver for community infrastructure
Evaluation metric: number of contributors from number of organizations
Exit: when external funding becomes available or project fails reviews
Expected runtime: 2-4 years
![Page 15: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/15.jpg)
Incubator creating a market for open source project ideas
Matching mentors, students, and funders
Research Experience
![Page 16: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/16.jpg)
Cutting-edge research projects with plausible paths to successful open source software projects
Research Symposium
Showing off student work at CROSS and other UC Santa Cruz research programs
• Annual 2-day event with 2 tracks of program and “Systems Oktoberfest”, next event: Oct 6-9, 2020*
• Centers technical program around current CROSS research and incubator projects
• Shows off student work and research programs
• Establishes interested communities of students, industry, government, and faculty
• Located at Baskin School of Engineering on UC Santa Cruz campus
• cross.ucsc.edu/symposium
Requirements
• Addresses a fundamental research question
• Is advised by UC Santa Cruz faculty
• Is not required to create any software
• Opens a plausible path to open source software that might be widely adopted
• Has completed coursework required by UC Santa Cruz Ph.D. program
Research Fellows
• Akhil Dixit: CAvSAT - A System for Query Answering over Inconsistent Databases (Advisor: Phokion Kolaitis)
• Xiaowei Chu: Mapping Datasets to Object Storage (Advisor: Carlos Maltzahn)
• Jianshen Liu: Eusocial Storage Devices (Advisor: Carlos Maltzahn)
• Sheng Hong Wang: Live Hardware Development (LiveHD) (Advisor: Jose Renau)
• Aaron Hunter: OSAV – Autonomous Vehicle Controller (Advisor: Gabriel Elkaim) New!
Graduated
• Ivo Jimenez (now incubator fellow): Popper - Practical Falsifiable Research (Advisor: Carlos Maltzahn)
• Noah Watkins (vectorized.io): Zlog - Distributed Shared-log for Software-Defined Storage (Advisor: Carlos Maltzahn)
• Michael Sevilla (TidalScale): Mantle - A Programmable Metadata Load Balancer for the Ceph File System (Advisor: Carlos Maltzahn)
• Brendan Short: Strong Consistency in Dynamic Wireless Networks for Better Navigation of Autonomous Vehicles (Advisor: Ricardo Sanfelice)
![Page 17: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/17.jpg)
What really works in CROSS
• “Ceph as catalyst”: Engagement of industry in research prototypes via open source
• Curate and market open source portfolio to attract talent and industry funding
• Incubator projects: Engagement of top postdoc talent in incubating developer communities
• Maintain advisory committee of world-class open source software experts
• Research Experience: Engagement of students to seed developer communities in incubator
• Promote education to productively engage in open source communities
• Help produce students with cutting-edge skills that are immediately marketable
• Industry Practitioner-in-Residence: Engagement in student mentorship and industry membership drive
• Engage with government, industry, academia, and foundations via “standardization incubator”
• Call for proposals: Topics of interest by industry, research proposals by students/faculty, incubator proposals by postdocs
• Match focused groups of funders, faculty, mentors, and students
New industry-wide innovation platforms
![Page 18: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/18.jpg)
Ceph as catalyst
• As adoption of Ceph goes up, …
• Value of Ceph-based research increases
• Example of CMU’s recent SOSP’19 paper on Ceph’s BlueStore
• Job market value of students increases
• Funding opportunities multiply
• Example of NSF IRIS-HEP, NSF funding for programmable storage
• Ceph as open source software provides framework for new standards
• CROSS as a “standards incubator”
• Component industry is too low-margin to afford research labs, must rely on academia
![Page 19: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/19.jpg)
Universities need to value open source
• Linux, Drupal, Spark, LLVM, Ray, Ceph, BerkeleyDB, …
• Long list of valuable projects that came out of universities
• Actual impact on society – not counted.
• Opportunities for mentorship, funding, industry relations – mostly ignored.
• Research groups are using open source software for teaching and prototyping
• Software itself not considered academic contribution
• Technology transfer offices often don’t know how to value open source
• Time to create Open Source Program Offices for universities!
• Contact me if you are interested!
![Page 20: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/20.jpg)
Questions? Comments? Feedback?
![Page 21: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/21.jpg)
Ceph Overview
RADOS: Storage Servers
Clients
… …
![Page 22: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/22.jpg)
Ceph: Storage servers act collectively
RADOS Storage Service
Clients
… …
![Page 23: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/23.jpg)
Ceph: new storage service by server
RADOS Storage Service
Clients
… …
Name: XYZ
API: Read, Write
Objects
Scope of method call: local object
![Page 24: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/24.jpg)
Ceph: new storage service by server
RADOS Storage Service
Clients
… …
Name: XYZ
API: Read, Write, Foo, Bar
Scope of method call: local object
Custom methods using Ceph CLS (aka custom object classes)
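The idea on this slide (a named object whose API grows from Read/Write to custom methods that run on the server holding the object) can be illustrated with a toy model. This is a minimal sketch in plain Python, not Ceph's actual CLS interface: `StorageServer`, `register`, `execute`, the class name `demo`, and the methods `count_lines` and `append_line` (standing in for the slide's "Foo" and "Bar") are all hypothetical names chosen for illustration.

```python
from typing import Callable, Dict, Tuple

# A custom method sees only one local object: it takes the object's current
# bytes plus an input payload and returns (new object bytes, reply).
Method = Callable[[bytes, bytes], Tuple[bytes, bytes]]


class StorageServer:
    """Toy stand-in for one storage server holding named objects."""

    def __init__(self) -> None:
        self.objects: Dict[str, bytes] = {}
        self.classes: Dict[Tuple[str, str], Method] = {}

    def register(self, cls: str, method: str, fn: Method) -> None:
        # Extend the server's object API with a new (class, method) pair.
        self.classes[(cls, method)] = fn

    def write(self, name: str, data: bytes) -> None:
        self.objects[name] = data

    def read(self, name: str) -> bytes:
        return self.objects[name]

    def execute(self, name: str, cls: str, method: str,
                payload: bytes = b"") -> bytes:
        # Scope of the call is the single object `name` on this server.
        fn = self.classes[(cls, method)]
        new_data, reply = fn(self.objects.get(name, b""), payload)
        self.objects[name] = new_data
        return reply


def count_lines(data: bytes, _payload: bytes) -> Tuple[bytes, bytes]:
    """Hypothetical 'Foo': compute on the object where it is stored."""
    return data, str(data.count(b"\n")).encode()


def append_line(data: bytes, payload: bytes) -> Tuple[bytes, bytes]:
    """Hypothetical 'Bar': modify the object in place on the server."""
    return data + payload + b"\n", b"ok"


server = StorageServer()
server.register("demo", "count_lines", count_lines)
server.register("demo", "append_line", append_line)
server.write("XYZ", b"a\nb\n")
server.execute("XYZ", "demo", "append_line", b"c")
print(server.execute("XYZ", "demo", "count_lines"))  # prints b'3'
```

Only the small reply crosses the network in this model; the object's data stays on the server, which is the point of moving methods to the storage side.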
![Page 25: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/25.jpg)
Ceph: new storage service by server
RADOS Storage Service
… …
Name: XYZ
API: Read, Write, Foo, Bar
Scope of method call: local object
Custom methods using Ceph CLS (aka custom object classes)
RADOS Storage Server
BlueStore
Chunks, K/Vs (RocksDB)
![Page 26: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/26.jpg)
Metadata Service
Ceph: new storage service by client
RADOS Storage Service
Clients
… …
Scope of service: sets of objects
![Page 27: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/27.jpg)
Metadata Service (MDS)
Ceph: new storage service by client
RADOS Storage Service
Clients
…
…
File System Clients (CephFS)
Scope of service: sets of objects
![Page 28: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/28.jpg)
X Service
Ceph: new storage service by client
RADOS Storage Service
Clients
…
…
X Service Clients
Scope of service: sets of objects
![Page 29: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/29.jpg)
CephFS: File Mapping
Strategies:
Striping: e.g. stripe unit, stripe count, object size
Object naming: e.g. prefix = filename, suffix = seq#
Object placement: e.g. CRUSH(hash(objectname))
MDS provides striping and object naming
RADOS provides hash, CRUSH, and clustermap
Diagram labels: Object naming, Striping (here: n, 1, n), hash, CRUSH
• Primary copy replication or erasure coding
• Writes: atomic+transactional
• Given object name: Client can calculate location
• Partial reads/writes of objects
• Automatically redistributes data during failures/add/remove servers
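The three strategies on this slide (striping, object naming, object placement) can be sketched end to end. This is an illustration under stated assumptions, not Ceph code: the `Layout`, `locate`, `object_name`, and `place` names are hypothetical, the name format is only one way to realize "prefix = filename, suffix = seq#", and `place` is a simple highest-hash ranking used as a stand-in for CRUSH, which additionally takes the cluster map and placement rules into account.

```python
import hashlib
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Layout:
    stripe_unit: int   # bytes written to one object before moving to the next
    stripe_count: int  # number of objects striped across in one object set
    object_size: int   # maximum bytes per object (a multiple of stripe_unit)


def locate(offset: int, layout: Layout) -> Tuple[int, int]:
    """Map a file byte offset to (object sequence number, offset in object)."""
    su, sc = layout.stripe_unit, layout.stripe_count
    stripes_per_object = layout.object_size // su
    block = offset // su            # index of the stripe unit within the file
    stripe = block // sc            # which stripe (row) the unit falls in
    stripe_pos = block % sc         # which object (column) within the set
    object_set = stripe // stripes_per_object
    object_no = object_set * sc + stripe_pos
    object_off = (stripe % stripes_per_object) * su + offset % su
    return object_no, object_off


def object_name(prefix: str, object_no: int) -> str:
    """Assumed naming scheme: file-derived prefix plus sequence number."""
    return f"{prefix}.{object_no:08x}"


def place(name: str, servers: List[str], replicas: int = 3) -> List[str]:
    """Stand-in for CRUSH: rank servers by a hash of (object name, server)."""
    def score(server: str) -> int:
        digest = hashlib.sha256(f"{name}/{server}".encode()).digest()
        return int.from_bytes(digest[:8], "big")
    return sorted(servers, key=score, reverse=True)[:replicas]


layout = Layout(stripe_unit=4, stripe_count=2, object_size=8)
object_no, object_off = locate(13, layout)
name = object_name("myfile", object_no)
print(name, object_off, place(name, ["osd.0", "osd.1", "osd.2", "osd.3"]))
```

Because `place` depends only on the object name and the server list, any client holding the same list computes the same location without asking a lookup service, which is the property the slide attributes to CRUSH. If the slide's "(n, 1, n)" is read as (stripe unit, stripe count, object size), which is an assumption, `locate` reduces to cutting the file into consecutive n-byte objects.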
![Page 30: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/30.jpg)
SkyhookDM: Tabular Data
Strategies:
Striping: e.g. stripe unit, stripe count, object size
Object naming: e.g. prefix = filename, suffix = seq#
Object placement: e.g. CRUSH(hash(objectname))
MDS provides striping and object naming
RADOS provides hash, CRUSH, and clustermap
Diagram labels: Object naming, Striping (here: n, 1, n), hash, CRUSH
• Primary copy replication or erasure coding
• Writes: atomic+transactional
• Given object name: Client can calculate location
• Partial reads/writes of objects
• Automatically redistributes data during failures/add/remove servers
Table (schema, operations)
Table
Mapping data and operations
![Page 31: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/31.jpg)
SkyhookDM
What is SkyhookDM?
An object “class” for Ceph
• No upstream modifications required
• Inherits Ceph’s properties now and in the future
• Can use all other object extensions
• Not a database
Storing tabular data in objects
• Columnar data:
• Row data: FlatBuffers
Object read/write operations
• Select, Project, Aggregate
• Create, append rows/columns
• Indexing
• Intra- & inter-object transformations
Figure: tree -d ceph/src
Figure: Growth of mainline object classes, 2010-2018
Carlos Maltzahn, UC Santa Cruz
github.com/uccross/skyhookdm-ceph-cls
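The "Select, Project, Aggregate" operations listed on this slide can be pictured as work done per object, next to the data, with the client only merging small results. The following is a toy sketch of that pattern in plain Python and makes no claim about SkyhookDM's real interfaces or data formats: `scan_object`, `partial_sum`, `merge_mean`, the object names `table.0` and `table.1`, and the sample rows are all hypothetical.

```python
from typing import Callable, Dict, Iterable, List, Tuple

Row = Dict[str, float]
Predicate = Callable[[Row], bool]


def scan_object(rows: Iterable[Row], predicate: Predicate,
                columns: List[str]) -> List[Row]:
    """Runs next to the data: select matching rows, project chosen columns."""
    return [{c: r[c] for c in columns} for r in rows if predicate(r)]


def partial_sum(rows: Iterable[Row], predicate: Predicate,
                column: str) -> Tuple[float, int]:
    """Per-object partial aggregate: (sum, count) of one column."""
    values = [r[column] for r in rows if predicate(r)]
    return sum(values), len(values)


def merge_mean(partials: Iterable[Tuple[float, int]]) -> float:
    """Client side: combine per-object partials into a global mean."""
    total, count = 0.0, 0
    for s, n in partials:
        total += s
        count += n
    return total / count if count else float("nan")


# One table partitioned by rows across two objects (hypothetical data).
objects = {
    "table.0": [{"id": 1, "temp": 10.0}, {"id": 2, "temp": 30.0}],
    "table.1": [{"id": 3, "temp": 50.0}, {"id": 4, "temp": 5.0}],
}


def warm(row: Row) -> bool:
    return row["temp"] >= 10.0


ids = [r for rows in objects.values() for r in scan_object(rows, warm, ["id"])]
mean = merge_mean(partial_sum(rows, warm, "temp") for rows in objects.values())
print(ids, mean)  # prints [{'id': 1}, {'id': 2}, {'id': 3}] 30.0
```

Returning (sum, count) pairs rather than per-object means keeps the merge exact when objects hold different numbers of matching rows.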
![Page 32: Some lessons learned from creating and using the Ceph opern … · 2020-06-11 · About me Adjunct Professor, Step 4, Computer Science & Engineering, UC Santa Cruz Founder & Director,](https://reader034.fdocuments.in/reader034/viewer/2022050518/5fa1f872bfe695335d32d236/html5/thumbnails/32.jpg)
Thanks to: Sage Weil, Scott Brandt, Frank Howley, Lisa Coscarelli, Karen Sandler, Zulah Eckert, Joe Konopelski, Kathy Beattie, Maureen Mclean, Cynthia McCarley, Andrew Shewmaker, Deborah Bryant, Ike Nassi, Linda Rosewood, Karsten Wade, Tom Callaway, Grant Mackey, Ibrahim Haddad, Nicole Ayres, Helena Alshanetsky, Andy Hospodor, Valerie Aurora, Lavinia Preston, Stephanie Lieggi, Alexander Wolf, Jeffrey Jue, Doug Cutting, Nithya Ruff, Nissa Strottman, Jim Whitehead, James Davis, Jeff LeFevre, Ivo Jimenez, Kate Compton, Michael Sevilla, Noah Watkins, Philip Kufeldt, Quincy Wofford, Google Summer of Code Students, and many more!
Thanks to our sponsors: Kioxia, Micron, SK Hynix, Seagate, Western Digital, Huawei, Samsung, Fujitsu
Contact: Carlos Maltzahn, [email protected]