Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies...
Transcript of Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies...
![Page 1: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/1.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 39 Institute for Web Science and Technologies · University of Koblenz-Landau, Germany
Lesson6: Modeling the Web as a graph
Unit4: Topology of the Web graph
Rene Pickhardt
Introduction to Web Science Part 2 Emerging Web Properties
![Page 2: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/2.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 40
Completing this unit you should • Understand the notion of a path in a (directed)
graph
• Know that shortest paths between nodes need not be unique
• Understand the notion of a strongly connected component
• Know about the diameter of a graph
• Be aware of the bow tie structure of the Web
![Page 3: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/3.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 41
Paths in a graph • An ordered set is called a path through a graph if:
–
• Example: – is a path – is not a path
P = {v1, . . . , vn}
(vi, vi+1) 2 E, 1 i < n
{a, f, c, d, e, b, h}{a, c, b} ⇢ {a, f, c, d, e, b, h}
![Page 4: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/4.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 42
Shortest paths in a graph • A path between and is called a shortest path if:
– No path between and exists such that
• Example: – is a path but not a shortest one – is a shortest path
P = {v1, . . . , vn}
{a, f, c, d, e, b, h}
v1 vnv1 vnP 0
|P 0| < |P |
{a, f, c, b, h}
![Page 5: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/5.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 43
Useful information on shortest paths • Shortest paths need not to be unique
• Useful algorithms – Breadth first search
• Unweighted graphs – Dijkstra Algorithm
• Weighted graphs (positive weights) – Bellman Ford Algorthm
• negative weights – Floyd Warshall
• All pair shortest paths
![Page 6: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/6.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 44
(Strongly)Connected components • A Set of vertices such that
–
– is a path
–
– Such that is a bath between and is called strongly connected component in a directed and connected component in an undirected graph
C
8u, v 2 C9P ⇢ C
P
@v0 2 V � C : 9c 2 C : 9P
P c v0
![Page 7: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/7.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 45
Strongly conected components • How many strongly connected components
exist on Simple English Wikipedia? – It is a good exercise to count them on the html
dump we provided
• Size of the largest connected component – 54‘524 vertices
• The largest strongly connected component is usually the part of the web we experience
![Page 8: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/8.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 46
Bow tie structure of the Web
![Page 9: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/9.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 47
• Each pair of nodes in a SCC is connected by a path
• There exists a shortest path between each pair of nodes
• The length of the longest shortest path between any two nodes within a SCC is called its diameter.
• How large do you expect the diameter of Simple Wiki‘s SCC?
Diameter: Describing the strongly connected component (SCC)
![Page 10: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/10.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 48
Distance distribution of nodes from the SCC
![Page 11: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/11.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 49
• Size and Volume • Degree distributions • Clustering coefficients • (effective) Diameter • (Strongly) connected componentns • Bow Tie structure • Assortativity • Lorenz curve, Gini coefficient • ...
Statistical measures for graphs that you should know
![Page 12: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/12.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 50 Institute for Web Science and Technologies · University of Koblenz-Landau, Germany
Thank you for your attention!
Contact: Rene Pickhardt Institute for Web Science and Technologies Universität Koblenz-Landau [email protected]
![Page 13: Topology of the web graph - Wikimedia...Rene PickhardtInstitute for Web Science and Technologies CC-BY-SA-3.0 Modeling the Web as a Graph · University of Koblenz-Landau, Germany 39](https://reader034.fdocuments.in/reader034/viewer/2022052015/602cc9a9a23d10589a4424e0/html5/thumbnails/13.jpg)
Rene Pickhardt CC-BY-SA-3.0 Modeling the Web as a Graph 51
• https://commons.wikimedia.org/wiki/File:6n-graf.svg By User:AzaToth (Image:6n-graf.png simlar input data) [Public domain], via Wikimedia Commons • https://pixabay.com/en/lightbulb-electric-light-bright-idea-31254/ Public domain • https://commons.wikimedia.org/wiki/File:Simple-bipartite-graph.svg By MistWiz (Own work) [Public domain], via Wikimedia Commons • https://commons.wikimedia.org/wiki/File:CPT-Graphs-undirected-weighted-loop-multiedge.svg By Pluke (Own work) [CC0], via Wikimedia Commons • https://commons.wikimedia.org/wiki/File:Matrix.svg By Lakeworks (Own work) [GFDL (http://www.gnu.org/copyleft/fdl.html) or CC BY-SA 4.0-3.0-2.5-2.0-1.0 (http://
creativecommons.org/licenses/by-sa/4.0-3.0-2.5-2.0-1.0)], via Wikimedia Commons • https://commons.wikimedia.org/wiki/File:Ego_network.png By Houl0078 (Own work) [CC0], via Wikimedia Commons • https://commons.wikimedia.org/wiki/File:Path-graph-theory.svg By MartinThoma (Own work) [GFDL (http://www.gnu.org/copyleft/fdl.html) or CC BY 3.0 (http://
creativecommons.org/licenses/by/3.0)], via Wikimedia Commons • https://commons.wikimedia.org/wiki/File:%22Bow-tie%22_diagram_of_components_in_a_directed_network_SVG.svg By Limaner (Own work) [CC BY-SA 4.0 (http://
creativecommons.org/licenses/by-sa/4.0)], via Wikimedia Commons • https://commons.wikimedia.org/wiki/File:Economics_Gini_coefficient2.svg By Reidpath [Public domain], via Wikimedia Commons • http://konect.uni-koblenz.de/statistics/volume • https://www.flickr.com/photos/dannysullivan/369540238 via flickr CC-BY 2.0 by Danny Sullivan • Graph statistics via http://konect.uni-koblenz.de/networks/link-dynamic-simplewiki
• https://commons.wikimedia.org/wiki/File:WWW_logo_by_Robert_Cailliau.svg public domain • https://commons.wikimedia.org/wiki/File:WorldWideWeb.1.png public domain • https://en.wikipedia.org/w/index.php?title=World_Wide_Web&oldid=692731715 (definition of world wide web) • https://commons.wikimedia.org/wiki/File:Visualization_of_the_world_wide_web_common_crawl_2012.png By Sebastian Schelter [CC BY-SA 3.0 (http://creativecommons.org/
licenses/by-sa/3.0)], via Wikimedia Commons • https://commons.wikimedia.org/wiki/File:WorldWideWebAroundWikipedia.png Chris 73 / Wikimedia Commons [GFDL 1.3 (www.gnu.org/licenses/fdl-1.3.html) or CC BY-SA 3.0
(http://creativecommons.org/licenses/by-sa/3.0)], via Wikimedia Commons • https://commons.wikimedia.org/wiki/File:Word_Cloud_Readers_Survey.jpg By Manipande (Own work) [CC BY-SA 3.0 (http://creativecommons.org/licenses/by-sa/3.0)], via
Wikimedia Commons • https://www.flickr.com/photos/valeriebb/290711738 via Flickr and Valerie Everett • https://commons.wikimedia.org/wiki/File:Inner-product-angle.png CC-BY-SA by CSTAR & Oleg Alexandrov • https://commons.wikimedia.org/wiki/File:Synoptic_word-for-word.png By Alecmconroy (Own work) [GFDL (http://www.gnu.org/copyleft/fdl.html) or CC BY 3.0 (http://
creativecommons.org/licenses/by/3.0)], via Wikimedia Commons
Pictures: