Post on 03-Mar-2017
TextRank: Bringing Order into Texts
Rada Mihalcea and Paul Tarau
Presented by:
Sharath T.S.
Shubhangi Tandon
The TextRank Algorithm
1. Identify text units that best define the task at hand, and add them as vertices in the graph.
2. Identify relations that connect such text units, and use these relations to draw edges between vertices in the graph. Edges can be directed or undirected, weighted or unweighted.
3. Iterate the graph-based ranking algorithm until convergence.
4. Sort vertices based on their final score. Use the values attached to each vertex for ranking/selection decisions.
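The four steps above can be sketched as a PageRank-style iteration. This is a minimal illustration on a toy graph (the graph, damping value, and tolerance are illustrative assumptions, not from the paper):

```python
# Minimal TextRank iteration sketch (unweighted graph).
# graph: dict mapping each vertex to the set of vertices it links to.

def textrank(graph, d=0.85, tol=1e-6, max_iter=100):
    scores = {v: 1.0 for v in graph}          # step: initialise all vertices
    for _ in range(max_iter):
        prev = dict(scores)                   # synchronous update
        for v in graph:
            # S(Vi) = (1 - d) + d * sum over linking Vj of S(Vj) / |Out(Vj)|
            scores[v] = (1 - d) + d * sum(
                prev[u] / len(graph[u]) for u in graph[v] if graph[u]
            )
        # step 3: iterate until convergence
        if max(abs(scores[v] - prev[v]) for v in graph) < tol:
            break
    return scores                             # step 4: sort/select by score

# Toy undirected graph: "a" is connected to both other vertices.
g = {"a": {"b", "c"}, "b": {"a"}, "c": {"a"}}
ranks = textrank(g)
# "a" receives the highest score, since both other vertices link to it
```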
The TextRank Model
■ G = (V, E)
■ V = set of vertices, E = set of edges
■ In(V_i) = set of vertices with edges pointing to V_i (incoming edges)
■ Out(V_i) = set of vertices that V_i points to (outgoing edges)
■ d = damping factor
■ In addition, for the weighted model: W = set of edge weights
■ Note: for undirected graphs, In(V_i) = Out(V_i)
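With these definitions, the weighted ranking formula used by TextRank (PageRank adapted to take edge weights into account) is:

```latex
WS(V_i) = (1 - d) + d \sum_{V_j \in In(V_i)}
          \frac{w_{ji}}{\sum_{V_k \in Out(V_j)} w_{jk}} \, WS(V_j)
```

For unweighted graphs all w are equal and this reduces to the original PageRank update.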
Convergence
Convergence curves for four different kinds of graphs: directed vs. undirected, weighted vs. unweighted.
Keyword Extraction
How is the graph built?
● Each word (lexical unit) is a node.
● Co-occurrence relation: two vertices are connected if their corresponding lexical units co-occur within a window of at most N words, where N can be set anywhere from 2 to 10.
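A minimal sketch of building such a co-occurrence graph. The tokenisation and window size here are illustrative choices; the paper additionally filters candidate words by part of speech, which is omitted:

```python
def cooccurrence_graph(words, window=2):
    """Connect two words if they co-occur within `window` tokens of each other."""
    edges = set()
    for i in range(len(words)):
        for j in range(i + 1, min(i + window, len(words))):
            if words[i] != words[j]:
                # store undirected edges as sorted tuples to avoid duplicates
                edges.add(tuple(sorted((words[i], words[j]))))
    return edges

tokens = "compatibility of systems of linear constraints".split()
graph = cooccurrence_graph(tokens, window=2)
# window=2 links only adjacent words, e.g. ("compatibility", "of")
```

The resulting edge set can then be fed to the ranking iteration above; larger windows produce denser graphs.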
Example
Results for Keyword Extraction
Sentence Extraction
● The goal is to rank entire sentences, so each vertex is a sentence.
● Co-occurrence cannot be used. Why?
● We need a new relation for the edges: similarity.
● Measured as content overlap between two sentences (nodes).
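The paper's overlap measure normalises the number of shared words by the sentence lengths. A minimal sketch (whitespace tokenisation is a simplification):

```python
from math import log

def similarity(s1, s2):
    """TextRank sentence similarity:
    |common words| / (log|S1| + log|S2|)."""
    w1 = set(s1.lower().split())
    w2 = set(s2.lower().split())
    overlap = len(w1 & w2)
    # normalise by sentence lengths to avoid favouring long sentences
    denom = log(len(s1.split())) + log(len(s2.split()))
    return overlap / denom if denom > 0 else 0.0

a = "textrank ranks sentences in a graph"
b = "a graph of sentences is ranked"
score = similarity(a, b)  # 3 shared words, normalised
```

These pairwise scores become the edge weights of the sentence graph, which is then ranked with the weighted formula.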
Evaluation
● Single-document summarisation
● Data: DUC 2002, 567 news articles
● Evaluation metric: ROUGE
● Compared against 15 systems, including the baseline provided by DUC
Results
● Highly dense graph
● Output compared to human summaries
Comparison - TextRank and Opinosis
● Both are unsupervised graph-based algorithms.
● Both try to identify the most-traversed regions of the graph (nodes/paths), i.e. the topics or content discussed most.
● TextRank uses node importance (of words and sentences) for keyword extraction and summarisation, whereas Opinosis uses path weights across word nodes to generate fine-grained summaries.
Observations
1. Common pattern: use of text-unit co-occurrence as a feature in unsupervised topic-modelling algorithms (LDA, BTM, TextRank).
2. Future work: http://web.fi.uba.ar/~fbarrios/tprofesional/articulo-en.pdf
3. Industry adoption: included as a module in gensim.