Detecting Extraneous Content in Podcasts

Reddy, Sravana; Yu, Yongze; Pappu, Aasish; Sivaraman, Aswin; Rezapour, Rezvaneh; Jones, Rosie

arXiv:2103.02585 (cs)

[Submitted on 3 Mar 2021]

Abstract: Podcast episodes often contain material extraneous to the main content, such as advertisements, interleaved within the audio and the written descriptions. We present classifiers that leverage both textual and listening patterns in order to detect such content in podcast descriptions and audio transcripts. We demonstrate that our models are effective by evaluating them on the downstream task of podcast summarization and show that we can substantively improve ROUGE scores and reduce the extraneous content generated in the summaries.

Comments:	EACL 2021
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2103.02585 [cs.CL]
	(or arXiv:2103.02585v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2103.02585

Submission history

From: Sravana Reddy
[v1] Wed, 3 Mar 2021 18:30:50 UTC (114 KB)
