Ingesting High-Velocity Streaming Graphs from Social Media Sources

Dasgupta, Subhasis; Bagchi, Aditya; Gupta, Amarnath

Computer Science > Databases

arXiv:1905.08337 (cs)

[Submitted on 20 May 2019]

Title:Ingesting High-Velocity Streaming Graphs from Social Media Sources

Authors:Subhasis Dasgupta, Aditya Bagchi, Amarnath Gupta

View PDF

Abstract:Many data science applications like social network analysis use graphs as their primary form of data. However, acquiring graph-structured data from social media presents some interesting challenges. The first challenge is the high data velocity and bursty nature of the social media data. The second challenge is that the complex nature of the data makes the ingestion process expensive. If we want to store the streaming graph data in a graph database, we face a third challenge -- the database is very often unable to sustain the ingestion of high-velocity, high-burst data. We have developed an adaptive buffering mechanism and a graph compression technique that effectively mitigates the problem. A novel aspect of our method is that the adaptive buffering algorithm uses the data rate, the data content as well as the CPU resources of the database machine to determine an optimal data ingestion mechanism. We further show that an ingestion-time graph-compression strategy improves the efficiency of the data ingestion into the database. We have verified the efficacy of our ingestion optimization strategy through extensive experiments.

Subjects:	Databases (cs.DB); Machine Learning (cs.LG); Social and Information Networks (cs.SI)
Cite as:	arXiv:1905.08337 [cs.DB]
	(or arXiv:1905.08337v1 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.1905.08337

Submission history

From: Subhasis Dasgupta [view email]
[v1] Mon, 20 May 2019 20:29:44 UTC (4,949 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.SI

< prev | next >

new | recent | 2019-05

Change to browse by:

cs
cs.DB
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Subhasis Dasgupta
Aditya Bagchi
Amarnath Gupta

export BibTeX citation

Computer Science > Databases

Title:Ingesting High-Velocity Streaming Graphs from Social Media Sources

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Databases

Title:Ingesting High-Velocity Streaming Graphs from Social Media Sources

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators