copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Apache hadoop goes realtime at Facebook

D. Borthakur, J. Gray, J. Sarma, K. Muthukkaruppan, N. Spiegelberg, H. Kuang, K. Ranganathan, D. Molkov, A. Menon, S. Rash, R. Schmidt, and A. Aiyer. Proceedings of the 2011 international conference on Management of data, page 1071--1080. New York, NY, USA, ACM, (2011)
DOI: 10.1145/1989323.1989438

Abstract

Facebook recently deployed Facebook Messages, its first ever user-facing application built on the Apache Hadoop platform. Apache HBase is a database-like layer built on Hadoop designed to support billions of messages per day. This paper describes the reasons why Facebook chose Hadoop and HBase over other systems such as Apache Cassandra and Voldemort and discusses the application's requirements for consistency, availability, partition tolerance, data model and scalability. We explore the enhancements made to Hadoop to make it a more effective realtime system, the tradeoffs we made while configuring the system, and how this solution has significant advantages over the sharded MySQL database scheme used in other applications at Facebook and many other web-scale companies. We discuss the motivations behind our design choices, the challenges that we face in day-to-day operations, and future capabilities and improvements still under development. We offer these observations on the deployment as a model for other companies who are contemplating a Hadoop-based solution over traditional sharded RDBMS deployments.

Description

Apache hadoop goes realtime at Facebook

Links and resources

BibTeX key: Borthakur2011
entry type: inproceedings
address: New York, NY, USA
booktitle: Proceedings of the 2011 international conference on Management of data
year: 2011
pages: 1071--1080
publisher: ACM
series: SIGMOD '11
acmid: 1989438
location: Athens, Greece
isbn: 978-1-4503-0661-4
numpages: 10
DOI: 10.1145/1989323.1989438
url: http://doi.acm.org/10.1145/1989323.1989438

@stroeh's tags highlighted

Cite this publication

@inproceedings{Borthakur2011, abstract = {Facebook recently deployed Facebook Messages, its first ever user-facing application built on the Apache Hadoop platform. Apache HBase is a database-like layer built on Hadoop designed to support billions of messages per day. This paper describes the reasons why Facebook chose Hadoop and HBase over other systems such as Apache Cassandra and Voldemort and discusses the application's requirements for consistency, availability, partition tolerance, data model and scalability. We explore the enhancements made to Hadoop to make it a more effective realtime system, the tradeoffs we made while configuring the system, and how this solution has significant advantages over the sharded MySQL database scheme used in other applications at Facebook and many other web-scale companies. We discuss the motivations behind our design choices, the challenges that we face in day-to-day operations, and future capabilities and improvements still under development. We offer these observations on the deployment as a model for other companies who are contemplating a Hadoop-based solution over traditional sharded RDBMS deployments.}, acmid = {1989438}, added-at = {2011-07-08T15:15:52.000+0200}, address = {New York, NY, USA}, author = {Borthakur, Dhruba and Gray, Jonathan and Sarma, Joydeep Sen and Muthukkaruppan, Kannan and Spiegelberg, Nicolas and Kuang, Hairong and Ranganathan, Karthik and Molkov, Dmytro and Menon, Aravind and Rash, Samuel and Schmidt, Rodrigo and Aiyer, Amitanand}, biburl = {https://www.bibsonomy.org/bibtex/2a63e92a70fb2fec45926e6a66f5f3f81/stroeh}, booktitle = {Proceedings of the 2011 international conference on Management of data}, description = {Apache hadoop goes realtime at Facebook}, doi = {10.1145/1989323.1989438}, interhash = {99f6a46f9bb424e63ee898ebbc13f13c}, intrahash = {a63e92a70fb2fec45926e6a66f5f3f81}, isbn = {978-1-4503-0661-4}, keywords = {comparison hadoop hbase rdms}, location = {Athens, Greece}, numpages = {10}, pages = {1071--1080}, publisher = {ACM}, series = {SIGMOD '11}, timestamp = {2011-07-08T15:15:52.000+0200}, title = {Apache hadoop goes realtime at Facebook}, url = {http://doi.acm.org/10.1145/1989323.1989438}, year = 2011 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Apache hadoop goes realtime at Facebook

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Apache hadoop goes realtime at Facebook

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Apache hadoop goes realtime at Facebook

Comments and Reviews
(0)