CiteULike is a free online bibliography manager. Register and you can start organising your references online.
Tags

The Hadoop Distributed File System

by: K. Shvachko, Hairong Kuang, S. Radia, R. Chansler
In Mass Storage Systems and Technologies (MSST), 2010 IEEE 26th Symposium on (May 2010), pp. 1-10, doi:10.1109/msst.2010.5496972  Key: citeulike:8493345

Formatted Citation


Show HTML

Likes (beta)

This copy of the article hasn't been liked by anyone yet.

View FullText article


Abstract

The Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably, and to stream those data sets at high bandwidth to user applications. In a large cluster, thousands of servers both host directly attached storage and execute user application tasks. By distributing storage and computation across many servers, the resource can grow with demand while remaining economical at every size. We describe the architecture of HDFS and report on experience using HDFS to manage 25 petabytes of enterprise data at Yahoo!.


qfzhang's tags for this article

Citations (CiTO)

No CiTO relationships defined

X There is 1 review Average rating 4.0

X Find related articles from these CiteULike users

X Find related articles with these CiteULike tags

X Posting History


X Export records

Privacy Statement | Terms & Conditions
CiteULike organises scholarly (or academic) papers or literature and provides bibliographic (which means it makes bibliographies) for universities and higher education establishments. It helps undergraduates and postgraduates. People studying for PhDs or in postdoctoral (postdoc) positions. The service is similar in scope to EndNote or RefWorks or any other reference manager like BibTeX, but it is a social bookmarking service for scientists and humanities researchers.