CiteULike is a free online bibliography manager. Register and you can start organising your references online.

MapReduce: simplified data processing on large clusters

Commun. ACM, Vol. 51, No. 1. (January 2008), pp. 107-113.

X Abstract

MapReduce is a programming model and an associated implementation for processing and generating large datasets that is amenable to a broad variety of real-world tasks. Users specify the computation in terms of a map and a reduce function, and the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks. Programmers find the system easy to use: more than ten thousand distinct MapReduce programs have been implemented internally at Google over the past four years, and an average of one hundred thousand MapReduce jobs are executed on Google's clusters every day, processing a total of more than twenty petabytes of data per day.

View the full article here:

ACM, DOI

This article has been bookmarked 28 times, initially on 2008-01-10.

2009-11-25 User imrchen
2009-11-11 User myui
2009-10-21 Group large-scale-ml
2009-10-05 User jweslley
2009-08-17 User zhaomin
2009-06-20 User yingbo
2009-06-18 User akshayk
2009-05-20 User mfisk
2009-04-08 User rijo
2009-02-09 User pprett
2009-01-22 User urvoy
2009-01-06 User verma7
2008-11-23 User jborn
2008-10-01 User jorritschippers
2008-07-22 User aespinosa
2008-06-24 User jliegl
2008-06-01 User quanpt
2008-05-16 User mpotamias
2008-05-12 User dmeister
2008-03-28 User thiagomanel
Group Desktop Data Grid
2008-02-27 User echi
2008-02-18 User knowlengr , 1 note

Consider using this for IP traffic data reduction / visualization

2008-02-18 00:48:13
2008-01-22 User wentrue
2008-01-14 User jwong
2008-01-10 User dullhunk
User arsyed
User mircea
Privacy Statement | Terms & Conditions
CiteULike organises scholarly (or academic) papers or literature and provides bibliographic (which means it makes bibliographies) for universities and higher education establishments. It helps undergraduates and postgraduates. People studying for PhDs or in postdoctoral (postdoc) positions. The service is similar in scope to EndNote or RefWorks or any other reference manager like BibTeX, but it is a social bookmarking service for scientists and humanities researchers.