Renjitha K for renjithak.hashnode.net · Mar 30, 2023 · Demystifying Big Data Analytics with Apache Spark: Part-1 · Posted by Renjitha K in Renjitha K's Blog on Mar 25, 2023 2:27:13 PM. As the amount of data generated by individuals and businesses continues to grow exponentially, the need for technologies like Apache Spark that can process and analyze large dataset... · 2 likes · 101 reads · Tags: spark
Ziyun for Ziyun's blog (ziyun.hashnode.net) · Jul 26, 2022 · Paper review: MapReduce · Recently, I started a study group to work through the famous MIT 6.824: Distributed Systems together with some friends. The course includes a number of readings that are closely related to the labs. Instead of simply skimming through the papers, this... · 57 reads · Tags: Paper Review, distributed system
Ashwin Telmore for My Notes (ashwintelmore.hashnode.net) · Dec 7, 2022 · Run a basic word count map reduce program to understand map reduce paradigm · Java code: create a .java file and paste the following code: import java.io.IOException; import java.util.StringTokenizer; import org.apache.hadoop.conf.Configuration; import org.apache.hadoop.fs.Path; import org.apache.hadoop.io.IntWritable; import org.... · Tags: Big data Analytics, Practical
Renjitha K for renjithak.hashnode.net · Mar 27, 2023 · Understanding MapReduce: A Beginners Guide · Most of us have been hearing the term MapReduce for a long while now, and I have been wondering what it means. Let's try to understand the basics. MapReduce is a powerful programming model and software framework for processing larg... · 173 reads · Tags: BigData
Rishabh Bassi for Bussie (rishabhbassi.hashnode.net) · Feb 19, 2023 · Parallel Computing Decision Factors · Parallel computing is the use of multiple processors or computers to solve a computational problem simultaneously. The technique has become increasingly popular as a means of accelerating the performance of computationally intensive tasks, such as si... · Tags: #parallel computing
padmanabha reddy for Padmanabha's (padmanabha.hashnode.net) · Jan 8, 2023 · Distributed computing framework - MapReduce · What is MapReduce? MapReduce is a software framework for processing large data sets that are distributed over several machines. MapReduce facilitates concurrent processing by splitting petabytes of data into smaller chunks and processing them in para... · Tags: mapreduce
Yash Srivastava for yash722.hashnode.net · Dec 25, 2022 · MapReduce in Hadoop · Although MapReduce is not used much for Big Data problems nowadays because of its poor performance compared to Spark, it is still a very good way to understand how distributed computing works in Big Data. What is Hadoop MapReduce? H... · 41 reads · Tags: mapreduce
Vijay Saravana Jaishanker for Flyte Blog (acidic-committee-improve-52.hashnode.net) · Oct 4, 2022 · Flyte Map Tasks: A Simpler Alternative to Apache Spark · By Vijay Saravana Jaishanker, Alex Bain and Varsha Parthasarathy. Vijay Saravana Jaishanker is an intern at Woven Planet in the Machine Learning Frameworks Team. Vijay and his colleagues Alex Bain, Alex Pozimenko and Varsha Parthasarathy are helping t... · 1.2K reads · Tags: spark