Principle and application of Hadoop Technology
Hadoop data processing (sophomore training in 2020)
1, Project background
The training content is the statistical analysis of automobile sales data. Through this project, we will deepen our understanding of HDFS distributed file system and MapReduce distributed parallel computing framework, master and apply them skillfully, experience the dev ...
Posted by Garcia on Sat, 19 Feb 2022 11:54:35 +0100
MapReduce processing pictures
Reference link 1 Reference link 2 The code comes from link 2 and has been modified by yourself. The level is limited. I hope to point out some mistakes.
hadoop3. 2.1 write code under centos 7 window, package and submit it to Hadoop cluster on centos for operation. ideas: put the picture on hdfs, and then write the path of each im ...
Posted by benyhanna on Fri, 18 Feb 2022 06:16:31 +0100
Hadoop[03-03] access count test based on DFS and ZKFC (Hadoop 2.0)
Hadoop[03-03] access count test based on DFS and ZKFC (Hadoop 2.0)
Prepare the environment
Prepare multiple virtual machines and start dfs and zookeeper See link for details: Hadoop2.0 start DFS and Zookeeper
Some data of multiple virtual machines are as follows
numberhost nameHost domain nameip address①ToozkyToozky192.168.64.220②Toozky2T ...
Posted by chantown on Fri, 11 Feb 2022 01:20:17 +0100
Distributed computing framework Map/reduce
Introduction:
MapReduce is a cluster based high-performance parallel computing platform. MapReduce is a software framework for parallel computing and operation. MapReduce is a parallel programming model and methodcharacteristic:
① The distribution is reliable. The operation of the data set is distributed to multiple nodes in the cluster to ac ...
Posted by warren on Thu, 10 Feb 2022 19:39:51 +0100
Hadoop installation complete
HADOOP installation Linux stand-alone
Download Hadoop
Hadoop3.xx download address: http://archive.apache.org/dist/hadoop/common/hadoop-3.1.3/
Upload to Linux via FTP
Decompression software
tar -zxvf hadoop-3.1.3.tar.gz -C /opt/module/
Configure HADOOP environment variables
Create custom profile
vim /etc/prof ...
Posted by cesarcesar on Mon, 07 Feb 2022 09:17:24 +0100
Hadoop ecosystem - Introduction and basic theory of MapReduce
preface
Part of the content is extracted from the training materials of Shang Silicon Valley, dark horse and so on
1. Get to know MapReduce
1.1 understand MapReduce idea
MapReduce thought can be seen everywhere in life, and everyone has been exposed to it more or less. The core idea of MapReduce is "divide and then combine ...
Posted by mkili on Thu, 03 Feb 2022 16:06:08 +0100
Exceptions and solutions when Hadoop runs MapReduce task
Exception code description
Just beginning to contact Hadoop,about MapReduce From time to time, I especially understand that the following records the problems and solutions that have been tangled for a day
1. Execute MapReduce task
hadoop jar wc.jar hejie.zheng.mapreduce.wordcount2.WordCountDriver /input /output
2. Jump out of exception ...
Posted by burzvingion on Thu, 03 Feb 2022 14:16:53 +0100
hadoop2.6.5 Mapper class source code analysis
Mapper class
//
// Source code recreated from a .class file by IntelliJ IDEA
// (powered by Fernflower decompiler)
//
package org.apache.hadoop.mapreduce;
import java.io.IOException;
import org.apache.hadoop.classification.InterfaceAudience.Public;
import org.apache.hadoop.classification.InterfaceStability.Stable;
@Public
@Stable
public cla ...
Posted by warydig on Mon, 31 Jan 2022 03:11:39 +0100
Big data learning road Hadoop
1. Introduction to big data
1.1 big data concept
big data refers to a data set that cannot be captured, managed and processed by conventional software tools within a certain time range. It is a massive, high growth rate and diversified information asset that requires a new processing mode to have stronger decision-making power, insight an ...
Posted by monkuar on Sat, 29 Jan 2022 15:27:44 +0100
Eclipse builds Hadoop environment and actual resource sharing
First, build haoop2.0 of eclipse 7.1 development environment, the resources used are linked as follows:
Install Hadoop 2.0 for windows 7.1 environment
Building hadoop development environment under eclipse
In this way, we can develop hadoop in eclipse
catalogue
1, Introduction to MapReduce model
1. Map and Reduce functions
2. MapReduce a ...
Posted by clown[NOR] on Fri, 21 Jan 2022 03:56:50 +0100