Hive code analysis report: semantic analysis ③

2021SC@SDUSC catalogue summary Semantic analyzer class analysis: ① From analyzeInternal(ASTNode ast) to genResolvedParseTree(ASTNode, planercontext) summary Earlier, I made a simple analysis of the basesemanticananalyzer class and learned that this class is the base class of each semantic analyzer, and its derived subclasses include sema ...

Posted by Ludo Lambrechts on Tue, 16 Nov 2021 10:57:20 +0100

CDH environment construction

CDH is the most complete and tested popular distribution of Apache Hadoop and related projects. CDH provides the core elements of Hadoop (scalable storage and distributed computing), as well as Web-based user interface and important enterprise functions. Installing virtual machine CentOs I installed the version of CentOs7 myself. There are ...

Posted by mahaguru on Thu, 11 Nov 2021 00:00:32 +0100

Hadoop getting started note 23: MapReduce performance optimization - file types

1, Overview of MapReduce performance optimization1. Application scenario of MapReduceHadoop includes HDFS, an open source implementation of GFS (Hadoop distributed file system) and the open source implementation of MapReduce framework. Hadoop has attracted the attention of enterprises and academia. Many companies and technical groups such as Ya ...

Posted by ded on Sun, 07 Nov 2021 04:51:42 +0100

Practice of running Hadoop WordCount program locally

^_^ 1. Configure local hadoop Hadoop 2.7.5 link: https://pan.baidu.com/s/12ef3m0CV21NhjxO7lBH0Eg Extraction code: hhhh Unzip the downloaded hadoop package to disk D for easy search Then right-click the computer and click Properties → click Advanced system settings on the right → click environment variables → select the Path b ...

Posted by hori76 on Fri, 05 Nov 2021 19:53:07 +0100

Hive based on big data -- hive actual combat (various problems of statistical film ranking)

Author: duktig Blog: https://duktig.cn (first article) Excellent still work hard. May you enjoy what you give and enjoy what you get. See github knowledge base for more articles: https://github.com/duktig666/knowledge background After learning Hadoop, do you feel that writing a MapReduce program is very complex, and it requires a lot ...

Posted by IceD on Tue, 02 Nov 2021 06:06:37 +0100

[pit filling trip hadoop-05] Hadoop 2.10.1 Mr custom sorting grouping FlowSumArea/SortMR Partitioner/ WritableComparable inverted index

job submission process - supplemental map task returns the processing result to MRAPPmaster reduce task combines the map processing results, processes the operations, and then outputs them 1. Steps to realize Zoning: 1.1 First analyze the specific business logic to determine how many partitions there are 1.2 First write a class that inher ...

Posted by zhabala on Mon, 01 Nov 2021 01:05:59 +0100

Hadoop 3. X operation environment construction of big data (hand-in-hand cluster construction)

🌹 Write at the beginning Xiao Yuan began to update Hadoop series teaching articles to introduce you to big data from zero and look forward to your attention (according to the blog notes written by Hadoop 3. X in Silicon Valley) ❤️❤️ First article: Hadoop graphical overview of big data Second article: Hadoop template virtual machine con ...

Posted by misteraven on Thu, 28 Oct 2021 01:39:47 +0200

hadoop basic configuration and pseudo distribution implementation

hadoop basic configuration and pseudo distribution implementation catalogue 1, Basic environmental preparation Operating system preparation Installing ubuntu virtual machines Mirror address: https://mirrors.nju.edu.cn/ubuntu-releases/18.04/ubuntu-18.04.6-desktop-amd64.iso If docker is used docker pull ubuntu:18.04 Source change ...

Posted by FFEMTcJ on Tue, 26 Oct 2021 16:07:53 +0200

Hadoop, zookeeper, spark installation

New Folder: Compressed Package Folder, Software Installation Directory Folder The following does not indicate which host operations are all Master host operations # Recursively Create Compressed Package Folder mkdir -p /usr/tar # Recursively create the software installation directory folder mkdir -p /usr/apps Install upload and color code c ...

Posted by Skepsis on Thu, 21 Oct 2021 15:31:14 +0200

hadoop learning notes: running wordcount to count file strings

I recently built a distributed hadoop environment using four Centos virtual machines, which simply simulated the online hadoop real distributed cluster, which is mainly used for amateur learning big data related systems. One server serves as a NameNode, one as a Secondary NameNode, and the other two as DataNodes node servers, similar to the fo ...

Posted by aaadispatch on Wed, 20 Oct 2021 06:28:42 +0200