Hive based on big data -- hive actual combat (various problems of statistical film ranking)

Author: duktig Blog: https://duktig.cn (first article) Excellent still work hard. May you enjoy what you give and enjoy what you get. See github knowledge base for more articles: https://github.com/duktig666/knowledge background After learning Hadoop, do you feel that writing a MapReduce program is very complex, and it requires a lot ...

Posted by IceD on Tue, 02 Nov 2021 06:06:37 +0100

Hive of big data foundation -- DDL statement and DML statement

Author: duktig Blog: https://duktig.cn (first article) Excellent still work hard. May you enjoy what you give and enjoy what you get. See github knowledge base for more articles: https://github.com/duktig666/knowledge background After learning Hadoop, do you feel that writing a MapReduce program is very complex, and it requires a lot ...

Posted by KindMan on Tue, 02 Nov 2021 05:46:50 +0100

Hadoop 3. X operation environment construction of big data (hand-in-hand cluster construction)

🌹 Write at the beginning Xiao Yuan began to update Hadoop series teaching articles to introduce you to big data from zero and look forward to your attention (according to the blog notes written by Hadoop 3. X in Silicon Valley) ❤️❤️ First article: Hadoop graphical overview of big data Second article: Hadoop template virtual machine con ...

Posted by misteraven on Thu, 28 Oct 2021 01:39:47 +0200

[big data Java foundation - JVM 03] class loading subsystem class loader subsystem

Class loader subsystem 1.1 what is a class loading subsystem? The Class loader subsystem is responsible for loading Class files from the file system or network. The Class file has a specific file ID (magic number) at the beginning of the file. The Class loader subsystem (ClassLoader) is only responsible for loading the Class file. Whether it ...

Posted by semsem on Sun, 24 Oct 2021 21:58:15 +0200

Azkaban deployment of big data

1, Introduction to Azkaban 1.1 why workflow scheduling system is needed A complete data analysis system is usually composed of a large number of task units: Shell script program, Java program, MapReduce program, Hive script, etc;There are time sequence and before and after dependency among task units;In order to organize such a complex ex ...

Posted by tarun on Sat, 23 Oct 2021 10:00:10 +0200

Spark operator - Python

1, Theoretical basis Spark operators can be divided into: Transformation Transformation/Conversion operator: this transformation does not trigger the submission of the job and completes the intermediate process of the job. Transformation Operations are deferred, that is, from a RDD Transform to generate another RDD The operation is not perfor ...

Posted by blackcow on Fri, 22 Oct 2021 09:24:04 +0200

JDK growth 12: ThreadLocal

In the last section, you understood what ThreadLocal is, its basic usage, and the underlying principle of the get method. In this section, let's continue to study in depth: ThreadLocal set source code principleStrong reference, weak reference, soft reference and virtual reference in JVMApplication of weak reference in ThreadLocalAnalysis of ...

Posted by wmolina on Thu, 21 Oct 2021 01:55:35 +0200

hadoop learning notes: running wordcount to count file strings

I recently built a distributed hadoop environment using four Centos virtual machines, which simply simulated the online hadoop real distributed cluster, which is mainly used for amateur learning big data related systems. One server serves as a NameNode, one as a Secondary NameNode, and the other two as DataNodes node servers, similar to the fo ...

Posted by aaadispatch on Wed, 20 Oct 2021 06:28:42 +0200

Object oriented and advanced syntax of Scala

1, Object oriented 1. Class and object details (1) Class composition structure Constructor, member variable, member method (function), local variable, code block, internal class (2) Constructor scala has two types of constructors: primary and secondaryThe main constructor follows the class name, such as class Student2(val name: String, ...

Posted by tartou2 on Wed, 20 Oct 2021 02:05:53 +0200

Principle analysis of Eureka heartbeat mechanism and automatic protection mechanism

Eureka heartbeat mechanism: After the application starts, the nodes will report to Eureka   Server sends heartbeat. The default cycle is 30 seconds. If Eureka   The server does not receive the heartbeat of a node in multiple heartbeat cycles, Eureka   The server will remove the service node from the service registry (90 seconds ...

Posted by misterguru on Mon, 18 Oct 2021 07:48:43 +0200