Day 20 of learning big data - Collection, list

Day 20 of learning big data: Collection and List. (I spent the New Year holiday with my family and had no time to keep up these study notes; daily updates resume from today.) Studying step by step along the map above gets twice the result with half the effort. Collection The differ ...

Posted by danaman on Fri, 11 Feb 2022 05:20:04 +0100

Basic operations of Spark Streaming in PySpark

Preface: Stream data has the following characteristics: • data arrives quickly and continuously, and its potential size may be unbounded • data sources are numerous and formats are complex • the data volume is large, but storage is not a major concern; once processed, it will either be discar ...

Posted by andrew10181 on Fri, 11 Feb 2022 04:41:05 +0100

Database lab challenge TASK-3 (Alibaba Cloud)

Brief introduction: This scenario shows how to use AnalyticDB to configure visualizations of student grade data, generate a big-screen display and dashboard of the grade distribution with one click, and produce periodic grade reports through task orchestration. Related Experience Lab: Developers master what and how of cl ...

Posted by freakus_maximus on Fri, 11 Feb 2022 04:07:53 +0100

The three most common ways to parse XML in Python

XML (eXtensible Markup Language) is designed to transmit and store data. It has increasingly become the core of many new technologies and is applied differently in different fields. It is the inevitable product of the web reaching a certain stage of development. It not only has the core characteristics ...

Posted by westair on Fri, 11 Feb 2022 03:05:52 +0100
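
The teaser for the XML post above is cut off before it names its three approaches, so the following is only a minimal sketch under an assumption: it uses two parsers from Python's standard library, `xml.etree.ElementTree` and `xml.dom.minidom`, which may or may not be the ones the post covers. The sample document and both function names are hypothetical, invented for illustration.

```python
import xml.etree.ElementTree as ET
from xml.dom import minidom

# Hypothetical sample document, invented for this illustration.
SAMPLE = """<catalog>
  <book id="b1"><title>Spark Basics</title></book>
  <book id="b2"><title>Kafka Notes</title></book>
</catalog>"""


def titles_with_elementtree(xml_text):
    """Map each book id to its title using the ElementTree API."""
    root = ET.fromstring(xml_text)
    return {book.get("id"): book.findtext("title") for book in root.iter("book")}


def titles_with_minidom(xml_text):
    """Build the same mapping using the DOM-style minidom API."""
    doc = minidom.parseString(xml_text)
    result = {}
    for book in doc.getElementsByTagName("book"):
        title_node = book.getElementsByTagName("title")[0]
        # The text of <title> is stored in its first child, a text node.
        result[book.getAttribute("id")] = title_node.firstChild.data
    return result
```

Both functions return the same dictionary for the sample; ElementTree is usually the shorter of the two, while minidom exposes the full DOM node tree.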

Introduction to Azkaban: deployment, principles, and usage

Introduction to Azkaban: Azkaban is a simple task scheduling service made up of three parts: a web server, a database server, and an executor server. It is a Java project open-sourced by LinkedIn, a batch workflow job scheduler used to run a set of jobs and processes in a specifi ...

Posted by ravi181229 on Thu, 10 Feb 2022 18:43:13 +0100

Kafka 2.6.0 installation and configuration

Kafka installation notes. Official website: http://kafka.apache.org/downloads.html
1. Download:
   yum install -y wget
   wget https://mirrors.bfsu.edu.cn/apache/kafka/2.6.0/kafka_2.12-2.6.0.tgz
2. Extract:
   tar -zxvf kafka_2.12-2.6.0.tgz -C /opt/
3. Rename:
   mv kafka_2.12-2.6.0 kafka
4. Create files:
   cd kafka
   Create un ...

Posted by juschillinnow on Thu, 10 Feb 2022 13:58:48 +0100

Slack off on schedule, clock out on time: a Python countdown to the end of the workday! Never work overtime

Have you ever slacked off at work? In the internet industry, people often talk about the 996 work schedule, but there are also 965 and, above all, 007, and 007 feels like a trip to the ICU. So everyone sneaks in a break and slacks off now and then. There are many ways to slack off. Have you ever done it at work? What did you do with that time? If you fini ...

Posted by silverspy18 on Thu, 10 Feb 2022 11:51:44 +0100
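
The countdown code from the post above is not part of the teaser, so this is only a rough sketch of what an end-of-workday countdown could look like in Python. The 18:00 quitting time and both function names are assumptions made for illustration, not taken from the post.

```python
from datetime import datetime, time, timedelta


def time_until_off_work(now, off_work=time(18, 0)):
    """Return the time left until off_work today, or zero if it has passed.

    The 18:00 default is an assumed quitting time, not one from the post.
    """
    end = datetime.combine(now.date(), off_work)
    return max(end - now, timedelta(0))


def format_countdown(remaining):
    """Format a timedelta as HH:MM:SS."""
    total = int(remaining.total_seconds())
    hours, rest = divmod(total, 3600)
    minutes, seconds = divmod(rest, 60)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}"


if __name__ == "__main__":
    # Print the time remaining from the current moment.
    print(format_countdown(time_until_off_work(datetime.now())))
```

Passing `now` in as a parameter, rather than reading the clock inside the function, keeps the calculation easy to check with fixed timestamps.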

SQL project practice: user behavior analysis of Taobao

1. Introduction to the data: Tianchi Taobao User Data
2. Data import and cleaning
SQL interface tool used this time: SQL workbench
2.1 Importing data
create table userbehavior(
  userID int,
  itemID int,
  categoryID int,
  bahaviortype text,
  timestamp int
);
load data infile "C:/ProgramData/MySQL/MySQL Server 8.0/Uploads/UserBehavior.csv" into ta ...

Posted by CBR on Thu, 10 Feb 2022 10:17:54 +0100

MSLAB (MemStore-Local Allocation Buffer) of HBase

Preface: This article briefly introduces HBase's write cache, the MemStore, and its data structure, along with the role of MSLAB, a main component of the write cache, and an analysis of its source code. MSLAB is short for MemStore-Local Allocation Buffer. It carries out reasonable planning ...

Posted by wheeler08 on Tue, 08 Feb 2022 18:41:18 +0100

Azkaban deployment and configuration

1. Installation and deployment
1.1. Download the source code: https://github.com/azkaban/azkaban/tags
1.2. Extract the source code:
   tar -zxf azkaban-3.85.1.tar.gz -C /opt/soft
1.3. Compile the source code (Git and a Java environment must be installed):
   # Build Azkaban
   ./gradlew build
   # Clean the build
   ./gradlew clean
   # Build and install dis ...

Posted by peterg012 on Tue, 08 Feb 2022 16:01:19 +0100