Learning notes for the financial risk control training camp: understanding the competition problem (Task 2)

These notes cover the learning content of the Alibaba Cloud Tianchi Longzhu Docker training camp. Learning link: https://tianchi.aliyun.com/notebook-ai/detail?spm=5176.20850282.J_3678908510.4.f2984d57AnIOhB&postId=170949 Task 2 purpose: 1. The value of EDA lies mainly in becoming familiar with the basic situation of the whole dat ...

Posted by ina on Sat, 19 Feb 2022 04:52:35 +0100

A complete ELK setup tutorial you have to learn

1. ELK introduction 1.1 What is ELK? "ELK" is an acronym for three open source projects: Elasticsearch, Logstash and Kibana. Elasticsearch is a search and analytics engine. Logstash is a server-side data processing pipeline that collects data from multiple sources at the same time, transforms it, and then sends it to a "reposit ...

Posted by mlewczuk on Fri, 18 Feb 2022 22:51:44 +0100

MySQL: importing millions of rows

1. Methods: import directly from the source SQL file (not recommended); import with the LOAD DATA INFILE command (recommended); switch the table's engine from InnoDB to MyISAM and then import the data (not recommended); import a single SQL statement of imported data as multiple SQL statements (recommended); O ...

Posted by showman on Fri, 18 Feb 2022 22:25:03 +0100

2021-5-5 101701 Hive commands

Chapter 1: Using Hive. Start the Hadoop cluster. Start HDFS HA (check that the web page on port 50070 comes up). Start the ZooKeeper service on each machine: zkServer.sh start. Start the HDFS service: sbin/start-dfs.sh. Start YARN HA (check that the web page on port 8088 comes up). Start YARN: sbin/start-yar ...

Posted by mohabitar on Fri, 18 Feb 2022 15:01:55 +0100

MapReduce processing pictures

Reference link 1, Reference link 2. The code comes from link 2, with my own modifications. My skill is limited, so please point out any mistakes. Environment: Hadoop 3.2.1. The code is written in a CentOS 7 window, packaged, and submitted to a Hadoop cluster on CentOS to run. Idea: put the pictures on HDFS, and then write the path of each im ...

Posted by benyhanna on Fri, 18 Feb 2022 06:16:31 +0100

2021 common big data language Scala: introduction to Scala

1. Introduction to Scala. Scala is a multi-paradigm programming language that runs on the JVM. It supports both object-oriented and functional programming. Multi-paradigm: it combines several programming styles. At present there are four mainstream programming styles, namely object-oriented, procedural, functional and gene ...

Posted by tycragg on Fri, 18 Feb 2022 01:12:33 +0100

Flink keyed state: AggregatingState development example

1. Keyed state description. Following the description on the official website, the keyed state types are introduced as follows: ValueState: stores a value that can be updated and retrieved (as mentioned above, each value is scoped to the key of the current input element, so the operator may hold one value for each key it receives). This v ...

Posted by kumar_ldh on Thu, 17 Feb 2022 17:11:52 +0100

Message queue: Pulsar installation and deployment

Hostnames are obfuscated. 1. Prerequisites: install Java 1.8. ZooKeeper is installed separately; its installation is not covered in detail here. 2. Download apache-pulsar-2.9.1-bin.tar.gz from https://pulsar.apache.org/en/download/ 3. Initialize and create a cluster. Cluster metadata initialization. Tip: you only need to write this metadata once. b ...

Posted by studio805 on Thu, 17 Feb 2022 13:42:13 +0100

SQL ability practice of deep love

@SQL skill improvement: MySQL index. At present I'm a junior. I want to work in data warehouse development, but the market's requirements for data development skills are particularly high. I also interviewed with many small companies, and they agreed that I didn't have a deep enough understanding of the workflow and project expansion of the whole project. A ...

Posted by markmil2002 on Thu, 17 Feb 2022 12:54:18 +0100

After adopting Elasticsearch, the company's system queries are 50 times faster

The company's OA system has to query the Activiti tables for the "processes I initiated" feature. As the interface became slower and slower, we had to respond to the problem. So I'm going to rewrite the company's process query feature and give the system a major upgrade: full-text keyword search. So I ...

Posted by Vebut on Thu, 17 Feb 2022 02:57:39 +0100