Spark day 03: core programming, RDD operators

1: RDD operators. RDD operators, also known as RDD methods, fall into two main categories: transformation operators and action operators. 2: RDD transformation operators. Depending on how they process data, transformation operators are divided into value type, double-value type, and key-value type. 2.1: map (value type transformation) package com.atg ...

Posted by lightpace on Tue, 15 Feb 2022 06:38:42 +0100

GEE series: Unit 8: time series analysis in Google Earth Engine [time series]

Unit 8: time series analysis in Google Earth Engine. 1 Introduction. In this module, we will discuss the following concepts: processing remote sensing images of the ocean; creating a video from an image time series; time series analysis in GEE; adding basic elements to the graphical user interface. 2 Background. The Deepwat ...

Posted by lynxus on Tue, 15 Feb 2022 02:33:56 +0100

GEE series: Unit 1: introduction to Google Earth Engine

Unit 1: introduction to Google Earth Engine. Developed by the Colorado State University graduate degree program in ecology and the laboratory of natural resource ecology. Authors: Peder Engelstad, Dan Carver. Updated May 2021. 1 Introduction 2 Background 3 Start using Google Earth Engine 3.1 Browsing the user interface 3.2 introducti ...

Posted by unreel on Tue, 15 Feb 2022 00:08:59 +0100

Common Linux commands 2

Preparation
# Switch to the /export/ directory and empty its contents
cd /export/ && rm -rf /export/* && tree
# Create a nested directory and add files
mkdir -p /export/aaa/bbb/ccc/ddd/eee/
touch /export/aaa/bbb/ccc/ddd/eee/abc.txt
touch /export/aaa/123.txt
touch /export/aaa/312.txt
# View the contents of ...

Posted by buzz on Sun, 13 Feb 2022 17:43:49 +0100

What are RDD operators in Spark

RDD operators. 1. What is an operator? An API, a method, a behavior. 2. What kinds of operators are there? Transformations and actions. 3. Transformation features: they produce a new RDD and are lazily evaluated. Which transformation operators exist? See the table, for example map and filter. Further classification of transformations, e.g. glom - Elements of each partit ...

Posted by kingdm on Sat, 12 Feb 2022 17:44:38 +0100

Big data Hadoop installation and configuration

Big data, Spark, Hadoop, Python
Big data Hadoop installation and configuration
1, Hadoop pseudo-distributed configuration
1. Create the hadoop user:
sudo useradd -m hadoop -s /bin/bash   # Create the hadoop user
sudo passwd hadoop                    # Set its password
sudo adduser hadoop sudo              # Add administrator privileges
Log out and log in ...

Posted by jamesh on Sat, 12 Feb 2022 04:41:59 +0100

Play Hudi Docker Demo based on Ubuntu -- Spark write and query

Brief introduction. The previous article, "Playing Hudi Docker Demo based on Ubuntu (2) -- writing test data to Kafka", described how to write test data to a Kafka cluster. This article describes how to use Spark to consume the Kafka data and write it to HDFS. Hudi is added to Spark as a JAR package. Types of Hudi tables and queries Tabl ...

Posted by dilum on Fri, 11 Feb 2022 17:01:45 +0100

Spark: integrating a PySpark development environment with Jupyter Notebook

Notes. Basic environment: JDK 8, Python 3.7. Setting up the Spark environment on Windows: first install JDK 8 and Python 3 (not covered here). Install Hadoop 2.7. Download address: http://archive.apache.org/dist/hadoop/core/hadoop-2.7.7/hadoop-2.7.7.tar.gz Then decompress it. Download winutils for Hadoop: https://github.com/stevelou ...

Posted by cl77 on Fri, 11 Feb 2022 15:37:58 +0100

Log4j vulnerability: Elasticsearch upgrade

Versions. Reason for upgrading: to prevent risks caused by the Log4j vulnerabilities. Reason for unifying versions: to prevent unnecessary incidents. Selected versions: elasticsearch: 7.16.2, logstash: 7.16.2, filebeat: 7.16.2. Download instructions: go to the official Elasticsearch website. Select the required ...

Posted by chintansshah on Fri, 11 Feb 2022 14:18:56 +0100

Elasticsearch detailed installation tutorial -- illustrated introduction

Contents: Preparation. Create a user. Download the Elasticsearch installation package (this example uses version 6.2.4). Modify the configuration file jvm.options. Modify the configuration file elasticsearch.yml. Resolve errors. Start Elasticsearch. 1.3.1. Error 1: kernel version too low. 1.3.2. Error 2: insufficient file permissions ...

Posted by KoshNaranek on Fri, 11 Feb 2022 06:22:01 +0100