Installation, configuration and use of HBase
catalogue
Installation, configuration and use of HBase
preface
(1) Produced by Xiao Du (special version for big data of Haida Jike), please use it by the recipient!!!
(2) Pay attention to the directory of this tutorial and modify it according to the specific situation of users!
(3) Most directories with "~" should use the install ...
Posted by irkevin on Wed, 15 Dec 2021 19:59:14 +0100
Hbase index (Phoenix secondary index)
1. Introduction to Phoenix
Hbase is suitable for storing a large number of NOSQL data with low requirements for relational operations. Due to the limitations of Hbase design, it is not possible to directly use the native API to perform the operations such as condition judgment and aggregation commonly used in relational databa ...
Posted by natronp on Fri, 03 Dec 2021 22:04:23 +0100
III-3. Interaction between HBase and Hive
III-3. Interaction between HBase and Hive
3.1 comparison between HBase and Hive
[Hive]
OutlineElaborate1. Data warehouseThe essence of Hive is actually equivalent to making a bijection relationship between the files already stored in HDFS in Mysql toIt is convenient to use HQL to manage queries.2. Used for data analysis and cleaningHive is s ...
Posted by abitshort on Sun, 28 Nov 2021 11:10:16 +0100
2021SC@SDUSC Hbase project code analysis - LruBlockCache
2021SC@SDUSC
catalogue
1, Introduction
2, Cache level
3, Implementation analysis of LruBlockCache
4, Implementation analysis of obsolete cache
1, Introduction
If you access hfile every time you read data, the efficiency is very low, especially when reading random small amounts of data. In order to improve th ...
Posted by ganlal on Thu, 25 Nov 2021 21:58:48 +0100
Configure Hadoop 2.7.1 + HBase 1.1.5 + mysql8.0.27 + hive1.2.1 + sqoop1.4.6 experimental environment in docker
preface
Recently, a course experiment needs to be configured with the environment shown in the title, so the author takes this opportunity to use docker to build a returnable container experiment environment to complete the experiment, and also records some errors encountered in the construction process.
1, Environmental description
...
Posted by porco on Sun, 21 Nov 2021 00:36:54 +0100
Hbase configuration of stand-alone version and pseudo distributed version and solutions to common problems
Hbase configuration of stand-alone version and pseudo distributed version and solutions to common problems
Download Hbase
The Hbase version used by the author is 2.3.7, which is applicable to hadoop-2.10.x and hadoop-3.x. The following is the image download website:
Click here to jump to the image download website of Hbase-2.3.7
Download th ...
Posted by Dynamis on Tue, 16 Nov 2021 17:13:12 +0100
2021SC@SDUSC Code analysis of HBase project -- Region load balancing of Region mechanism
2021SC@SDUSC
1, Overview
In the distributed system, load balancing is a very important function. In HBase, Region is a data Region segmented by the table according to the row direction, which is managed by the RegionServer and provides data reading and writing services to the outside. If there are too many regions on a RegionServer, the co ...
Posted by xxreenaxx1 on Thu, 11 Nov 2021 00:13:05 +0100
2021SC@SDUSC hbase source code analysis and HLog analysis
2021SC@SDUSC hbase source code analysis (V) HLog analysis
2021SC@SDUSC2021SC@SDUSC
2021SC@SDUSC2021SC@SDUSC
2021SC@SDUSC2021SC@SDUSC
HLog
By default, the data of all write operations (write, update and delete) are written to the HLog in the form of append first, and then to the MemStore. In most cases, the HLog will not be read, but if ...
Posted by HockeyDevil07 on Tue, 26 Oct 2021 08:13:55 +0200
Cluster deployment of Apache HBase 1.2.12 under CentOS
Cluster deployment of Apache HBase 1.2.12 under CentOS
1, Dependent environment
Pre dependency and installation
CentOS 7. X, JDK, Hadoop 2.7.7 cluster, zookeeper-3.4.14 clusterThe three node zookeeper is deployed this time. The corresponding machine hostname s are bigdata02, bigdata03 and bigdata04
The cluster planning is shown in the follo ...
Posted by sdallas411 on Sun, 24 Oct 2021 18:03:25 +0200
2021SC@SDUSC Hbase project source code analysis - flush
2021SC@SDUSC
catalogue
1, Introduction
2, Source code analysis
1, Introduction
HBase is a distributed database based on LSM model (log structured merge trees). Compared with the B + tree used in Oracle ordinary index, a major feature of LSM model is that it adopts a balance between reading and writing, sacrificing the performance of some ...
Posted by Ruski on Fri, 15 Oct 2021 20:35:07 +0200