Implementing backward recomputation in OneFlow: trading time for space to greatly reduce GPU memory usage

Author: Zhao Luyang. In 2016, Chen Tianqi's team proposed "gradient/activation checkpointing" and other techniques related to sublinear memory optimization [1], aiming to reduce the memory consumed by intermediate activations during deep learning training. Checkpointing is a kind of sublinear mem ...

Posted by dfarrar on Wed, 09 Feb 2022 18:55:16 +0100

Quantitative techniques with data - Part III: understanding backtrade

Interested readers, especially those interested in applying big data and artificial intelligence to quantitative analysis, can follow my official account. If the script code is unclear, please see the official account: datahomex. Through the introduction in the previous two articles in this series, we must ...

Posted by leocon on Wed, 09 Feb 2022 16:13:08 +0100

[Machine learning notes] Implementing GBDT by hand

Source: https://iyinst.github.io/2021/05/19/%E6%9C%BA%E5%99%A8%E5%AD%A6%E4%B9%A0%E7%AC%94%E8%AE%B0-%E6%89%8B%E6%92%95GBDT/ The principles of the gradient boosting decision tree (GBDT) will not be repeated here. Readers unfamiliar with them can refer to Li Hang's Statistical Learning Methods. Here is an implementation of GBDT. Before coding, we should first co ...

Posted by izy on Wed, 09 Feb 2022 01:30:10 +0100

Introduction to Machine Learning--Polynomial Regression

1. Polynomial Regression If your data points are clearly not suited to linear regression (a straight line through the data points), polynomial regression may be an ideal choice. Like linear regression, polynomial regression uses the relationship between the variables x and y to find the best way to draw a line through the data points. 2. Working principle ...

Posted by phpshift on Tue, 08 Feb 2022 18:38:32 +0100

Naive Bayes in sklearn machine learning

🌟 Last time we covered the decision tree algorithm as implemented in sklearn. This time we look at naive Bayes and what "naive" means: the features are assumed to be independent of one another, with no correlation between them. Readers interested in the earlier posts can refer to the following 👇: ...

Posted by mitsubishi2002 on Mon, 07 Feb 2022 22:22:01 +0100

Python time series analysis and practice

Time series analysis: ARIMA. Stationarity: stationarity requires that the curve fitted to the sample time series can continue in the future along its existing form, by "inertia". Stationarity requires that the mean and variance of the series do not change significantly. Strict and weak stationarity: Strict stationarity: the distr ...

Posted by trinitywave on Mon, 07 Feb 2022 13:31:46 +0100
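
The time series entry above states that stationarity requires the mean and variance of a series not to change significantly. A rough, informal way to see this is to compare the mean and variance of the first and second halves of a series. The sketch below is not taken from the linked post: the helper name `half_stats`, the synthetic series, and the drift of 0.05 per step are all assumptions made for illustration, and this is not a formal statistical test.

```python
import random
from statistics import mean, pvariance


def half_stats(series):
    """Return (mean, variance) for the first and second half of a series.

    Hypothetical helper for illustration only; an informal check,
    not a formal stationarity test.
    """
    mid = len(series) // 2
    first, second = series[:mid], series[mid:]
    return (mean(first), pvariance(first)), (mean(second), pvariance(second))


random.seed(0)  # fixed seed so the example is reproducible

# Synthetic data (assumed): white noise around a constant level.
noise = [random.gauss(0.0, 1.0) for _ in range(400)]

# The same noise with a linear drift added, so the mean changes over time.
trend = [0.05 * t + e for t, e in enumerate(noise)]

(m1, v1), (m2, v2) = half_stats(noise)
(tm1, tv1), (tm2, tv2) = half_stats(trend)

print("noise:", round(m1, 2), round(m2, 2), round(v1, 2), round(v2, 2))
print("trend:", round(tm1, 2), round(tm2, 2), round(tv1, 2), round(tv2, 2))
```

For the noise series the two halves should have similar means and variances, while for the drifting series the mean of the second half is clearly larger, which is the kind of change the stationarity requirement rules out.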

Gradient descent with RMSProp from scratch in Python

[Translated from: Gradient Descent With RMSProp from Scratch] [Note: I like Dr. Jason Brownlee's articles very much, so I do some translation and study practice in my spare time. This is a record of that practice, in the hope that it helps those who need it!] Gradient descent is an optimization algorithm, which follows the neg ...

Posted by tekcap on Mon, 07 Feb 2022 10:31:30 +0100
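
As a companion to the RMSProp entry above, here is a minimal sketch of the update rule: keep a decaying average of squared gradients and divide each step by its square root. The function name `rmsprop`, the objective f(x) = x^2, and the hyperparameter values are assumptions chosen for illustration; they are not taken from the translated article.

```python
import math


def rmsprop(grad, x0, lr=0.01, rho=0.9, eps=1e-8, steps=1000):
    """Minimize a 1-D function with RMSProp, given its gradient.

    Hypothetical helper for illustration; lr, rho, eps and steps
    are assumed values, not recommendations.
    """
    x = x0
    avg_sq = 0.0  # decaying average of squared gradients
    for _ in range(steps):
        g = grad(x)
        # Update the running average of the squared gradient.
        avg_sq = rho * avg_sq + (1.0 - rho) * g * g
        # Scale the step by the root of that average.
        x -= lr * g / (math.sqrt(avg_sq) + eps)
    return x


# Assumed objective: f(x) = x^2, whose gradient is 2x and whose minimum is at 0.
x_min = rmsprop(lambda x: 2.0 * x, x0=1.0)
print(x_min)
```

Because the step is normalized by the gradient magnitude, the iterate ends up close to the minimum at 0 rather than exactly on it; a smaller `lr` tightens that neighborhood.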

Implementation of a fully connected neural network in C++

To describe the environment: the code is implemented in C++, but the matrix operations are not implemented from scratch. Reinventing the wheel is a waste of time and unnecessary, so the Eigen library is used for matrix operations, and the code for the other functions is written by hand. 1. Environment configuration The Eigen mat ...

Posted by lady_bug on Mon, 07 Feb 2022 09:22:03 +0100

2. ROS communication mechanisms - topics

The basic communication mechanisms in ROS follow three main implementation strategies: topic communication (publish-subscribe mode), service communication (request-response mode), and the parameter server (parameter-sharing mode). 1. Topic communication: publish-subscribe mode. One node publishes messages to the topic, and the other node ...

Posted by naggi on Mon, 07 Feb 2022 07:06:07 +0100

Clustering algorithm notes

Clustering algorithm - Kmeans 1. Unsupervised learning and clustering algorithms. In supervised learning, the sample data the model requires includes both the feature matrix X and the true labels y. Unsupervised learning means that the model only needs the feature matrix X, without the true labels y. clustering algorithm i ...

Posted by luiddog on Mon, 07 Feb 2022 05:09:51 +0100