GBDTE

It's quite hard to navigate the channel, so I created this navigation/summary post. It's about a pet project I started about ten years ago. The main idea is that we can use slightly modified gradient boosted decision trees to both group objects and find trends for these groups.

๐Ÿ“ˆbeginning - the very first picture, the whole idea
๐Ÿ“ˆcredit scoring - problem statement, temporal instability
๐Ÿ“ˆdataset - dataset preparation, ytzaurus vs Oracle
๐Ÿ“ˆVanilla GBDTE - experiment with math in instant view
๐Ÿ“ˆSmall MSE Dataset - the first approach to synthetic dataset for MSE GBDTE
๐Ÿ“‰Extracting components - how to get perfect components from chaotic signal
๐Ÿ“‰Leafs and components - check tree leaves and plot components
๐Ÿ“‰Evil of defaults - a debugging session, culprit - default parameters
๐Ÿ“‰Big MSE dataset - scatterplot with more clear "Gradient Boosting" message
๐Ÿ“‰LogLoss dataset - non-stationary dataset for binary classification
๐ŸŽฒExperiment on LogLoss dataset - first approach for running the algorithm on the dataset
๐ŸŽฒbad results - a very important mistake! Why you shouldn't use interpolation factors as extrapolating ones
๐ŸŽฒillustration for unstable class - a picture for a presentation
๐ŸŽฒlearning curves LogLoss - learning curves for LogLoss case (non-stationary binary classification)