GBDTE
It's quite hard to navigate the channel, so I created this navigation/summary post. It's about a pet project I started about ten years ago. The main idea is that we can use slightly modified gradient boosted decision trees to both group objects and find trends for these groups.
๐beginning - the very first picture, the whole idea
๐credit scoring - problem statement, temporal instability
๐dataset - dataset preparation, ytzaurus vs Oracle
๐Vanilla GBDTE - experiment with math in instant view
๐Small MSE Dataset - the first approach to synthetic dataset for MSE GBDTE
๐Extracting components - how to get perfect components from chaotic signal
๐Leafs and components - check tree leaves and plot components
๐Evil of defaults - a debugging session, culprit - default parameters
๐Big MSE dataset - scatterplot with more clear "Gradient Boosting" message
๐LogLoss dataset - non-stationary dataset for binary classification
๐ฒExperiment on LogLoss dataset - first approach for running the algorithm on the dataset
๐ฒbad results - a very important mistake! Why you shouldn't use interpolation factors as extrapolating ones
๐ฒillustration for unstable class - a picture for a presentation
๐ฒlearning curves LogLoss - learning curves for LogLoss case (non-stationary binary classification)
