SMOTE is a synthetic minority oversampling technique based on nearest neighbor information. It was first developed for a numeric column where the minority class is upsampled by taking each sample of the minority class and its nearest neighbors and forming a linear combination of them. SMOTEN-C also takes categorical columns into account and selects the most frequent category of nearest neighbors. This interpolation is better than naive resampling, which only randomly resamples for an existing sample, but on the other hand, linear interpolation cannot achieve the complexity of new minority samples that can be achieved using synthetic data generation.
We use third-party web analytics tools to analyze website usage and measure the success of advertising campaigns. Cookies are set in the process and data is partly transferred to the USA. Further details can be found in our privacy policy. You can revoke or adjust your selection at any time under Settings.
Here you will find an overview of all cookies used. You can give your consent to whole categories or display further information and select certain cookies.