Madhu Chavva’s Post

View profile for Madhu Chavva, graphic

Vice President of Engineering | Technology Advisor | Angel Investor

Personalized Programs - Iteration 1 When building personalized programs using machine learning algorithms like Random Forest and K-Means, feature scaling is a crucial preprocessing step. The choice of feature scaling method can significantly influence model performance. Although StandardScaler is commonly used for various machine learning algorithms, it assumes that the features follow a normal distribution. In our initial approach, we used StandardScaler with a plan to handle outliers during the data preprocessing stages. However, the features did not follow a normal distribution and were light-tailed, with a kurtosis value of less than 3 and were right-skewed in nature. So, the alternative was to use advanced scalers and we opted for RobustScaler to maintain the integrity of the feature ranges. The added complexity to our pipeline was justified as we observed improved results. We then incorporated the RandomizedSearchCV technique to find the optimal hyperparameters for our Random Forest model, which further boosted performance. That's the end of iteration 1. What techniques do you follow to increase the accuracy of an ML model? #machinelearningengineering #mlops #datapreprocessing #modelperformance #modeloptimization #featurescaling

To view or add a comment, sign in

Explore topics