Skip to main content

All Questions

1vote
1answer
659views

Inverse Scaling Partitioned Data

I have scaled an original matrix A with sklearn's StandardScaler, resulting to a matrix S. I then partitioned the result into ...
ccccc's user avatar
1vote
1answer
316views

Scale dataset while preserving relative distributions between columns

I have a large dataset with 460 columns. The columns have names such as 'AppOpen_1day', 'AppOpen_2day', ...... 'AppOpen_15day', 'Dig_Pos_1day', 'Dig_Pos_2day', ...... 'DigPos_15day' etc. Each column ...
SacredMechanic's user avatar
1vote
1answer
125views

Out-of-Range Target Variable in Sequence-based Machine Learning Model

I'm encountering a scaling issue in a machine learning project. I'm predicting a target variable from an input sequence (and doing this for many). However, I've encountered a challenge where the ...
Bloggy's user avatar
2votes
4answers
438views

Are scalers or encoders supposed to be serialized along with trained models?

Consider the very basic example below: ...
Muhammad Usman's user avatar
2votes
1answer
1kviews

StandardScaler and MinMaxScaler vs RobustScaler

I've recently read that Standard Scaler functions best in situations where the distribution of the features are approximately normal. MinMaxScaler works in a way that it preserves the features' ...
Arthur Caldas's user avatar
1vote
2answers
1kviews

problem with standardScaler

problem with standardScaler hi I'd like to scale one column in the titanic data set. I am using the following code segment. for some reason df_scaled results an empty set. how can I solve it? what is ...
Mehmet Deniz's user avatar
2votes
4answers
6kviews

why to use Scaler.fit only on x_train and not on x_test for normalizing value using MinMaxScaler?

while normalising the data everone is saying that we need to fit only on x_train and not on x_test ? why is that we should not fit x_test ? if we should not fit the scaler on x_test then why we need ...
star's user avatar
  • 1,511
0votes
1answer
751views

Feature scaling in Linear Regression

I always use Linearregression() class in sklearn library for creating a linear regression model. According to my understanding, we need feature scaling in linear ...
AAA's user avatar
  • 35
1vote
1answer
506views

Is it a good practice to evaluate model performance by comparing the metrics of rescaled (inverse transformed) predictions and true target values?

I am now working with a Linear Regression for a time-series regression problem (I am sorry that I cannot say too much about the problem and feature vector due to NDA). I scaled both the input values ...
glorian's user avatar
0votes
1answer
8kviews

__init__() takes 1 positional argument but 4 were given sklearn standard scaler error

I defined a class like below: ...
Mohsen Mahmoodzadeh's user avatar
1vote
1answer
37views

When training a sklearn machine learning model, what part of a data from a csv file needs scaling like MaxAbsScaler or MinMaxScaler?

Consider the code below: ...
Muhammad Usman's user avatar
0votes
1answer
280views

How to scale a subset of data with respect to the entire dataset

I am developing a financial time-series prediction model using sklearn using StandardScaler for scaling purposes. I train a model, and then use the model regularly ...
functorial's user avatar
1vote
0answers
24views

Clustering Similar Articles Using Mixed Data: Seeking Advice and Validation

Question: I'm working on a project where I need to cluster a dataset of articles based on various features, including text, numeric values, and categorical data. I've implemented a clustering approach ...
sara sara's user avatar
2votes
1answer
4kviews

When and how to use StandardScaler with target data for pre-processing

I am trying to figure out when and how to use scikit-learn's StandardScaler transformer, and how I can apply it to the target variable as well. I've read this post ...
Step92's user avatar
0votes
1answer
3kviews

For sklearn ML algorithms, is it possible to use boolean data alongside continuous data for the predictive data, and if so how can the data be scaled?

I have a medium size data set (7K) of patient age, sex, and pre-existing conditions. Age of course is from 0-101, sex is 1 for male, 2 for female, and -1 for diverse. All the pre-conditions are ...
DrWhat's user avatar

153050per page
close