ENH Raises warning when getting non-finite score in SearchCV#18266

subrat93 · 2020-08-26T17:00:40Z

Reference Issues/PRs

Fixes#10529
Supersedes and closes#10546
Supersedes and closes#15469

What does this implement/fix? Explain your changes.

The fix checks for the presence of any inf/-inf values in the mean score calculated after GridSearchCV.
If yes, it raises a warning - "One or more of the test scores are infinite"

Any other comments?

glemaitre

Please add an entry to the change log at doc/whats_new/v*.rst. Like the other entries there, please reference this pull request with :pr: and credit yourself (and other contributors if applicable) with :user:.
In addition to yourself, please add @Nirvan101 @ArthurBook as contributors to this PR.

sklearn/model_selection/_search.py

sklearn/model_selection/tests/test_search.py

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

sklearn/model_selection/tests/test_search.py

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

glemaitre · 2020-08-28T11:46:29Z

Can you solve the linting issue: https://app.circleci.com/pipelines/github/scikit-learn/scikit-learn/7266/workflows/6e341ac6-b7dc-498a-9133-59690fcef076/jobs/118045

sklearn/model_selection/tests/test_search.py:28:1: F401 'scipy.stats.distributions.norm' imported but unused from scipy.stats.distributions import norm ^ sklearn/model_selection/tests/test_search.py:1790:1: W293 blank line contains whitespace ^ sklearn/model_selection/tests/test_search.py:1791:1: W293 blank line contains whitespace ^

subrat93 · 2020-08-28T13:57:52Z

@glemaitre, thanks for the review. I've resolved the linting issues.

doc/whats_new/v0.24.rst

glemaitre · 2020-08-28T14:00:07Z

sklearn/model_selection/tests/test_search.py

@@ -26,6 +26,7 @@

 from scipy.stats import bernoulli, expon, uniform

+


You don't need this extra return line

glemaitre

Just 2 small fixes otherwise LGTM.

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

glemaitre · 2020-08-28T14:30:18Z

@thomasjpfan @adrinjalali Do you want to have a look?

thomasjpfan

Thank you for the PR @subrat93 !

doc/whats_new/v0.24.rst

thomasjpfan · 2020-08-28T22:27:48Z

sklearn/model_selection/tests/test_search.py

@@ -1750,6 +1750,43 @@ def get_n_splits(self, *args, **kw):
 ridge.fit(X[:train_size], y[:train_size])


+@pytest.mark.parametrize("return_train_score", [False, True])
+def test_gridsearchcv_raise_warning_with_non_finite_score(return_train_score):


We can also check for RandomSearchCV here as well:
@pytest.mark.parametrize("return_train_score", [False, True])@pytest.mark.parametrize("SearchCV, specialized_params", [(GridSearchCV, {"param_grid": {"max_depth": [2, 3]}}), (RandomizedSearchCV, {"param_distributions": {"max_depth": [2, 3]}, "n_iter": 2})])deftest_searchcv_raise_warning_with_non_finite_score( SearchCV, specialized_params, return_train_score): ... grid=SearchCV( DecisionTreeClassifier(), scoring=FailingScorer(), cv=3, return_train_score=return_train_score, **specialized_params )

sklearn/model_selection/tests/test_search.py

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

thomasjpfan

LGTM Thank you @subrat93 !

glemaitre · 2020-08-31T09:05:41Z

Thanks @subrat93

subrat93 · 2020-08-31T10:22:32Z

Thanks a lot @glemaitre and @thomasjpfan for your support!! I look forward to contributing more.

…learn#18266) Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

subrat93 added 3 commits August 26, 2020 21:27

fixed inf issue in GridSearchCV
ac8a43d

Fixed PEP8violation in the fix
288dd8b

Fixed test_inf_warnings_in_GridSearchCV
31ed928

github-actionsbot added the module:model_selection label Aug 26, 2020

glemaitre reviewed Aug 27, 2020
View reviewed changes

subrat93and others added 10 commits August 27, 2020 15:06

Apply suggestions from code review
e7ad299
Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

Incorporated Review Comments
954dab8

resolving errors
69f8d5c

still resolving errors
f09eb3a

still resolving errors
b21012d

still resolving errors
a0fa8e8

lint issues
25a463e

fixing yet another error
ed38c74

fixing error
cc863e3

fixing lint error
e247463

glemaitre self-requested a review August 28, 2020 09:10

glemaitre reviewed Aug 28, 2020
View reviewed changes

sklearn/model_selection/tests/test_search.py Outdated Show resolvedHide resolved

glemaitre changed the title ~~Fixed -> inf or -inf values in CV ruin the mean and std~~ENH raise warning when getting non-finite value during scoring in GridSearchCVAug 28, 2020

Update sklearn/model_selection/tests/test_search.py
48f0ebe
Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

fixed lint issues
7575547

subrat93 requested a review from glemaitre August 28, 2020 13:58

glemaitre reviewed Aug 28, 2020
View reviewed changes

doc/whats_new/v0.24.rst Outdated Show resolvedHide resolved

glemaitre reviewed Aug 28, 2020
View reviewed changes

glemaitre approved these changes Aug 28, 2020
View reviewed changes

subrat93and others added 2 commits August 28, 2020 19:38

Update doc/whats_new/v0.24.rst
3663a37
Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

may be final changes
f4d5160

subrat93 requested a review from glemaitre August 28, 2020 14:28

glemaitre removed their request for review August 28, 2020 14:29

thomasjpfan reviewed Aug 28, 2020
View reviewed changes

subrat93and others added 4 commits August 29, 2020 12:09

Update sklearn/model_selection/tests/test_search.py
3cd73a0
Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

Update doc/whats_new/v0.24.rst
299aafc
Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

review related changes
868d506

fixing multiple param grid error
4a9e627

thomasjpfan approved these changes Aug 29, 2020
View reviewed changes

thomasjpfan changed the title ~~ENH raise warning when getting non-finite value during scoring in GridSearchCV~~ENH Raises warning when getting non-finite score in SearchCVAug 29, 2020

thomasjpfan merged commit 192c233 into scikit-learn:masterAug 29, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH Raises warning when getting non-finite score in SearchCV#18266

ENH Raises warning when getting non-finite score in SearchCV #18266

subrat93 commented Aug 26, 2020•
edited by glemaitre
Loading

glemaitre left a comment

glemaitre commented Aug 28, 2020

subrat93 commented Aug 28, 2020

glemaitreAug 28, 2020

glemaitre left a comment

glemaitre commented Aug 28, 2020

thomasjpfan left a comment

thomasjpfanAug 28, 2020

thomasjpfan left a comment

glemaitre commented Aug 31, 2020

subrat93 commented Aug 31, 2020

		@@ -26,6 +26,7 @@

		from scipy.stats import bernoulli, expon, uniform

ENH Raises warning when getting non-finite score in SearchCV#18266

ENH Raises warning when getting non-finite score in SearchCV #18266

Conversation

subrat93 commented Aug 26, 2020• edited by glemaitre Loading

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

glemaitre left a comment

Choose a reason for hiding this comment

glemaitre commented Aug 28, 2020

subrat93 commented Aug 28, 2020

glemaitreAug 28, 2020

Choose a reason for hiding this comment

glemaitre left a comment

Choose a reason for hiding this comment

glemaitre commented Aug 28, 2020

thomasjpfan left a comment

Choose a reason for hiding this comment

thomasjpfanAug 28, 2020

Choose a reason for hiding this comment

thomasjpfan left a comment

Choose a reason for hiding this comment

glemaitre commented Aug 31, 2020

subrat93 commented Aug 31, 2020

subrat93 commented Aug 26, 2020•
edited by glemaitre
Loading