BUG fix SparseCoder to follow scikit-learn API and allow cloning#17679

sdpython · 2020-06-23T16:57:36Z

Reference Issues/PRs

Fixes#8675
Fixes#16336
Supersedes and closes#16346

What does this implement/fix? Explain your changes.

SparseCoder cannot be used in a GridSearchCV because it could not be cloned. A parameter in the constructor had a different meaning when stored in an instance (dictionary).

glemaitre · 2020-06-24T07:28:21Z

Uhm did something about this one. Let me check.

glemaitre · 2020-06-24T07:32:01Z

Found it: #16346

sdpython · 2020-06-24T07:47:13Z

Let me know which one you prefer. PR #16346 does not test cloning and that's what caused the issue. I recommend adding one more test.

glemaitre · 2020-06-24T08:17:29Z

I think this is important to make the estimator scikit-learn compliant (it will crash soon otherwise). Cloning is part of that so we should merge both PR. Could you merge my PR inside yours like this you supersede mine? I will review it then.

glemaitre · 2020-06-24T08:22:01Z

I just solve the conflicts in mine so you should have no so much trouble to merge them.

cmarmo · 2020-06-24T08:26:52Z

@sdpython do you mind editing the PR adding that you will also close #16336? Thanks for your patience.

sdpython · 2020-06-24T10:10:47Z

ok :)

…into i8675sparse

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

…into i8675sparse

glemaitre · 2020-06-25T09:41:44Z

@sdpython I will do a review. I think that we might need to add some of the test (usually done in common test) but that require specific shape for X and dictionary.

ogrisel

A few comments but otherwise LGTM. I don't think there is an easy way to re-add SparseCoder to test_common as it requires to specify the dictionary in the constructor params and the dictionary shape should be consistent with X.shape[1].

sklearn/decomposition/tests/test_dict_learning.py

glemaitre · 2020-06-25T10:26:55Z

We would need to have something like that (it is only the transformer).

deftest_sparse_coder_common_transformer(): fromfunctoolsimportpartialfromsklearn.utils.estimator_checksimportcheck_transformer_data_not_an_arrayfromsklearn.utils.estimator_checksimportcheck_transformer_generalfromsklearn.utils.estimator_checksimportcheck_transformers_unfittedrng=np.random.RandomState(777) n_components, n_features=40, 3init_dict=rng.rand(n_components, n_features) sc=SparseCoder(init_dict) check_transformer_data_not_an_array(sc.__class__.__name__, sc) check_transformer_general(sc.__class__.__name__, sc) check_transformer_general_memmap=partial( check_transformer_general, readonly_memmap=True ) check_transformer_general_memmap(sc.__class__.__name__, sc) check_transformers_unfitted(sc.__class__.__name__, sc)

glemaitre · 2020-06-25T10:27:39Z

I think this is wiser to merge with the current test and see how the test_common.py could be modify such that the size of the dictionary can be modified on the fly.

adrinjalali · 2020-06-25T10:32:22Z

A few comments but otherwise LGTM. I don't think there is an easy way to re-add SparseCoder to test_common as it requires to specify the dictionary in the constructor params and the dictionary shape should be consistent with X.shape[1].

I have encountered this while working on other estimators outside sklearn, and it's partly why I wanted to have a better way of telling common tests now to generate required data for certain tests.

sdpython · 2020-06-25T11:01:25Z

deftest_sparse_coder_common_transformer(): fromfunctoolsimportpartialfromsklearn.utils.estimator_checksimportcheck_transformer_data_not_an_arrayfromsklearn.utils.estimator_checksimportcheck_transformer_generalfromsklearn.utils.estimator_checksimportcheck_transformers_unfittedrng=np.random.RandomState(777) n_components, n_features=40, 3init_dict=rng.rand(n_components, n_features) sc=SparseCoder(init_dict) check_transformer_data_not_an_array(sc.__class__.__name__, sc) check_transformer_general(sc.__class__.__name__, sc) check_transformer_general_memmap=partial( check_transformer_general, readonly_memmap=True ) check_transformer_general_memmap(sc.__class__.__name__, sc) check_transformers_unfitted(sc.__class__.__name__, sc)

Added.

sklearn/decomposition/tests/test_dict_learning.py

ogrisel

LGTM! Thanks @sdpython!

glemaitre · 2020-06-25T17:50:16Z

There is a single thing missing which is an entry in what's new because we are deprecated one parameter.
Can you this acknowledging yourself @sdpython and then we are good for merging.

agramfort

@glemaitre merge if happy

thx @sdpython

agramfort · 2020-06-25T21:05:31Z

actually @glemaitre had approved so go !

thx @sdpython

glemaitre · 2020-06-26T06:28:13Z

I will make a PR adding the deprecation in what's new :)

Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

…kit-learn#17679) * BUG fix SparseCoder to follow scikit-learn API * TST check that get_params and set_params work as expected * address comments * PEP8 * iter * Fixesscikit-learn#8675, fix cloning for SparseCoder * remove spaces * Update _dict_learning.py * fix confusing arguments * remove unnecessary code * Update test_common.py * removes spaces * PEP8 * iter * fix merge * ignore a mypy warning * type: ignore * remove one deprecated verification * Update sklearn/decomposition/_dict_learning.py Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> * Update sklearn/decomposition/_dict_learning.py Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> * Update sklearn/decomposition/tests/test_dict_learning.py Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> * test clone produces different id * review * add one more test * lint * Update sklearn/decomposition/tests/test_dict_learning.py Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

) Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

…kit-learn#17679) * BUG fix SparseCoder to follow scikit-learn API * TST check that get_params and set_params work as expected * address comments * PEP8 * iter * Fixesscikit-learn#8675, fix cloning for SparseCoder * remove spaces * Update _dict_learning.py * fix confusing arguments * remove unnecessary code * Update test_common.py * removes spaces * PEP8 * iter * fix merge * ignore a mypy warning * type: ignore * remove one deprecated verification * Update sklearn/decomposition/_dict_learning.py Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> * Update sklearn/decomposition/_dict_learning.py Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> * Update sklearn/decomposition/tests/test_dict_learning.py Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> * test clone produces different id * review * add one more test * lint * Update sklearn/decomposition/tests/test_dict_learning.py Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

) Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

glemaitreand others added 8 commits January 31, 2020 15:40

BUG fix SparseCoder to follow scikit-learn API
9ecd4f4

TST check that get_params and set_params work as expected
80ce689

address comments
1a3bca9

PEP8
cf51000

Merge remote-tracking branch 'origin/master' into is/16336
ea9d72e

iter
1eaa35c

Fixes#8675, fix cloning for SparseCoder
b9a74ad

remove spaces
61ab0d3

github-actionsbot added the module:decomposition label Jun 23, 2020

sdpython added 5 commits June 23, 2020 23:58

Update _dict_learning.py
070f1b5

fix confusing arguments
9f445b5

remove unnecessary code
6acc55e

Update test_common.py
33335e5

removes spaces
29099c3

Merge remote-tracking branch 'origin/master' into is/16336
5ca7f1d

glemaitre added 4 commits June 24, 2020 09:57

Merge remote-tracking branch 'glemaitre/is/16336' into is/16336
07f2cf8

PEP8
a0d0187

iter
690370e

Merge remote-tracking branch 'origin/master' into is/16336
fbdb1f7

sdpython added 4 commits June 24, 2020 13:15

Merge branch 'is/16336' ofhttps://github.com/glemaitre/scikit-learn…
f646140
…into i8675sparse

fix merge
96685de

Merge branch 'master' ofhttps://github.com/scikit-learn/scikit-learn…
3a98281
…into i8675sparse

ignore a mypy warning
84b6bca

sdpythonand others added 3 commits June 25, 2020 10:29

Update sklearn/decomposition/tests/test_dict_learning.py
ddd70e8
Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

Merge branch 'master' ofhttps://github.com/scikit-learn/scikit-learn…
6b1bdd5
…into i8675sparse

test clone produces different id
18a2306

glemaitre self-requested a review June 25, 2020 09:40

ogrisel reviewed Jun 25, 2020
View reviewed changes

sklearn/decomposition/tests/test_dict_learning.py Outdated Show resolvedHide resolved
sklearn/decomposition/tests/test_dict_learning.py Outdated Show resolvedHide resolved
sklearn/decomposition/tests/test_dict_learning.pyShow resolvedHide resolved

review
79b3cd9

add one more test
0f98a76

lint
dfbcb2c

ogrisel reviewed Jun 25, 2020
View reviewed changes

sklearn/decomposition/tests/test_dict_learning.pyShow resolvedHide resolved

Update sklearn/decomposition/tests/test_dict_learning.py
b7db869

ogrisel approved these changes Jun 25, 2020
View reviewed changes

ogrisel mentioned this pull request Jun 25, 2020
BUG fix SparseCoder to follow scikit-learn API #16346
Closed

glemaitre approved these changes Jun 25, 2020
View reviewed changes

glemaitre changed the title ~~Fixes #8675, fix cloning for SparseCoder~~BUG fix SparseCoder to follow scikit-learn API and allow cloningJun 25, 2020

agramfort approved these changes Jun 25, 2020
View reviewed changes

agramfort merged commit b0c03d1 into scikit-learn:masterJun 25, 2020

ogrisel added a commit that referenced this pull request Jun 26, 2020
DOC add whats new entry following#17679(#17738)
27cfe14
Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

glemaitre added a commit to glemaitre/scikit-learn that referenced this pull request Jul 17, 2020
DOC add whats new entry followingscikit-learn#17679(scikit-learn#17738
781a933
) Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

ogrisel mentioned this pull request Jul 25, 2020
FIX SparseCoder set_params works correctly #15236
Closed

jayzed82 pushed a commit to jayzed82/scikit-learn that referenced this pull request Oct 22, 2020
DOC add whats new entry followingscikit-learn#17679(scikit-learn#17738
f26e340
) Co-authored-by: Olivier Grisel <olivier.grisel@ensta.org>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG fix SparseCoder to follow scikit-learn API and allow cloning#17679

BUG fix SparseCoder to follow scikit-learn API and allow cloning #17679

sdpython commented Jun 23, 2020•
edited by lesteve
Loading

glemaitre commented Jun 24, 2020

glemaitre commented Jun 24, 2020

sdpython commented Jun 24, 2020

glemaitre commented Jun 24, 2020

glemaitre commented Jun 24, 2020

cmarmo commented Jun 24, 2020

sdpython commented Jun 24, 2020

glemaitre commented Jun 25, 2020

ogrisel left a comment

glemaitre commented Jun 25, 2020

glemaitre commented Jun 25, 2020

adrinjalali commented Jun 25, 2020

sdpython commented Jun 25, 2020

ogrisel left a comment

glemaitre commented Jun 25, 2020

agramfort left a comment

agramfort commented Jun 25, 2020

glemaitre commented Jun 26, 2020

BUG fix SparseCoder to follow scikit-learn API and allow cloning#17679

BUG fix SparseCoder to follow scikit-learn API and allow cloning #17679

Conversation

sdpython commented Jun 23, 2020• edited by lesteve Loading

Reference Issues/PRs

What does this implement/fix? Explain your changes.

glemaitre commented Jun 24, 2020

glemaitre commented Jun 24, 2020

sdpython commented Jun 24, 2020

glemaitre commented Jun 24, 2020

glemaitre commented Jun 24, 2020

cmarmo commented Jun 24, 2020

sdpython commented Jun 24, 2020

glemaitre commented Jun 25, 2020

ogrisel left a comment

Choose a reason for hiding this comment

glemaitre commented Jun 25, 2020

glemaitre commented Jun 25, 2020

adrinjalali commented Jun 25, 2020

sdpython commented Jun 25, 2020

ogrisel left a comment

Choose a reason for hiding this comment

glemaitre commented Jun 25, 2020

agramfort left a comment

Choose a reason for hiding this comment

agramfort commented Jun 25, 2020

glemaitre commented Jun 26, 2020

sdpython commented Jun 23, 2020•
edited by lesteve
Loading