Implement scores for `FDatairregular` objects as described in #609 #610

pcuestas · 2024-04-01T17:59:33Z

This pull request depends on #608 (integrating FDataIrregular objects is needed to implement the scores).

As explained in #609 , mean_absolute_error, mean_absolute_percentage_error, mean_squared_error and mean_squared_log_error have been implemented in the case when both y_true and y_pred are FDataIrregular objects.

Test cases have been included to ensure the same score is obtained if the FDataIrregular objects are obtained from FDataGrid's.

(testing included to assert equality with the `FDataGrid` case)

github-actions · 2024-04-01T18:00:05Z

skfda/misc/scoring.py

+    score: FDataIrregular,
+    squared: bool = True,
+    weights: NDArrayFloat | None = None,
+) -> float:


[pep8] _{reported by reviewdog 🐶}
DAR201 Missing "Returns" in Docstring: - return

github-actions · 2024-04-01T18:00:05Z

skfda/misc/scoring.py

@@ -554,6 +605,23 @@
    return _multioutput_score_grid(error, multioutput)


+@mean_absolute_percentage_error.register  # type: ignore[attr-defined, misc]
+def _mean_absolute_percentage_error_fdatairregular(


[pep8] _{reported by reviewdog 🐶}
WPS118 Found too long name: _mean_absolute_percentage_error_fdatairregular > 45

github-actions · 2024-04-01T18:00:05Z

skfda/misc/scoring.py

+    epsilon = np.finfo(np.float64).eps
+
+    if np.any(np.abs(y_true.values) < epsilon):
+        warnings.warn('Zero denominator', RuntimeWarning)


[pep8] _{reported by reviewdog 🐶}
B028 No explicit stacklevel argument found. The warn method from the warnings module uses a stacklevel of 1 by default. This will only show a stack trace for the line on which the warn method is called. It is therefore recommended to use a stacklevel of 2 or greater to provide more information to the user.

github-actions · 2024-04-01T18:00:05Z

skfda/tests/test_scoring.py

@@ -461,3 +469,101 @@ def test_negative_msle(self) -> None:
            y_true_grid,
            y_pred_grid,
        )
+
+
+############### Test irregular data scoring ####################


[pep8] _{reported by reviewdog 🐶}
E266 too many leading '#' for block comment

codecov · 2024-04-01T18:00:41Z

Codecov Report

Attention: Patch coverage is 90.00000% with 6 lines in your changes are missing coverage. Please review.

Project coverage is 86.67%. Comparing base (73161f7) to head (a5b7617).

❗ Current head a5b7617 differs from pull request most recent head 4bd8713. Consider uploading reports for the commit 4bd8713 to get more accurate results

Files	Patch %	Lines
skfda/misc/scoring.py	85.71%	4 Missing ⚠️
skfda/tests/test_scoring.py	93.75%	2 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #610      +/-   ##
===========================================
+ Coverage    86.65%   86.67%   +0.02%     
===========================================
  Files          156      156              
  Lines        13322    13380      +58     
===========================================
+ Hits         11544    11597      +53     
- Misses        1778     1783       +5

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

vnmabus · 2024-04-06T20:00:16Z

skfda/misc/scoring.py

+    The integral of the score is normalized because each integral is divided by
+    the length of the curve's domain.
+
+    If the score is vector-valued, then the mean of each codimension integral


Is this what we want? Is what we do for the other types?

I understand the question is regarding whether to divide by the length of the curve's domain or by the length of the FDataIrregular object's domain. This is the only difference that there is between the results of FDataGrid scores and the FDataIrregular that I implemented. As I said in #609, I think that dividing by each curve's domain length is more accurate, as the integral of that curve is being made only taking into account its particular domain.

No, I meant the treatment of vector-valued functions, but you also raised an interesting point that I didn't notice, and maybe we should discuss in the meeting.

Answering the initial question, then: yes, for other types we also take the mean of each codimension integral in the case of vector-valued functions. I think it is a reasonable design decision.

skfda/misc/scoring.py

skfda/tests/test_scoring.py

vnmabus

I think we can merge it for now, as the only remaining issue is the length we use in the division, and there are other PRs waiting for this to be merged.

I propose to keep the associated issue #609 open and mention in it explicitly the problem with the quotient lengths, to be solved in the future.

pcuestas · 2024-06-30T15:08:10Z

I'll explain the problem with the quotient lengths in #609.

Related to said problem is the design (or definition) of the integral of discretized functional observations (regular or irregular) when the grid endpoints do not coincide with the domain's. I will create another issue to discuss this (edit: this is the issue: #619).

pcuestas added 2 commits April 1, 2024 19:40

Add FDataIrregular to skfda like FDataBasis and FDataGrid

9730d51

Implement scores for FDatairregular objects as described in #609

951dea3

(testing included to assert equality with the `FDataGrid` case)

pcuestas requested a review from vnmabus April 1, 2024 17:59

github-actions bot reviewed Apr 1, 2024

View reviewed changes

pcuestas and others added 2 commits April 1, 2024 20:02

Fix ugly comment

30c807f

Merge branch 'develop' into feature/scoring-fdatairregular

a5b7617

vnmabus requested changes Apr 11, 2024

View reviewed changes

pcuestas added 2 commits April 12, 2024 12:03

Fix tests

e18bc49

Remove global variables from tests

5d3addc

pcuestas requested a review from vnmabus April 12, 2024 10:31

pcuestas added 2 commits April 13, 2024 11:18

Replace Union and Optional

e91419d

Fix possible division by zero

4bd8713

pcuestas mentioned this pull request Jun 14, 2024

Mixed effects model to convert irregular data to basis expansion #618

Open

4 tasks

vnmabus approved these changes Jun 30, 2024

View reviewed changes

pcuestas mentioned this pull request Jun 30, 2024

Scores for FDataIrregular objects #609

Open

Merge branch 'develop' into feature/scoring-fdatairregular

ca0a11c

vnmabus merged commit d19e1bd into develop Jul 5, 2024
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement scores for `FDatairregular` objects as described in #609 #610

Implement scores for `FDatairregular` objects as described in #609 #610

pcuestas commented Apr 1, 2024

github-actions bot Apr 1, 2024

github-actions bot Apr 1, 2024

github-actions bot Apr 1, 2024

github-actions bot Apr 1, 2024

codecov bot commented Apr 1, 2024 •

edited

Loading

vnmabus Apr 6, 2024

pcuestas Apr 12, 2024

vnmabus Apr 12, 2024

pcuestas Apr 13, 2024

vnmabus left a comment

pcuestas commented Jun 30, 2024 •

edited

Loading

Implement scores for FDatairregular objects as described in #609 #610

Implement scores for FDatairregular objects as described in #609 #610

Conversation

pcuestas commented Apr 1, 2024

github-actions bot Apr 1, 2024

Choose a reason for hiding this comment

github-actions bot Apr 1, 2024

Choose a reason for hiding this comment

github-actions bot Apr 1, 2024

Choose a reason for hiding this comment

github-actions bot Apr 1, 2024

Choose a reason for hiding this comment

codecov bot commented Apr 1, 2024 • edited Loading

Codecov Report

vnmabus Apr 6, 2024

Choose a reason for hiding this comment

pcuestas Apr 12, 2024

Choose a reason for hiding this comment

vnmabus Apr 12, 2024

Choose a reason for hiding this comment

pcuestas Apr 13, 2024

Choose a reason for hiding this comment

vnmabus left a comment

Choose a reason for hiding this comment

pcuestas commented Jun 30, 2024 • edited Loading

Implement scores for `FDatairregular` objects as described in #609 #610

Implement scores for `FDatairregular` objects as described in #609 #610

codecov bot commented Apr 1, 2024 •

edited

Loading

pcuestas commented Jun 30, 2024 •

edited

Loading