Refactor: reformatting python code across all the source files #73

neomatrix369 · 2023-03-12T21:26:00Z

To be able to merge a pull request, there are a few checks:

Checklist

Please check the options that you have completed and strike out the options that do not apply to this pull request:

a clear title and description to the Pull Request has been provided
you have read
- the Contributing doc
- the Developer Guide
the pull request passes the tests (./test-coverage "tests slow-tests") - this will also be visible via the Code coverage report and CI/CD task on the Pull Request
you have performed some kind of smoke test by running your changes in an isolated environment, e.g. a Docker container, Google Colab, or Kaggle
~~- [ ] the notebooks are updated (see notebooks folder, read the Notebooks docs)~~
CHANGELOG.md has been updated (please follow the existing format)

Goal or purpose of the PR

Minor fixes and code formatting

Changes implemented in the PR

Formatted all Python code and fixed minor typos in the docs. Ran Black across the code base to make the code structure consistent. Applied refactorings suggested by Sourcery.ai across all the source files.

sourcery-ai · 2023-03-12T21:26:11Z

Sourcery Code Quality Report

❌ Merging this PR will decrease code quality in the affected files by 3.04%.

Quality metrics	Before	After	Change
Complexity	0.89 ⭐	0.75 ⭐	-0.14 👍
Method Length	37.16 ⭐	39.41 ⭐	2.25 👎
Working memory	5.00 ⭐	5.96 ⭐	0.96 👎
Quality	87.20% ⭐	84.16% ⭐	-3.04% 👎

Other metrics	Before	After	Change
Lines	1997	2483	486
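The Change column in the two tables above is the After value minus the Before value. A minimal sketch that rechecks the arithmetic with the figures copied from the report; the `metric_change` helper is a made-up name for illustration and is not part of Sourcery:

```python
# Before/after values copied from the Sourcery report above.
METRICS = {
    "Complexity": (0.89, 0.75),
    "Method Length": (37.16, 39.41),
    "Working memory": (5.00, 5.96),
    "Quality": (87.20, 84.16),
    "Lines": (1997, 2483),
}


def metric_change(before, after):
    """Return after - before, rounded to two decimals (hypothetical helper)."""
    return round(after - before, 2)


changes = {name: metric_change(b, a) for name, (b, a) in METRICS.items()}

for name, delta in changes.items():
    print(f"{name}: {delta:+}")
```

Note that the quality change is a difference in percentage points (87.20% to 84.16%), not a relative decrease.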

Changed files	Quality Before	Quality After	Quality Change
setup.py	67.46% 🙂	67.46% 🙂	0.00%
nlp_profiler/__init__.py	100.00% ⭐	100.00% ⭐	0.00%
nlp_profiler/constants.py	80.17% ⭐	80.17% ⭐	0.00%
nlp_profiler/core.py	63.10% 🙂	64.79% 🙂	1.69% 👍
nlp_profiler/generate_features/__init__.py	71.82% 🙂	72.44% 🙂	0.62% 👍
nlp_profiler/generate_features/parallelisation_methods/__init__.py	90.29% ⭐	91.42% ⭐	1.13% 👍
nlp_profiler/granular_features/__init__.py	75.47% ⭐	75.47% ⭐	0.00%
nlp_profiler/granular_features/alphanumeric.py	97.09% ⭐	94.76% ⭐	-2.33% 👎
nlp_profiler/granular_features/chars_spaces_and_whitespaces.py	94.66% ⭐	91.52% ⭐	-3.14% 👎
nlp_profiler/granular_features/dates.py	90.18% ⭐	88.11% ⭐	-2.07% 👎
nlp_profiler/granular_features/emojis.py	93.69% ⭐	93.93% ⭐	0.24% 👍
nlp_profiler/granular_features/english_non_english_chars.py	94.86% ⭐	90.69% ⭐	-4.17% 👎
nlp_profiler/granular_features/letters.py	97.09% ⭐	94.76% ⭐	-2.33% 👎
nlp_profiler/granular_features/non_alphanumeric.py	97.09% ⭐	94.76% ⭐	-2.33% 👎
nlp_profiler/granular_features/noun_phrase_count.py	87.32% ⭐	85.95% ⭐	-1.37% 👎
nlp_profiler/granular_features/numbers.py	97.09% ⭐	94.76% ⭐	-2.33% 👎
nlp_profiler/granular_features/punctuations.py	90.93% ⭐	88.44% ⭐	-2.49% 👎
nlp_profiler/granular_features/stop_words.py	93.13% ⭐	93.52% ⭐	0.39% 👍
nlp_profiler/granular_features/words.py	97.09% ⭐	94.76% ⭐	-2.33% 👎
nlp_profiler/high_level_features/__init__.py	85.89% ⭐	85.89% ⭐	0.00%
nlp_profiler/high_level_features/ease_of_reading_check.py	85.73% ⭐	86.56% ⭐	0.83% 👍
nlp_profiler/high_level_features/sentiment_polarity.py	86.78% ⭐	87.66% ⭐	0.88% 👍
nlp_profiler/high_level_features/sentiment_subjectivity.py	86.78% ⭐	87.92% ⭐	1.14% 👍
slow-tests/acceptance_tests/test_apply_text_profiling.py	89.13% ⭐	89.13% ⭐	0.00%
slow-tests/performance_tests/test_perf_ease_of_reading_check.py	99.17% ⭐	99.17% ⭐	0.00%
slow-tests/performance_tests/test_perf_grammar_check.py	99.17% ⭐	99.17% ⭐	0.00%
slow-tests/performance_tests/test_perf_granular_features.py	98.83% ⭐	98.83% ⭐	0.00%
slow-tests/performance_tests/test_perf_noun_phrase.py	99.17% ⭐	99.17% ⭐	0.00%
slow-tests/performance_tests/test_perf_spelling_check.py	99.17% ⭐	99.17% ⭐	0.00%
tests/common_functions.py	72.84% 🙂	72.84% 🙂	0.00%
tests/acceptance_tests/test_apply_text_profiling.py	88.45% ⭐	88.45% ⭐	0.00%
tests/granular/test_alphanumeric.py	94.51% ⭐	94.11% ⭐	-0.40% 👎
tests/granular/test_chars_and_spaces.py	80.81% ⭐	80.81% ⭐	0.00%
tests/granular/test_dates.py	94.84% ⭐	94.26% ⭐	-0.58% 👎
tests/granular/test_duplicates.py	95.65% ⭐	94.85% ⭐	-0.80% 👎
tests/granular/test_emojis.py	95.02% ⭐	94.50% ⭐	-0.52% 👎
tests/granular/test_english_non_english_characters.py	90.40% ⭐	70.81% 🙂	-19.59% 👎
tests/granular/test_non_alphanumeric.py	94.10% ⭐	93.24% ⭐	-0.86% 👎
tests/granular/test_nounphrase.py	%	90.76% ⭐	%
tests/granular/test_numbers.py	85.25% ⭐	85.20% ⭐	-0.05% 👎
tests/granular/test_punctuations.py	90.73% ⭐	89.94% ⭐	-0.79% 👎
tests/granular/test_repeated_digits.py	90.40% ⭐	77.69% ⭐	-12.71% 👎
tests/granular/test_repeated_letters.py	90.40% ⭐	79.93% ⭐	-10.47% 👎
tests/granular/test_repeated_punctuations.py	90.40% ⭐	70.55% 🙂	-19.85% 👎
tests/granular/test_sentences.py	89.16% ⭐	89.02% ⭐	-0.14% 👎
tests/granular/test_stop_words.py	95.02% ⭐	94.50% ⭐	-0.52% 👎
tests/granular/test_syllables.py	90.40% ⭐	74.74% 🙂	-15.66% 👎
tests/granular/test_white_spaces.py	80.81% ⭐	80.81% ⭐	0.00%
tests/granular/test_words.py	94.86% ⭐	94.37% ⭐	-0.49% 👎
tests/high_level/test_ease_of_reading_check.py	87.93% ⭐	70.29% 🙂	-17.64% 👎
tests/high_level/test_grammar_check.py	87.91% ⭐	87.04% ⭐	-0.87% 👎
tests/high_level/test_sentiment_polarity.py	79.18% ⭐	79.18% ⭐	0.00%
tests/high_level/test_sentiment_subjectivity.py	79.18% ⭐	79.18% ⭐	0.00%
tests/high_level/test_spelling_check.py	74.59% 🙂	74.59% 🙂	0.00%

Here are some functions in these files that still need a tune-up:

File	Function	Complexity	Length	Working Memory	Quality	Recommendation
tests/common_functions.py	internal_assert_benchmark	1 ⭐	136 😞	13 😞	58.26% 🙂	Try splitting into smaller methods. Extract out complex expressions
tests/common_functions.py	generate_data	0 ⭐	80 🙂	16 ⛔	62.86% 🙂	Extract out complex expressions
nlp_profiler/core.py	apply_text_profiling	5 ⭐	148 😞	7 🙂	64.79% 🙂	Try splitting into smaller methods
nlp_profiler/generate_features/__init__.py	generate_features	2 ⭐	63 🙂	10 😞	72.44% 🙂	Extract out complex expressions
nlp_profiler/granular_features/__init__.py	apply_granular_features	0 ⭐	120 😞	6 ⭐	75.47% ⭐	Try splitting into smaller methods
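The two recommendations in the table, splitting long functions and extracting complex expressions, can be illustrated with a toy example. This is only a sketch: the functions below are invented for illustration and do not come from nlp_profiler, although the NaN-for-non-string behaviour mirrors what one of the commits in this PR describes.

```python
import math

VOWELS = set("aeiouAEIOU")


def count_vowels_inline(text):
    # Before: the guard and the counting logic share one dense expression.
    return math.nan if not isinstance(text, str) else len([c for c in text if c in VOWELS])


def is_vowel(char):
    """Extracted expression: gives the membership test a name."""
    return char in VOWELS


def count_vowels(text):
    # After: guard clause first, then a short body built from the named helper.
    if not isinstance(text, str):
        return math.nan
    return sum(1 for char in text if is_vowel(char))
```

Both versions return the same results; the second keeps fewer things in the reader's head at once, which is roughly what the working memory metric is meant to capture.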

Legend and Explanation

The emojis denote the absolute quality of the code:

⭐ excellent
🙂 good
😞 poor
⛔ very poor

The 👍 and 👎 indicate whether the quality has improved or gotten worse with this pull request.

Please see our documentation here for details on how these metrics are calculated.

codecov · 2023-03-13T04:24:49Z

Codecov Report

Patch coverage: 100.00% and no project coverage change.

Comparison is base (a3538c6) 100.00% compared to head (7caeb47) 100.00%.

❗ Current head 7caeb47 differs from pull request most recent head def1ee8. Consider uploading reports for the commit def1ee8 to get more accurate results

Additional details and impacted files

@@            Coverage Diff            @@
##            master       #73   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           26        26           
  Lines          498       439   -59     
  Branches        74        45   -29     
=========================================
- Hits           498       439   -59
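The diff above stays at 100% even though 59 lines were removed, because hits and lines dropped together. A small sketch using the figures from the diff, assuming coverage is computed as hits divided by lines; `coverage_percent` is an illustrative name, not a Codecov API:

```python
def coverage_percent(hits, lines):
    """Line coverage as a percentage (hypothetical helper)."""
    if lines == 0:
        raise ValueError("no lines to cover")
    return 100.0 * hits / lines


# Figures copied from the coverage diff above.
base = {"lines": 498, "hits": 498}
head = {"lines": 439, "hits": 439}

base_cov = coverage_percent(base["hits"], base["lines"])
head_cov = coverage_percent(head["hits"], head["lines"])
line_delta = head["lines"] - base["lines"]

print(f"base {base_cov:.2f}%, head {head_cov:.2f}%, lines {line_delta:+d}")
```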

Impacted Files	Coverage Δ
nlp_profiler/constants.py	`100.00% <100.00%> (ø)`
nlp_profiler/core.py	`100.00% <100.00%> (ø)`
nlp_profiler/generate_features/__init__.py	`100.00% <100.00%> (ø)`
...erate_features/parallelisation_methods/__init__.py	`100.00% <100.00%> (ø)`
nlp_profiler/granular_features/__init__.py	`100.00% <100.00%> (ø)`
nlp_profiler/granular_features/alphanumeric.py	`100.00% <100.00%> (ø)`
.../granular_features/chars_spaces_and_whitespaces.py	`100.00% <100.00%> (ø)`
nlp_profiler/granular_features/dates.py	`100.00% <100.00%> (ø)`
nlp_profiler/granular_features/emojis.py	`100.00% <100.00%> (ø)`
...ler/granular_features/english_non_english_chars.py	`100.00% <100.00%> (ø)`
... and 11 more

neomatrix369 · 2023-03-13T04:28:30Z

Currently blocked by a Windows Unicode error that is failing the Windows runners, as per https://github.com/neomatrix369/nlp_profiler/actions/runs/4401367033/jobs/7707511237
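The run log is not reproduced in this thread, so the exact error is not shown. As an assumption, a common cause on Windows runners is that Python falls back to a legacy code page such as cp1252, which cannot represent emojis or many non-English characters, while UTF-8 can. A minimal sketch of that failure mode; the sample text and the `can_encode` helper are made up for illustration:

```python
def can_encode(text, encoding):
    """Return True if text can be encoded with the given codec (hypothetical helper)."""
    try:
        text.encode(encoding)
        return True
    except UnicodeEncodeError:
        return False


sample = "NLP profiler \U0001F600"  # ends with an emoji

utf8_ok = can_encode(sample, "utf-8")     # UTF-8 covers all of Unicode
cp1252_ok = can_encode(sample, "cp1252")  # legacy Windows code page

print(f"utf-8: {utf8_ok}, cp1252: {cp1252_ok}")
```

Python documents the `PYTHONUTF8=1` and `PYTHONIOENCODING=utf-8` environment variables for forcing UTF-8 in such environments; whether these are the exact variables set in this PR's CI changes is an assumption.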

neomatrix369 · 2023-03-13T05:01:37Z

#73 (comment) is fixed by 378458f, 44bcc4, 1771150

neomatrix369 · 2023-03-13T05:04:31Z

Pending before this PR can be merged: Sourcery refactoring fixes and the other checks listed in the PR description

neomatrix369 · 2023-03-13T05:37:44Z

#73 (comment) is now resolved. Next: manual checks of the notebooks and smoke tests

neomatrix369 · 2023-03-13T12:15:06Z

Logged a regression issue #78 on the back of reviewing the notebooks

Refactor: reformatting python code across all the source files

4afb2a9

neomatrix369 added the documentation, enhancement, and code-quality labels Mar 12, 2023

sourcery-ai bot mentioned this pull request Mar 12, 2023

Refactor: reformatting python code across all the source files (Sourcery refactored) #74

Merged

neomatrix369 added 15 commits March 12, 2023 22:07

Fixing grammar check and ease of reading tests and implementation behind it

efe4f23

Revert "Spelling checker has been modified"

f37b2f5

Fixed failing slow-tests (grammar related). Also fixed the logic how grammar check score was computed.

eccfd08

Github workflow: removing support for 3.6 and updating dependent docs

6c1808c

Fixing syllables tests as library is returning a different count than previously

ce090f2

Upgraded swifter to 1.0.5 for now, pinned it to this version

f317a04

Tests: added a new test to cover for non-string inputs and return NaN

7e0e6e3

Tests: fixed slow and a high-level tests related to Ease of reading, as the library returned different results since its been upgraded.

250bd42

Tests: fixed acceptance tests covering syllables count, ease of reading related tests by updating the data, as expected results changed

b3cc1f7

Dependencies: update swifter version to 1.0.5, and pandas to 1.3.4, both pinned to this version to check if they work fine across the board

4e6a0f5

Installation setup: removing 3.6 support as other dependencies have moved further

fb9a972

Dependencies: pinning joblib & pandas version to reduce moving parts, swifter has issues with certain versions of pandas

97bd1a5

CI: special install for Python 3.8/Windows environment, setting unicode encoding to utf-8

70be76b

CI: special setup for Python 3.7/Windows environment, setting unicode encoding to utf-8

364be06

Tests: fixed test syllables test, change from 18 to 17

ba23ad5

neomatrix369 force-pushed the reformating-code-and-minor-fixes branch from 6ca2568 to ba23ad5 Compare March 13, 2023 03:43

neomatrix369 added 2 commits March 13, 2023 03:45

Merge branch 'master' into reformating-code-and-minor-fixes

e8b6aba

Test coverage: fixed the coverage issue in ease of reading check to get coverage back to 100

8963949

neomatrix369 added 3 commits March 13, 2023 04:36

CI: passing the Windows/Python specific Unicode environment variables before running the testscript

378458f

CI: passing yet another the Windows/Python specific Unicode environment variable to enable UTF-8

1771150

Github Action: removing unneeded env variable settings

44bcc43

'Refactored by Sourcery'

b129430

neomatrix369 added 2 commits March 13, 2023 05:29

Sourcery refactoring: removed/ignored/disabled sourcery refactorings as they fail to compile or fail tests unnecessarily

1bdd5a9

Sourcery refactoring: removed/ignored/disabled sourcery refactorings around sentiment analysis as we loose code coverage as a result

fc0de80

neomatrix369 added 2 commits March 13, 2023 11:23

Dependencies: removing language_tool_python as we are no longer using it from grammar checks or anything else

7caeb47

CHANGELOG.md: adding entries to the change log to record changes made to the library via PRs [skip ci]

def1ee8

neomatrix369 self-assigned this Mar 13, 2023

neomatrix369 merged commit f9cb2e6 into master Mar 13, 2023

neomatrix369 added a commit that referenced this pull request Mar 13, 2023

Merge pull request #73 from neomatrix369/reformating-code-and-minor-fixes Refactor: reformatting python code across all the source files

93a4cd9
