Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CU-2hz5ump deid mulitprocessing #393

Merged
merged 23 commits into from
Feb 12, 2024
Merged

Conversation

mart-r
Copy link
Collaborator

@mart-r mart-r commented Jan 26, 2024

So I looked into DeID model multiprocessing and it looked like it wasn't working at all.
However, I did end up get multiprocessing to work within tests.

The takeaway right now:

  • After training, the TransformersNER class ends up in a state that makes multiprocessing fail
  • Saving the model on disk and loading it again fixes the issue

So while the multiprocessing works during testing, this is very much a workaround.

PS:
I do not know the actual issue.
I did do a bunch of debugging and all I could find out is that after the train method on TransformersNER is called, it is no longer able to be used as part of the pipe for whatever reason. I did narrow down the issue to below the line 195 in the method. The error I got was this one:
https://gist.github.com/mart-r/0927125010c54fb4d89adabc0cfb00c9
So if anyone knows what the actual fix would be, please do let me know.

@tomolopolis
Copy link
Member

Copy link
Member

@tomolopolis tomolopolis left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@mart-r mart-r merged commit 08570eb into master Feb 12, 2024
5 checks passed
@mart-r mart-r deleted the CU-2hz5ump-deid-mulitprocessing branch March 1, 2024 15:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants