feat: Added loading method for PyTorch artefact detection models from HF Hub #836

fg-mindee · 2022-02-25T17:19:55Z

Following up on #426, this PR introduces the following modifications:

refactored faster rcnn
added a from_hub function to load artefact detection architectures in PyTorch
added unittest
fixed header unittest

With PyTorch backend, the following snippets are completely equivalent:

# Pretrained
from doctr.models.obj_detection import fasterrcnn_mobilenet_v3_large_fpn
model = fasterrcnn_mobilenet_v3_large_fpn(pretrained=True)

# HF Hub
from doctr.models.obj_detection.factory import from_hub
model = from_hub("mindee/fasterrcnn_mobilenet_v3_large_fpn")

Any feedback is welcome!

charlesmindee

Thanks for this amazing feature! just a small comment because you modified a parameter in the refacto

charlesmindee · 2022-03-03T09:08:49Z

doctr/models/obj_detection/faster_rcnn/pytorch.py

@@ -31,11 +29,11 @@ def _fasterrcnn(arch: str, pretrained: bool, **kwargs: Any) -> FasterRCNN:
        "image_mean": default_cfgs[arch]['mean'],
        "image_std": default_cfgs[arch]['std'],
        "box_detections_per_img": 150,
-        "box_score_thresh": 0.15,
+        "box_score_thresh": 0.5,


Are you positive this change doesn't decrease performances ?

I experimented a bit with the model as is, and this certainly decreases the recall, but the bare predictions got me concerned on the precision side. @SiddhantBahuguna suggested having a class-specific threshold strategy and that might be the best thing to do, but for now, perhaps it might be better to maximize our precision.

Happy to revert this if you think it's best to favor recall @charlesmindee 👌

Let's put it to 0.5 for now, but we should implement quickly a threshold by class if it leads to better results

charlesmindee

Thanks, @SiddhantBahuguna could you check this threshold by class when you have the time to do so ?

felixdittrich92 · 2022-03-07T21:15:08Z

@fg-mindee Can you also add a section into the docs for this feature ? 🤗 Maybe where we can add later links to some models on the hub (this would also fix the different vocab pretrained support i think)

codecov · 2022-03-08T09:21:53Z

Codecov Report

Merging #836 (0c7fc40) into main (c9806fa) will increase coverage by 0.03%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main     #836      +/-   ##
==========================================
+ Coverage   95.91%   95.94%   +0.03%     
==========================================
  Files         131      133       +2     
  Lines        5086     5103      +17     
==========================================
+ Hits         4878     4896      +18     
+ Misses        208      207       -1

Flag	Coverage Δ
unittests	`95.94% <100.00%> (+0.03%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
doctr/models/obj_detection/faster_rcnn/pytorch.py	`100.00% <ø> (ø)`
doctr/models/obj_detection/factory/__init__.py	`100.00% <100.00%> (ø)`
doctr/models/obj_detection/factory/pytorch.py	`100.00% <100.00%> (ø)`
doctr/transforms/modules/base.py	`94.59% <0.00%> (ø)`
doctr/transforms/functional/base.py	`97.10% <0.00%> (+1.44%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update c9806fa...0c7fc40. Read the comment docs.

fg-mindee · 2022-03-08T09:27:45Z

@fg-mindee Can you also add a section into the docs for this feature ? hugs Maybe where we can add later links to some models on the hub (this would also fix the different vocab pretrained support i think)

Hey Felix 👋

This is something we have to address but the documentation is built using the TF backend for now (since most high-level feature are identical) and this PR is only about PyTorch (we have no implementation of faster rcnn in TF in docTR for now) 😅

charlesmindee

Thanks!

SiddhantBahuguna · 2022-03-08T15:27:58Z

Thanks, @SiddhantBahuguna could you check this threshold by class when you have the time to do so ?

Hi @fg-mindee @charlesmindee, actually, since we are using the bare output of the model it lacks post nms strategy.
After observing the output, along with other filter methods I did use thresholds for logos specifically and it did lead to better precision and recall. I will open a PR soon for the same :)

fg-mindee added 5 commits February 25, 2022 18:11

refactor: Refactored FasterRCNN

c4f34f9

feat: Added factory method from_hub

b0f7595

chore: Updated requirements

4d72f95

test: Added unittest

a9399da

chore: Updated mypy config

749111b

fg-mindee added module: models Related to doctr.models ext: tests Related to tests folder framework: pytorch Related to PyTorch backend topic: object detection Related to the task of object detection type: new feature New feature labels Feb 25, 2022

fg-mindee added this to the 0.6.0 milestone Feb 25, 2022

fg-mindee requested review from fharper, charlesmindee and SiddhantBahuguna February 25, 2022 17:19

fg-mindee self-assigned this Feb 25, 2022

fg-mindee mentioned this pull request Feb 25, 2022

Integration with Hugging Face Hub #426

Closed

fg-mindee added 3 commits February 25, 2022 18:59

test: Updated unittests

8b27b6e

test: Fixed unittest

72d41f0

test: Fixed unittest

78884ba

charlesmindee reviewed Mar 3, 2022

View reviewed changes

charlesmindee previously approved these changes Mar 7, 2022

View reviewed changes

feat: Added cfg to model

44f0942

frgfm mentioned this pull request Mar 7, 2022

feat: Added docTR integration for object detection huggingface/huggingface_hub#747

Merged

2 tasks

fg-mindee added 2 commits March 8, 2022 10:14

Merge branch 'main' into obj-det

641031c

test: Fixed unittest

0c7fc40

fg-mindee dismissed charlesmindee’s stale review via 0c7fc40 March 8, 2022 09:15

charlesmindee approved these changes Mar 8, 2022

View reviewed changes

fg-mindee merged commit 9b31588 into main Mar 8, 2022

fg-mindee deleted the obj-det branch March 8, 2022 09:49

frgfm mentioned this pull request Jun 28, 2022

Release tracker - v0.6.0 #791

Closed

85 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Added loading method for PyTorch artefact detection models from HF Hub #836

feat: Added loading method for PyTorch artefact detection models from HF Hub #836

fg-mindee commented Feb 25, 2022 •

edited

Loading

charlesmindee left a comment

charlesmindee Mar 3, 2022

fg-mindee Mar 7, 2022

charlesmindee Mar 7, 2022

charlesmindee left a comment

felixdittrich92 commented Mar 7, 2022

codecov bot commented Mar 8, 2022

fg-mindee commented Mar 8, 2022

charlesmindee left a comment

SiddhantBahuguna commented Mar 8, 2022 •

edited

Loading

feat: Added loading method for PyTorch artefact detection models from HF Hub #836

feat: Added loading method for PyTorch artefact detection models from HF Hub #836

Conversation

fg-mindee commented Feb 25, 2022 • edited Loading

charlesmindee left a comment

Choose a reason for hiding this comment

charlesmindee Mar 3, 2022

Choose a reason for hiding this comment

fg-mindee Mar 7, 2022

Choose a reason for hiding this comment

charlesmindee Mar 7, 2022

Choose a reason for hiding this comment

charlesmindee left a comment

Choose a reason for hiding this comment

felixdittrich92 commented Mar 7, 2022

codecov bot commented Mar 8, 2022

Codecov Report

fg-mindee commented Mar 8, 2022

charlesmindee left a comment

Choose a reason for hiding this comment

SiddhantBahuguna commented Mar 8, 2022 • edited Loading

fg-mindee commented Feb 25, 2022 •

edited

Loading

SiddhantBahuguna commented Mar 8, 2022 •

edited

Loading