Current WIP for rppd-to-lobid transformation (RPB-102) #56

Merged: 52 commits, Nov 6, 2023

fsteeg (Member) commented Oct 30, 2023

maipet and others added 30 commits October 24, 2023 14:30
Co-authored-by: maipet <maipet@users.noreply.github.com>
- Prefer GND IDs and namespace for `id` field
- Add `type` field with hard-coded values
- Set filenames and indexing settings
We are using the links of the GND explorer here, since we do not have the pictures yet.
Using:
1) `cat conf/RPB-Export_HBZ_Bio.txt | grep -a '#1wz' | tail -n 1 >> conf/RPB-Export_HBZ_Bio_Test.txt`
2) `sbt "runMain rpb.ETL conf/rppd-to-strapi.flux"`
3) `sbt "runMain rpb.ETL conf/rppd-test-to-lobid.flux"`
"variantName[]",
"periodOfActivity[]",
"placeOfActivity[]",
"gndSubjectCategory[]"
Contributor

It would be good to sort the values here alphabetically.

conf/rppd-to-lobid.fix (outdated, resolved)
TobiasNx marked this pull request as ready for review November 3, 2023 07:56
fsteeg (Member, Author) commented Nov 3, 2023

Are the *.txt files for debugging/development? Are they intentionally checked in?

  • conf/RPB-title-source-path.txt
  • conf/RPB-title-strapi-data-path.txt
  • conf/lobidGnd-path.txt
  • conf/rppd-source-path.txt
  • conf/rppd-strapi-data-paths.txt

TobiasNx (Contributor) commented Nov 3, 2023

Yes, they are helper files that provide the paths for quickly checking and changing transformations.
Not sure if we should put them in a separate folder.

fsteeg (Member, Author) commented Nov 3, 2023

Alright, then I'd say we leave them where they are for now. You mentioned we should clean up the conf folder anyway, so maybe we do that in a separate PR, with something like subfolders for each data source (rpb, sw, rppd, etc.).

TobiasNx (Contributor) commented Nov 3, 2023

+1

fsteeg merged commit 014b9b5 into main on Nov 6, 2023
1 check passed
3 participants