Staatsarchiv-BS Linked Data

This pipeline uses barnard59 to generate triples from a JSON file using a carml mapping.

To generate the mapping files out of the XRM mappings, you can use our Expressive RDF Mapper (XRM) tool.

Quick start

Put your input in a single input.json file in the input directory.

Then run the following:

npm install # install dependencies
npm run start # run the pipeline

You will find the generated triples in the output directory.

input: should contain some JSON files (not included in this repository) ; it will be used by the pipeline to generate triples
mapping: XRM mapping files, to map fields from the JSON files to specific triples
metadata: some static triples that need to be published
node_module: contains the source code of all dependencies required to run the pipeline ; it should never be pushed
output: contain triple files with the generated triples from the pipeline
pipelines: contains the pipeline definition
scripts: some useful scripts
src-gen: the generated mapping file for carml
.gitlab-ci.yml: the GitLab CI pipeline declaration
package.json: specify the version of each dependency that is used and define some useful scripts

The GitLab CI pipeline is doing the following:

github: this step is only run on a push on the main branch. It pushes the content of the repository we have on GitLab to GitHub.
fetch: this step fetches the latest JSON file for all prefixes defined in the scripts/file_prefix.sh file and will store them in the input directory.
process: there are two main jobs for that step:
- metadata: this job generates the triples from the Turtle files (extension .ttl) that are stored in the metadata directory into a output/metadata.nt file. That way everything is converted into the right format and is stored into a single file.
- process: this job splits the JSON files into smaller chunks and run the pipeline on each of them.
store: this step publishes the generated triples from the previous step (every file with .nt extension from the output directory) to the triple store.

Name		Name	Last commit message	Last commit date
Latest commit History 336 Commits
mappings		mappings
metadata		metadata
pipelines		pipelines
scripts		scripts
src-gen		src-gen
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
README.md		README.md
docker-compose.yaml		docker-compose.yaml
package-lock.json		package-lock.json
package.json		package.json