SFB tabby utils

Overview

This repo contains my own utils for parsing tabby input for DataLad catalog, with specific focus on SFB1451.

The code is functional and presented in the form of argparse-parameterised scripts, but heavily proototype in nature; parts of the code rely on files being saved or having been saved in the code / working directory; parts are unused.

The key scripts are:

load_inbox.py covers reading an excel file or bunch of tsv files, and placing renamed tsv files in target directory
load_tabby.py reads a dataset tabby file and produces a catalog schema translation

Philosophy

A convention for reading the tabby collection is provided with this repository
JSON-LD expansion and compaction is used to align set of terms (roughly) with catalog schema
Processing of values (e.g. reshaping, ensuring type) is done via a set of process-* functions
Optionally, a “temporary” catalog in the working directory can be populated with created entries to allow “live” preview while prototyping. It needs to be created by hand with a permissive config, you can use this one.
Some terms are resolved with API queries, done with simple requests and cached using requests_cache

Install Requirements

“` python -m venv /tmp/my_env source /tmp/my_env/bin/activate

pip install requirements-devel.txt “`

Workflow - rough overview

Scripts are scattered across this repo (tabby-related) and the sfb1451 projects catalog (non-tabby) repo. To add a tabby subdataset to an existing project dataset, the following is needed:

save the tabby files in a dataset (ideally recursively, so that superdataset is updated too)

extract and add the project metadata (tabby addition means it is in a new version):

python .../extract_project.py PROJECTDIR OUTDIR
datalad catalog-add -c ... -F ... -m ...

“inject” metadata that does not come from any extractor (keywords, etc)
```
python .../inject_metadata.py --funding --keywords PROJECTDIR
    
```

finally, add tabby metadata:

python .../load_tabby.py --catalog ... TABBYFILE

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
conventions/tby-crc1451v0		conventions/tby-crc1451v0
.gitignore		.gitignore
README.org		README.org
gitworktree2tabby.py		gitworktree2tabby.py
initial-tagging.py		initial-tagging.py
list_directory.py		list_directory.py
list_files.py		list_files.py
load_inbox.py		load_inbox.py
load_subdatasets.py		load_subdatasets.py
load_tabby.py		load_tabby.py
lookup_tables.toml		lookup_tables.toml
mock_dataset.py		mock_dataset.py
queries.py		queries.py
requirements-devel.txt		requirements-devel.txt
status2tabby.py		status2tabby.py
superds-config.json		superds-config.json
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SFB tabby utils

Overview

Philosophy

Install Requirements

Workflow - rough overview

About

Releases

Packages

Contributors 2

Languages

sfb1451/tabby-utils

Folders and files

Latest commit

History

Repository files navigation

SFB tabby utils

Overview

Philosophy

Install Requirements

Workflow - rough overview

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages