Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Crawling of stanford dataspace, and simple indexes #11

Closed
wants to merge 4 commits into from

Conversation

yarikoptic
Copy link
Member

@yarikoptic yarikoptic commented Jul 31, 2018

@yarikoptic
Copy link
Member Author

some failures are due to the bug somewhere in twisted or scrapy leading to
TypeError (attrib() got an unexpected keyword argument 'converter')
which I recently observed elsewhere but I think it was resolved via upgrades... so not sure what to do for travis. Will fixup for the rogue pdb now

@codecov-io
Copy link

codecov-io commented Aug 1, 2018

Codecov Report

Merging #11 into master will decrease coverage by 15.72%.
The diff coverage is 74.6%.

Impacted file tree graph

@@             Coverage Diff             @@
##           master      #11       +/-   ##
===========================================
- Coverage   86.44%   70.71%   -15.73%     
===========================================
  Files          51       51               
  Lines        4130     4180       +50     
===========================================
- Hits         3570     2956      -614     
- Misses        560     1224      +664
Impacted Files Coverage Δ
datalad_crawler/nodes/crawl_url.py 78.82% <100%> (-11.43%) ⬇️
datalad_crawler/pipeline.py 74.27% <100%> (-8.08%) ⬇️
...awler/pipelines/tests/test_simple_with_archives.py 54.83% <26.66%> (-45.17%) ⬇️
datalad_crawler/pipelines/simple_with_archives.py 75.55% <75%> (-6.27%) ⬇️
datalad_crawler/nodes/matches.py 89.18% <94.11%> (+1.31%) ⬆️
datalad_crawler/pipelines/tests/test_openfmri.py 28.08% <0%> (-63.27%) ⬇️
datalad_crawler/pipelines/balsa.py 34.73% <0%> (-61.06%) ⬇️
datalad_crawler/dbs/versions.py 45.45% <0%> (-52.28%) ⬇️
datalad_crawler/pipelines/tests/test_balsa.py 51.78% <0%> (-48.22%) ⬇️
datalad_crawler/nodes/annex.py 47.09% <0%> (-34.58%) ⬇️
... and 14 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 1684a9f...681e1b2. Read the comment docs.

* origin/master:
  TST: Mark test_simple1 as a known V6 failure
  TST: travis: Add V6 run
  TST: Drop stale known_failure_v6's
  RF: rename simple_with_stanford_lib.py to stanford_lib.py
  BF: crcns - use new datacite interface
  BF(workaround): adjust for absent pruning commits due to --incremental
  BF: need to use "incremental=True" now for aggregate_metadata
  BF: use legacy.openfmri.org
@mih mih mentioned this pull request Dec 3, 2018
2 tasks
@yarikoptic
Copy link
Member Author

elderly effort. IIRC was working but datasets of interest were broken (broken tarballs iirc) anyways. And with no immediate need - abandoned. So let's let it RiP

@yarikoptic yarikoptic closed this Apr 27, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

stanford digital repository
2 participants