Support for a list of urls for web-extract #499

Open
kevinschaper opened this issue Jan 17, 2025 · 0 comments
@kevinschaper
Member

You can definitely take or leave this feature request. For the MIC ingest, I think the ideal workflow would be to provide a list of URLs to web-extract and receive a single pair of kgx node & edge tsv files, with the URL of each source page kept in those files.

That might be a whole bundle of way-too-specific feature requests though, so the alternative would probably be to just patch in file -o support for kgx tsv (I saw a TODO in there), so that the files can then be merged and we can handle iterating over the URLs and merging the output ourselves.
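
To make the fallback concrete, here is a rough sketch of what the merge step on our side might look like, assuming web-extract has already written one kgx tsv file per URL. Nothing here is part of the project: the function name merge_kgx_tsv and the source_url column are made up for illustration, and the actual kgx tsv columns may differ from what this assumes.

```python
import csv
from pathlib import Path


def merge_kgx_tsv(files_by_url: dict[str, Path], out_path: Path,
                  url_column: str = "source_url") -> int:
    """Merge per-URL TSV files into one file, tagging each row with its URL.

    Hypothetical helper: `url_column` is an assumed column name, not a
    KGX-defined one. Returns the number of data rows written.
    """
    rows: list[dict[str, str]] = []
    columns: list[str] = []
    for url, path in files_by_url.items():
        with open(path, newline="", encoding="utf-8") as handle:
            reader = csv.DictReader(handle, delimiter="\t")
            # Collect the union of columns, preserving first-seen order.
            for name in reader.fieldnames or []:
                if name not in columns:
                    columns.append(name)
            for row in reader:
                row[url_column] = url  # record which page the row came from
                rows.append(row)
    if url_column not in columns:
        columns.append(url_column)
    with open(out_path, "w", newline="", encoding="utf-8") as handle:
        # restval fills columns that a given input file did not have.
        writer = csv.DictWriter(handle, fieldnames=columns,
                                delimiter="\t", restval="")
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)
```

The same function would be called once for the node files and once for the edge files. It does no de-duplication, so a node extracted from two pages would appear twice, once per URL.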
