Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

community: add init for UnstructuredHTMLLoader to solve pathlib paths #29091

Merged
merged 1 commit into from
Jan 8, 2025

Conversation

Marsman1996
Copy link
Contributor

Description

Add __init__ for UnstructuredHTMLLoader to restrict the input type to str or Path, and transfer the self.file_path to str just like UnstructuredXMLLoader does.

Issue

Fix #29090

Dependencies

No changes.

@dosubot dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Jan 8, 2025
Copy link

vercel bot commented Jan 8, 2025

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
langchain ✅ Ready (Inspect) Visit Preview 💬 Add feedback Jan 8, 2025 11:38am

@dosubot dosubot bot added community Related to langchain-community Ɑ: doc loader Related to document loader module (not documentation) labels Jan 8, 2025
@dosubot dosubot bot added the lgtm PR looks good. Use to confirm that a PR is ready for merging. label Jan 8, 2025
@ccurme ccurme merged commit 2b09f79 into langchain-ai:master Jan 8, 2025
18 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
community Related to langchain-community Ɑ: doc loader Related to document loader module (not documentation) lgtm PR looks good. Use to confirm that a PR is ready for merging. size:S This PR changes 10-29 lines, ignoring generated files.
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

UnstructuredHTMLLoader fail when given Path type document
2 participants