Question about high amount of sleepers/parked domains #79

Closed
ldlac opened this issue Oct 21, 2024 · 5 comments

ldlac commented Oct 21, 2024

Hi, first of all, thanks for your good work, you guys have a huge hosts list!

I was wondering why there are so many sleeper/parked domains in the list. I'm actively building a hosts list from multiple sources, and I found that you have around 31833 sleepers. Are you keeping them just in case they resurface? Is it safe to remove them?

hawkeye116477 (Contributor) commented Oct 21, 2024

We remove parked domains (based on the name servers of parking services => https://github.com/FiltersHeroes/ScriptsPlayground/blob/master/domainParking.txt) and expired domains when our script finds them. How did you find so many?
Some domains can be offline but not yet marked as expired by WHOIS, so they're kept, because they can always come back.
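
A minimal sketch of the name-server-based parking check described above. This is not the project's script: the function names are hypothetical, the list format (one name-server suffix per line, `#` for comments) is an assumption about how such a file might look, and `parking.example` is a placeholder. The NS host names themselves would come from a DNS query, for example with dnspython's `dns.resolver.resolve(domain, "NS")`.

```python
def load_parking_suffixes(lines):
    """Normalize parking name-server entries (assumed: one suffix per line)."""
    suffixes = []
    for line in lines:
        entry = line.strip()
        # Skip blank lines and comment lines (assumed comment syntax).
        if not entry or entry.startswith("#"):
            continue
        suffixes.append(entry.lower().rstrip("."))
    return suffixes


def is_parked(ns_hosts, parking_suffixes):
    """Return True if any NS host equals or is a subdomain of a parking suffix."""
    for ns in ns_hosts:
        host = ns.lower().rstrip(".")
        for suffix in parking_suffixes:
            # Match on a label boundary so "notparking.example" does not match.
            if host == suffix or host.endswith("." + suffix):
                return True
    return False
```

Matching on a label boundary, rather than with a plain substring test, avoids flagging unrelated name servers that merely contain a parking provider's name.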

krystian3w (Contributor) commented Oct 21, 2024

There may also be a problem when a domain has a deliberately broken homepage and returns content worth blocking only for certain unknown parameters or URI paths. (For offline records it is then safer to rely on deletion in WHOIS.)

We certainly don't know such parameters and paths for the CERT Polska entries (they may not be willing to publish the exact URLs from the reports, so that a block is not cut short before its 180-day quarantine is over).

ldlac (Author) commented Oct 21, 2024

Thanks for your responses

I'm currently experimenting with https://pypi.org/project/dnspython/, simply trying to resolve each domain. So they simply do not have an A record, which is why they are still on the list. My bad, I just figured that out.
I also recently discovered https://pypi.org/project/PyFunceble/, which could give me better results.
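
A minimal sketch of why "does not resolve" covers several different situations, assuming dnspython 2.x is installed. The `DnsStatus` enum and both function names are hypothetical; only `dns.resolver.resolve` and the exception classes come from dnspython. A missing A record (`NoAnswer`) or a failed lookup is not the same as a domain that no longer exists (`NXDOMAIN`), and none of these outcomes says anything about WHOIS expiry.

```python
from enum import Enum


class DnsStatus(Enum):
    HAS_A = "has_a"              # at least one A record was returned
    NO_A_RECORD = "no_a_record"  # the name exists but has no A record
    NXDOMAIN = "nxdomain"        # the name does not exist in DNS
    UNKNOWN = "unknown"          # timeout or no usable name server; retry later


def lookup_a(domain):
    """Classify the result of an A-record query for one domain."""
    # Imported lazily so the rest of the sketch works without dnspython.
    import dns.exception
    import dns.resolver

    try:
        dns.resolver.resolve(domain, "A")
        return DnsStatus.HAS_A
    except dns.resolver.NXDOMAIN:
        return DnsStatus.NXDOMAIN
    except dns.resolver.NoAnswer:
        return DnsStatus.NO_A_RECORD
    except (dns.resolver.NoNameservers, dns.exception.Timeout):
        return DnsStatus.UNKNOWN


def split_by_status(domains, lookup=lookup_a):
    """Group domains by DNS status; `lookup` is injectable for offline testing."""
    buckets = {status: [] for status in DnsStatus}
    for domain in domains:
        buckets[lookup(domain)].append(domain)
    return buckets
```

Domains in the `NO_A_RECORD`, `NXDOMAIN`, and `UNKNOWN` buckets are candidates for further checks rather than for immediate removal.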

hawkeye116477 (Contributor) commented Oct 22, 2024

We also use dnspython in our script, but that's just one of the methods. You need more than that to make sure a domain won't come back:
https://github.com/FiltersHeroes/ScriptsPlayground/blob/master/scripts/ECODFF.py
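
A rough sketch of how several signals could be combined into one decision, following the policy stated earlier in this thread (remove parked and expired domains, keep domains that are merely offline). It does not reproduce the linked script; the `DomainChecks` fields and the `verdict` function are hypothetical, and the three booleans are assumed to come from separate DNS, name-server, and WHOIS checks.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainChecks:
    resolves: bool       # an A record was found
    parked: bool         # name servers belong to a parking service
    whois_expired: bool  # WHOIS reports the domain as expired or deleted


def verdict(checks):
    """Return 'remove', 'keep-offline', or 'keep' for one domain."""
    # Parked or expired domains are removed regardless of DNS state.
    if checks.parked or checks.whois_expired:
        return "remove"
    # Offline but still registered: it can come back, so it stays listed.
    if not checks.resolves:
        return "keep-offline"
    return "keep"
```

For example, `verdict(DomainChecks(resolves=False, parked=False, whois_expired=False))` yields `"keep-offline"`, which is the case that made the sleeper count look so high.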

hawkeye116477 closed this as not planned on Oct 22, 2024

github-actions bot commented Jan 1, 2025

This thread was automatically locked because there was no activity after it was closed. Please open a new ticket for related issues.

github-actions bot locked as resolved and limited conversation to collaborators on Jan 1, 2025
3 participants