This is software created to learn what sensitive information can be extracted from a website.
This software work simply: you insert the URL of the website then the software will automatically extract the data and store it in a log file. The log file is structured in the way explained below.
The data that this software will extract are:
- Metadata
- Technologies used
- Contacts
- Name of people (With the relative role if specified)
- Pages with forms
This software is created with the only purpose of learning, I am not responsible for any malicious usage of the software.
- You have to install Python on your device if you haven't already.
- You have to install Nmap
- Download this repository and extract it into a folder
- Use the command
pip install -r requirements.txt
to install the dependencies needed - Run the
main.py
file running the commandpy main.py
in the terminal - You are asked to insert a URL of the website, insert the URL of the home page
The logs are stored in the folder /logs
inside the project directory.
Logs are stored in md
files cause this format allows to create easy to read documents.
This section will include all the sensitive files that can be founded on a website such as sitemap.xml and robots.txt.
A whois of the site
Ip address
OS and other information about the target
All the technologies that can be detected, for example, GSAP, ReactJS
Each page found on the website will be analyzed then the information will be reported in the log file as follows:
All the metadata of the current page
List of all the links present on the page
If the page includes forms/inputs and information about these forms/inputs
The cookies that are saved when you are on the page
By contacts, we mean emails, social links, etc. All the information that regard the contacts that are found on the page
Names of people (With the relative role if specified)
- Create the
logs
folder automatically - Update
requirements.txt
and remove unused dependencies - Update
README.md
to make it more concise and clear - Add template to create pull requests and add documentation for contributors
- Create proper error handling
- Implement something like DirBuster
- Implement other useful Nmap scans
- Improve / add new features to
contacts
function