Skip to content

philiptham2546/extractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 

Repository files navigation

This is an extractor and text-ify tool to scrape for UN websites from UN pages.

Steps to run

Either go to this link or download extractor.js and extractor.html and open the html file in your browser.

Description

Extracts the title of official and open UN documents from a given UN website link based on element type. Can then input filters for inclusion or exclusion based on the type of UN document (ie working paper, resolution, declaration etc). Then converts the collection of UN documents into formatted text that can be copy-and-pasted into working documents. Note of caution: the extractor relies on the element type of documents on the UN website, might differ from UN agency to agency.

Example usage on UN OEWG on ICT and international security documents page

Link to UN page: UN OEWG on ICTs
    Filter function
Screenshot 2024-10-28 at 23 00 24
    Convert from table to UN formatted text
Screenshot 2024-10-28 at 15 56 37