Skip to content

javasoze/meaningfulweb

Folders and files

NameName
Last commit message
Last commit date

Latest commit

6e698ed · Feb 26, 2013

History

88 Commits
Jan 25, 2011
Feb 23, 2011
Apr 4, 2011
Feb 20, 2013
Mar 7, 2011
Feb 8, 2011
Jan 24, 2011
Jan 28, 2011
Jan 24, 2011
Jan 24, 2011
Feb 26, 2013
Feb 26, 2013

Repository files navigation

What is Meaningful Web?

We aim to extract structured information from a web resource:

url --> meaningfulweb engine --> structured information

Homepage:

http://www.meaningfulweb.org

Artifacts:

  1. meaningfulweb-opengraph.jar <- open graph parser
  2. meaningfulweb-core.jar <-- core engine
  3. meaningfulweb-app.war <-- web application

Build:

Build and release are managed via Maven: http://maven.apache.org/

  1. build all: under meaningfulweb, do: mvn clean install
  2. start webapp: under meaningfulweb-app/, do: mvn jetty:run

application should be running at: http://localhost:8080/

the rest service should be running at: http://localhost:8080/get-meaning?url=xxx

Example:

http://localhost:8080/get-meaning?url=http://www.google.com

Sample Code:

// extract the best image representing an url

String url = "http://www.google.com"

MetaContentExtractor extractor = new MetaContentExtractor();
MeaningfulWebObject obj = extractor.extractFromUrl(url);

String bestImageURL = obj.getImage();
String title = obj.getTitle();
String description = obj.getDescription();
String domain = obj.getDomain();

...

Bugs:

File bugs here

Releases

No releases published

Packages

No packages published

Languages