assignment7.md

Output for shakespeare -

  1. 1073319 The slings and arrows of outrageous fortune

Output for wiki -

  1. 8468033 Eurostar Eurostar is a high-speed railway service connecting London with Paris a...
  2. 88418750 Railway platform A railway platform is a section of pathway, alongside rail trac...
  3. 137384453 Andy Bechtolsheim Andreas "Andy" von Bechtolsheim (born September 30, 1955) is a...
  4. 299601817 List of civil parishes in Hampshire This is a list of civil parishes in the cere...
  5. 603734578 Institute for Quantum Computing The Institute for Quantum Computing, or IQC, loc...
  6. 986247632 List of University of Wisconsin–Madison people in academics List of University o…

NOTE -

I believe the "cs489-2016w-esnrRahman-a7-index-shakespeare" HBase table is corrupted because, when I created it, I accidentally ran

byte[] docId = ByteBuffer.allocate(keyPair.getRightElement()).array();

This might have messed it up, since `ByteBuffer.allocate` takes a capacity, so this code allocates far more than 4 bytes. As a result, the HBase table cannot be deleted now. The issue is reported on Piazza: https://piazza.com/class/ii64c6llmtx1xf?cid=319.
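To illustrate the difference, here is a minimal sketch in plain Java (no HBase dependency). The class and method names are hypothetical, and the sample value is only illustrative; it is not taken from the actual indexing code. It contrasts the accidental call, which uses the int as the buffer capacity, with what was presumably intended: a 4-byte buffer holding the int.

```java
import java.nio.ByteBuffer;

// Hypothetical helper class, for illustration only.
public class DocIdEncoding {

    // The accidental form from the note: the int is passed as the buffer
    // capacity, so the result is an array of `docId` zero bytes.
    static byte[] buggyEncode(int docId) {
        return ByteBuffer.allocate(docId).array();
    }

    // Presumably intended form: allocate exactly 4 bytes and write the int into them.
    static byte[] encode(int docId) {
        return ByteBuffer.allocate(Integer.BYTES).putInt(docId).array();
    }

    // Reads the int back from its 4-byte big-endian encoding.
    static int decode(byte[] bytes) {
        return ByteBuffer.wrap(bytes).getInt();
    }

    public static void main(String[] args) {
        int docId = 1073319; // illustrative value
        System.out.println(buggyEncode(docId).length); // 1073319 bytes, all zero
        System.out.println(encode(docId).length);      // 4
        System.out.println(decode(encode(docId)));     // 1073319
    }
}
```

With the HBase client on the classpath, `Bytes.toBytes(int)` from `org.apache.hadoop.hbase.util.Bytes` produces the same 4-byte encoding.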

The script first tries to run shakespeare with my username, and it gets stuck because of the issue described above. Any table created now can be deleted safely, so I suggest running shakespeare with a name other than "esnrRahman". Wiki can be run normally. Apologies for the inconvenience and for messing up the HBase table. I manually ran everything multiple times and deleted the tables from the HBase shell to assure myself that the code works.

I have uploaded a script that runs on Altiscale. You can use it for your tests. P.S. Please change the script's permissions first.

Marks -

| Component | Marks |
| --- | --- |
| BuildInvertedIndexHBase | 10/10 |
| BooleanRetrievalHBase | 5/5 |
| Shakespeare Sample | 5/5 |
| Wiki sample | 5/5 |
| Private | 5/5 |
| Total | 30/30 |