This repository has been archived by the owner on May 12, 2020. It is now read-only.

Commit

fork from datproject
max-mapper committed Sep 22, 2016
1 parent 86f6e1b commit f358e26
Showing 147 changed files with 58,464 additions and 23,416 deletions.
Empty file added .bundle.js
Empty file.
2 changes: 2 additions & 0 deletions .gitignore
@@ -0,0 +1,2 @@
node_modules
.DS_Store
3 changes: 3 additions & 0 deletions bundle.js

File renamed without changes
49 changes: 0 additions & 49 deletions css/style.css

This file was deleted.

3 changes: 3 additions & 0 deletions dev/pre-commit.sh
@@ -0,0 +1,3 @@
browserify -t brfs index.js -o bundle.js
autoless static/styles
echo 'Bundled up!'
Binary file removed fonts/Aleo/Bold Italic/aleo-bolditalic-webfont.eot
Binary file not shown.
785 changes: 0 additions & 785 deletions fonts/Aleo/Bold Italic/aleo-bolditalic-webfont.svg

This file was deleted.

Binary file removed fonts/Aleo/Bold Italic/aleo-bolditalic-webfont.ttf
Binary file not shown.
Binary file removed fonts/Aleo/Bold Italic/aleo-bolditalic-webfont.woff
Binary file not shown.
Binary file not shown.
Binary file removed fonts/Aleo/Bold/aleo-bold-webfont.eot
Binary file not shown.
4,741 changes: 0 additions & 4,741 deletions fonts/Aleo/Bold/aleo-bold-webfont.svg

This file was deleted.

Binary file removed fonts/Aleo/Bold/aleo-bold-webfont.ttf
Binary file not shown.
Binary file removed fonts/Aleo/Bold/aleo-bold-webfont.woff
Binary file not shown.
Binary file removed fonts/Aleo/Bold/aleo-bold-webfont.woff2
Binary file not shown.
Binary file removed fonts/Aleo/Light Italic/aleo-lightitalic-webfont.eot
Binary file not shown.
4,501 changes: 0 additions & 4,501 deletions fonts/Aleo/Light Italic/aleo-lightitalic-webfont.svg

This file was deleted.

Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file removed fonts/Aleo/Light/aleo-light-webfont.eot
Binary file not shown.
4,300 changes: 0 additions & 4,300 deletions fonts/Aleo/Light/aleo-light-webfont.svg

This file was deleted.

Binary file removed fonts/Aleo/Light/aleo-light-webfont.ttf
Binary file not shown.
Binary file removed fonts/Aleo/Light/aleo-light-webfont.woff
Binary file not shown.
Binary file removed fonts/Aleo/Light/aleo-light-webfont.woff2
Binary file not shown.
Binary file removed fonts/Aleo/Regular Italic/aleo-italic-webfont.eot
Binary file not shown.
4,645 changes: 0 additions & 4,645 deletions fonts/Aleo/Regular Italic/aleo-italic-webfont.svg

This file was deleted.

Binary file removed fonts/Aleo/Regular Italic/aleo-italic-webfont.ttf
Binary file not shown.
Binary file removed fonts/Aleo/Regular Italic/aleo-italic-webfont.woff
Binary file not shown.
Binary file removed fonts/Aleo/Regular Italic/aleo-italic-webfont.woff2
Binary file not shown.
Binary file removed fonts/Aleo/Regular/aleo-regular-webfont.eot
Binary file not shown.
4,373 changes: 0 additions & 4,373 deletions fonts/Aleo/Regular/aleo-regular-webfont.svg

This file was deleted.

Binary file removed fonts/Aleo/Regular/aleo-regular-webfont.ttf
Binary file not shown.
Binary file removed fonts/Aleo/Regular/aleo-regular-webfont.woff
Binary file not shown.
Binary file removed fonts/Aleo/Regular/aleo-regular-webfont.woff2
Binary file not shown.
22 changes: 0 additions & 22 deletions index.html

This file was deleted.

Empty file added index.js
Empty file.
71 changes: 71 additions & 0 deletions package.json
@@ -0,0 +1,71 @@
{
"name": "dat-website",
"version": "1.0.0",
"description": "dat-data.com source",
"main": "index.js",
"scripts": {
"start": "npm run build-css && node render.js && npm run server",
"server": "nodemon --watch templates --watch scss -e scss,js,jade,html -i rendered --exec \"npm run build-css && node render.js && http-server rendered\"",
"test": "node test/run.js | tap-spec",
"watch": "watchify -t brfs index.js -o bundle.js",
"build-css": "node-sass --importer node_modules/node-sass-magic-importer scss/base.scss static/css/base.css",
"deploy": "gh-pages-deploy",
"render": "node render.js"
},
"keywords": [
"data",
"datasets",
"publish",
"registry",
"dathub",
"dat",
"web"
],
"gh-pages-deploy": {
"staticpath": "rendered",
"cname": "dat-data.com",
"prep": [
"build-css",
"render"
],
"noprompt": false
},
"author": "dat project",
"license": "BSD",
"bugs": {
"url": "https://github.com/datproject/website/issues"
},
"homepage": "http://dat-data.com",
"dependencies": {
"cheerio": "^0.19.0",
"cptar": "^1.0.0",
"dat-design": "^1.2.11",
"gh-pages-deploy": "^0.3.0",
"handlebars": "^4.0.5",
"jquery": "^2.1.1",
"marked": "^0.3.5",
"mkdirp": "^0.5.1",
"node-sass": "^3.8.0",
"node-sass-magic-importer": "^0.1.4",
"nodemon": "^1.2.1",
"page": "^1.4.1",
"rimraf": "^2.4.4",
"rss": "^1.2.1",
"xhr": "^2.0.3"
},
"devDependencies": {
"page-router": "^1.0.1",
"autoless": "^0.1.5",
"brfs": "^1.4.0",
"browserify": "^7.0.0",
"dom": "0.0.3",
"gravatar": "^1.1.1",
"http-server": "^0.8.0",
"mustache": "^2.0.0",
"watchify": "^2.1.1"
},
"repository": {
"type": "git",
"url": "git+https://github.com/datproject/website.git"
}
}
71 changes: 71 additions & 0 deletions posts.js
@@ -0,0 +1,71 @@
module.exports = [
{
// date: '2016-07-07',
title: 'Announcing dat.land online demo',
author: 'Karissa McKelvey',
teaser: 'Live peer to peer sync in the browser with http://dat.land',
name: '2016-07-07-announcing-dat-land'
},
{
date: '2016-05-16',
title: 'Key Elements of Distributing Data',
author: 'Karissa McKelvey',
teaser: 'How the current tools fall short by missing key elements.',
name: '2016-05-16-key-elements'
},
{
title: 'Dat 1.0 is ready',
author: 'Karissa McKelvey',
teaser: 'After years of R&D, the release candidate is ready.',
name: '2016-02-01-dat-1',
date: '2016-02-05'
},
{
title: 'Announcing Publicbits.org',
author: 'Karissa McKelvey',
teaser: 'Breaking down open data silos with a new Knight Foundation grant.',
name: '2016-02-01-announcing-publicbits',
date: '2016-02-01'
},
{
title: 'The likely collapse of a major proprietary hosting service',
author: 'Karissa McKelvey',
teaser: 'The beginning of the end -- and what it means for science.',
name: '2015-12-01-proprietary'
},
{
date: '2016-01-19',
title: 'A Brief History of Dat',
author: 'Max Ogden',
teaser: 'History of Dat and the design choices over the last 2 years',
name: '2016-01-19-brief-history-of-dat'
},
{
date: '2015-07-29',
title: 'Dat goes beta!',
author: 'Karissa McKelvey',
teaser: 'After a long year of alpha testing, dat goes beta.',
name: '2015-07-29-dat-beta'
},
{
date: '2015-04-03',
title: 'Sloan Redoubles Dat Funding',
author: 'Waldo Jacquith',
teaser: 'Announcing a generous $640,000 grant from the Sloan foundation',
name: '2015-04-03-sloan'
},
{
date: '2014-08-19',
title: 'Announcing the Dat Alpha',
author: 'Max Ogden',
teaser: 'The first major version of dat along with a new website.',
name: '2014-08-19-dat-alpha'
},
{
date: '2014-04-02',
title: 'Sloan funding Dat development',
author: 'Waldo Jacquith',
teaser: 'Announcing Sloans support of the dat project',
name: '2014-04-02-dat'
}
]
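A note on the `posts.js` list above: two entries have no active `date` field (in the first it is commented out, and the "proprietary hosting service" post omits it), and the array is not strictly in date order. The diff does not show how `render.js` consumes this list, so the following is only a minimal sketch, assuming a consumer wants every post dated and sorted newest first. The helper name `withDates` is hypothetical and not part of the commit; it falls back to the `YYYY-MM-DD` prefix that every `name` in the list carries.

```javascript
// Hypothetical helper (not part of the commit): fill in a missing `date`
// from the YYYY-MM-DD prefix of `name`, then sort newest first.
function withDates (posts) {
  return posts
    .map(function (post) {
      var match = /^(\d{4}-\d{2}-\d{2})-/.exec(post.name)
      // Prefer the explicit date; fall back to the prefix of `name`.
      var date = post.date || (match ? match[1] : null)
      return Object.assign({}, post, { date: date })
    })
    .sort(function (a, b) {
      // ISO dates compare correctly as plain strings; undated posts go last.
      if (a.date === b.date) return 0
      if (a.date === null) return 1
      if (b.date === null) return -1
      return a.date < b.date ? 1 : -1
    })
}

// Three entries abbreviated from posts.js above.
var sample = [
  { title: 'The likely collapse of a major proprietary hosting service', name: '2015-12-01-proprietary' },
  { date: '2016-01-19', title: 'A Brief History of Dat', name: '2016-01-19-brief-history-of-dat' },
  { title: 'Dat 1.0 is ready', name: '2016-02-01-dat-1', date: '2016-02-05' }
]

var sorted = withDates(sample)
sorted.forEach(function (post) {
  console.log(post.date, post.title)
})
```

An explicit `date` wins over the prefix, which matters for the "Dat 1.0 is ready" entry, whose `name` says `2016-02-01` while its `date` says `2016-02-05`.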
11 changes: 11 additions & 0 deletions posts/2014-04-02-dat.md
@@ -0,0 +1,11 @@
# Sloan Funding Dat Development

We’ve got two happy pieces of news at the U.S. Open Data Institute.

The first is that board member [Max Ogden](http://maxogden.com/) is transitioning to being an employee, as we start to house and support [Dat](http://dat-data.com/), Max’s data package management tool. Dat is a system for real-time replication, transformation, and versioning of large tabular datasets. Git is to source code as Dat is to datasets. [The John S. and James L. Knight Foundation provided $50,000](http://maxogden.com/) to support taking Dat from an idea to [a pre-alpha stage](https://github.com/maxogden/dat/), which sustained the project until February. Now it's becoming a US ODI project.

This is possible because of the second announcement: the [Alfred P. Sloan Foundation](http://www.sloan.org/) has provided $260,000 in funding to support the continued development of Dat. That money will be used to pay Max and two additional developers to spend the year growing Dat into a vibrant, healthy, widely used open source project. Sloan's interest is in seeing Dat used in the sciences, which produce a great deal of data but, for reasons both cultural and technological, do not generally publish that data openly. We intend to play a role in changing that.

Max got started yesterday. You can follow along with development of Dat at [its GitHub repository](https://github.com/maxogden/dat/).

Our thanks to Josh Greenberg and the Sloan Foundation for their generosity and gracious support. We're also grateful to Casey Dachs and the [Miami Foundation](https://www.miamifoundation.org/), and the folks at the Knight Foundation for their important help in making this possible.
82 changes: 82 additions & 0 deletions posts/2014-08-19-dat-alpha.md
@@ -0,0 +1,82 @@
# Announcing the dat alpha.
by [Max Ogden](http://maxogden.com)

The first code went into dat one year ago, [on August 17th 2013](https://github.com/maxogden/dat/commit/e5eda57b53f60b05c0c3d97da90c10cd17dcbe19). Today, after a year of work, we are really excited to release the first major version of dat along with a [new website](http://dat-data.com).

Our overall goal with dat is to make a set of tools for creating and sharing streaming data pipelines, a sort of [ETL](http://en.wikipedia.org/wiki/Extract,_transform,_load) style system but designed from the ground up to be developer friendly, open source and streaming. We are aligned with the goals of the [frictionless data](http://blog.okfn.org/2013/04/24/frictionless-data-making-it-radically-easier-to-get-stuff-done-with-data/) initiative and see dat as an important tool for sharing data wrangling, munging and clean-up code so that data consumers can simply `dat clone` to get good data.

The first six months of dat development were spent making a prototype (thanks to the [Knight Foundation Prototype Fund](http://www.knightfoundation.org/grants/201346305/)). In April of this year we were able to expand the team working on dat from one person to three, thanks to [support from the Sloan Foundation](http://usodi.org/2014/04/02/dat). At that time dat also became an official [US Open Data Institute](http://usopendata.org/) project, to ensure that open data remains a top priority going forward.

Sloan's proposition was that they liked the initial dat prototype but wanted to see scientific data use cases treated as the top priority. As a result we expanded the scope of the project from its tabular-data-specific beginnings and have focused on adding features that will help us work with larger scientific datasets.

Up until this point, the dat API has been in flux, as we were constantly iterating on it. From this point forward we will be taking backwards compatibility much more seriously, so that third party developers can feel confident building on top of dat.

## How to get involved

### Try it out

You can [install dat today](https://github.com/maxogden/dat#install) and play around with it by importing or cloning a dataset.

You can also click this button to deploy a dat to Heroku for testing purposes for free (but be aware of the [Heroku ephemeral filesystem limitations](https://github.com/maxogden/dat/blob/master/readme.md#deploy-to-heroku)):

[![Deploy](https://www.herokucdn.com/deploy/button.png)](https://heroku.com/deploy?template=https://github.com/bmpvieira/heroku-dat-template.git)

The dat REST API comes bundled with the [dat-editor](https://github.com/maxogden/dat-editor#readme) web application.

<a href="https://github.com/maxogden/dat-editor">
<img height="200" title="dat editor" alt="dat editor" src="https://raw.githubusercontent.com/maxogden/dat-editor/master/screenshot.png"/>
</a>

To start learning about how to use dat please read our [getting started guide](https://github.com/maxogden/dat/blob/master/docs/getting-started.md).

To help you choose an approach to loading data into dat we have created a [data importing guide](https://github.com/maxogden/dat/blob/master/docs/importing.md).

### Write a module or 5

The benefit of dat isn't in the dat module itself, but rather in the ecosystem that can be built around it.

There are a lot of modules that we think would be really awesome to have, and [we started a wishlist here](https://github.com/datproject/discussions/issues/5). If you see something you are interested in building, please leave a comment on that thread stating your intent. Similarly, if there is a format or storage backend that you would like to see dat support, leave it in the comments.

## Pilot users

This release of dat represents our efforts to get it to a point where we can start working with scientists on modeling their data workflows with dat. We will now be starting concrete work on these pilot use cases.

If you have a use case in mind and you want to bounce it off of us, please open an issue on the maxogden/dat repository with a detailed description.

While we don't have many details to share today about these pilots, we hope to change that over the next few months.

### Bionode (Bioinformatics -- DNA)

<a href="http://bionode.io">
<img height="100" width="100" title="bionode" alt="bionode logo" src="https://rawgithub.com/bionode/bionode/master/docs/bionode-logo.min.svg"/>
</a>

Dat core team member [@bmpvieira](https://github.com/bmpvieira/), a Bioinformatics PhD student at Queen Mary University in London, is working on applying dat to various DNA-analysis datasets.

Bruno runs the [Bionode](https://github.com/bionode) project. We will be working on integrating Bionode with dat workflows to solve common problems in DNA bioinformatics research.

### RNA-Seq (Bioinformatics -- RNA)

Two researchers from UC-San Diego reached out to us recently and have started explaining their use case [here](https://github.com/maxogden/dat/issues/129) and [here](https://github.com/maxogden/dat/issues/135). We hope to use dat to make their data management problems go away.

### Sloan Digital Sky Survey (Astronomy)

<a href="http://sdss.org">
<img height="100" title="sdss" alt="sdss" src="https://raw.githubusercontent.com/maxogden/dat/master/img/sdss.png"/>
</a>

We will be working with the [SDSS](http://www.sdss.org/) project to share their large scans of the visible universe, and eventually to connect their data with sky survey data from other organizations.

## The future of dat

This release is the first step towards our goal of creating a streaming interface between every database or file storage backend in the world. We are trying to solve hard problems the right way. This is a process that takes a lot of time.

In the future we would also like to work on a way to easily host and share datasets online. We envision a sort of data package registry, similar to [npmjs.org](http://npmjs.org), but designed with datasets in mind. This kind of project could also eventually turn into a sort of "GitHub for data".

We also want to hook dat up to P2P networks, so that we can make downloads faster but also so that datasets become more permanent. Dat advisor Juan Benet is now working on [IPFS](http://ipfs.io/), which we are excited to hook up to dat when it is ready.

Certain datasets are simply too large to share, so we also expect to work on a distributed computation layer on top of dat in the future (similar to the [ESGF](http://esgf.org/) project).

You can help us discuss these high level future ideas on [this issue](https://github.com/datproject/discussions/issues/1).

*To keep up to date with the dat project you can follow [@dat_project](https://twitter.com/dat_project) on Twitter or [watch the repo](https://github.com/maxogden/dat) on GitHub.*
9 changes: 9 additions & 0 deletions posts/2015-04-03-sloan.md
@@ -0,0 +1,9 @@
# Sloan Redoubles Dat Funding

We’re thrilled to announce that the [Alfred P. Sloan Foundation](http://www.sloan.org/) has provided a $640,000 grant to support the development of [Dat](http://dat-data.com/). Created by Max Ogden, and housed at U.S. Open Data, Dat makes it easy to create automated, reproducible data pipelines that sync. Sloan’s support will allow Dat to bring to scientific data the same automated, distributed workflows that Git brings to source code sharing. They’re funding three full-time positions for the next two years.

Dat is about more than scientific data—Sloan’s core support is about building its capacity in the sciences, but it will also improve Dat in ways that will serve open data generally.

One year ago, [Sloan provided $260,000 in funding for Dat development](https://usopendata.org/2014/04/02/dat/), also focusing on scientific data use cases. That funding enabled the team to release an alpha version of Dat in August and to prepare a beta version, which is due out imminently.

We’re all grateful to the Sloan Foundation for their support, and to Sloan’s Josh Greenberg for his guidance and forbearance, and look forward to two years of fruitful work.