Skip to content
This repository has been archived by the owner on Feb 6, 2020. It is now read-only.

another approach to use spot instance for persistent training in AWS #89

Open
xiuliren opened this issue May 5, 2016 · 1 comment
Open

Comments

@xiuliren
Copy link
Member

xiuliren commented May 5, 2016

  • create a spot fleet or persistent request, it is a kind of persistent spot instance managed by AWS. define the training command by user_data of the spot instance.
  • read all the data and configuration files in S3, save network to S3. This can be implemented using boto3 and remove the complex dependency of starcluster.
  • whenever there is an update of saved network file, plot the learning curve online using Plotly.
@xiuliren
Copy link
Member Author

created a pull request to implement this feature:
#94
need some more tests.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

1 participant