-
Notifications
You must be signed in to change notification settings - Fork 790
Focus areas and goals
This page gives details of the focus areas and goals for the Kubeflow Doc Sprint, and what you'll get out of attending.
We'll refine our goals together in the doc-sprint Kanban board before and during the sprint.
The primary goals of the doc sprint are:
-
Describe Kubeflow use cases. A November analytics study showed that people visited the use cases section of the docs often. People want to know the most common ways of using Kubeflow, and they need to move on from the getting-started guides to indepth tutorials. But the doc ratings showed that people were disappointed with what they found in the use cases section. We need to fill the gaps. See issue #1596 for further details.
-
Build more end-to-end tutorials: In a UX study in October, Kubeflow users identified documentation as a top adoption driver, and chose end-to-end tutorials as the area that needs most attention. We ran a docs survey in January to get more info.
Feedback from the January docs survey gives us some good tasks to work on:
- Many respondents in the January docs survey found that the code samples are out of date or that they cause errors and thus cannot be completed. Yet, most users (54.5%) said that they think the most valuable type of docs are code samples with related tutorials. (Next most valuable were installation/config guides at 27.3%.) So, a very good exercise for the doc sprint is to choose an existing tutorial and test it, then update the code and docs accordingly. (Note that some Kubeflow examples have a related doc while others exist only in the kubeflow/samples repository. Both types are good candidates for the doc sprint.)
- The survey results show that the most helpful type of tutorial is "end-to-end training and serving of a particular model type (for example, image recognition / NLP / recommendation)" (36.4%). For the doc sprint, you can either develop and document a new sample for a model type, or test an existing sample and document it accordingly.
- From the survey results, the second-most helpful type of tutorial is "end-to-end deployment of Kubeflow on a particular platform (for example, AWS / GCP / local machine)" (27.3%). So a good exercise for the doc sprint is to choose an existing deployment guide and test it, then update the code and docs accordingly. In particular, on-premises and desktop/server guides need testing and fixing.
- From the survey results, another helpful type of tutorial is "end-to-end experimentation in a notebook, then moving to K8s for production" (13/6%). We've heard from users in the survey and in other forums that this is a gap that needs filling.
If you choose one of the above tasks: Check the Kanban board to see if there's already an issue raised for your chosen tutorial. If not, raise one. Assign the issue to yourself (or comment on the issue that you'll take it.)
-
Create trouble-shooting guides for those tricky issues that require the attention of an expert. By bringing many of the community members together during the doc sprint, we'll have those experts on tap!
-
Fix bugs. We're creating a hot list of bugs to fix during the doc sprint. If you spot a bug in the docs, log an issue! Include a comment that you think this is a good candidate for the doc sprint. Assign it to yourself if you'd like to tackle it.
Sometimes it helps to put tasks in buckets. So here's a classification of some of the tasks to tackle during the doc sprint:
-
Create end-to-end tutorials:
-
Create a getting-started guide for OpenShift - @vpavlin will take this one on for the doc sprint! https://github.com/kubeflow/website/issues/33
-
Pipelines: Create a pipeline-oriented hyperparameter tuning (Katib) tutorial https://github.com/kubeflow/website/issues/738
-
Pipelines: Document how to set up scheduled pipelines that can read a sliding window of data: https://github.com/kubeflow/website/issues/996
-
Pipelines: Create a tutorial that uses components for Dataflow, AI Platform training and prediction: https://github.com/kubeflow/website/issues/886
-
Microsoft Azure Pipelines: Create End-to-end MNIST/ML Pipeline tutorial on Azure: https://github.com/kubeflow/website/issues/880
-
A tutorial on serving using Cloud Run: https://github.com/kubeflow/website/issues/755
-
Pipelines: Create a tutorial that uses pre-built component(s) for a common ML workflow: https://github.com/kubeflow/website/issues/639
-
Pipelines: Create a minimal usage example for the Kubeflow Pipelines API https://github.com/kubeflow/website/issues/734
-
-
Improve the docs for some of the Kubeflow components:
- Add a doc for running TensorBoard: https://github.com/kubeflow/website/issues/208
- Expand the KFServing docs: https://github.com/kubeflow/website/issues/1412
- Update NVIDIA TensorRT Inference Server document for kustomize - sample YAML supplied on issue: https://github.com/kubeflow/website/issues/959
- Create docs for XGBoost operator: https://github.com/kubeflow/website/issues/866
- Update the metadata docs: https://github.com/kubeflow/website/issues/1207
-
Test and fix the Microk8s/Multipass installation process - see 3 issues:
-
Microsoft Azure:
- Microsoft Azure: Kubeflow installation guide needs attention: https://github.com/kubeflow/website/issues/1019
- Microsoft Azure: Test the Azure end-to-end example for Kubeflow 1.0: https://www.kubeflow.org/docs/azure/azureendtoend/ (no issue created yet)
-
AWS:
-
AWS auth and TLS: https://github.com/kubeflow/website/issues/1435
-
AWS: doc still refers to ksonnet: https://github.com/kubeflow/website/issues/1223
-
AWS: document manual steps for ASG/EC2 deletion: https://github.com/kubeflow/website/issues/851
-
AWS: Add explanation of IAM prerequisites https://github.com/kubeflow/website/issues/836
-
AWS: TensorFlow Serving: "Sending prediction request" does not work:
-
-
GCP: Test and fix the docs on using your own domain: https://github.com/kubeflow/website/issues/1166
-
Advanced config:
- Explain the usage of Istio within Kubeflow: https://github.com/kubeflow/website/issues/1532
- RBAC: https://github.com/kubeflow/website/issues/1427
- PodDefault: https://github.com/kubeflow/website/issues/1423
- Add a KfDef reference guide: https://github.com/kubeflow/website/issues/1442
-
Create an FAQ page and answer these questions.
-
Create some diagrams.
-
Here's a small issue for those interested in website UI improvements: https://github.com/kubeflow/website/issues/1211
-
Clarify Pipelines run triggers: https://github.com/kubeflow/website/issues/746
-
Do some analysis and tidying up:
- Update the list of samples and indicate which Kubeflow version each sample supports: https://github.com/kubeflow/website/issues/1316
- Pipelines: Update SDK references to point to the generated docs on Read The Docs https://github.com/kubeflow/website/issues/819
- Move info about IAP and TFServing to the docs website https://github.com/kubeflow/website/issues/240
-
Codelab: Update the GitHub Issue Summarization codelab to use kustomize instead of ksonnet: https://github.com/kubeflow/website/issues/871
- Can the tutorial be in a Jupyter notebook? Yes! We'll add a doc introducing the notebook and telling people how to run the notebook on Kubeflow.
- Where will we put the tutorials and other docs? The main location is on www.kubeflow.org.
Camaraderie and...
- Wear it with pride: A designer, limited-edition, Kubeflow Doc Sprint T-shirt.
- See your name in lights: The Kubeflow docs and samples are on GitHub. Your username will appear as author of your contributions.
- Be part of something big: We'll write up our results on the Kubeflow blog.
- Talk to Kubeflow community members: A doc sprint is a great way of bringing the community together, in the same room and online.
- Learn while you sprint: We're planning some lightning talks from UX researchers to help us shine up the way people experience Kubeflow. Technical writers will present tips on writing effectively. You'll also be able to tap the knowledge of the members of the Kubeflow community who'll be sprinting with you.
- Help other people use Kubeflow: We know Kubeflow is great, and we know how to use it. Help other people know too.
There are many ways you can take part in the doc sprint:
- Join us on site for one, two or three days, or participate online. You can mix and match, part on site and part online. (We'll include the online conference details in the calendar invitations.) The on-site doc sprint offers the best experience, as you can chat to the other participants in person, attend the mini learning sessions, and get the most out of having the experts in one room at the same time.
- Write docs and fix doc bugs.
- Review pull requests (PRs) containing docs and code written by others.
- Act as an on-call expert for one or more areas of Kubeflow, to advise people when they're building the tutorials and fixing the docs.
- Take part in the sprint demos, to show all participants what you've achieved and to cheer others on.
Check out the details and find the signup link at the main page for the Kubeflow Doc Sprint.