Adding some initial docs content and diagrams #129

Open
wants to merge 3 commits into base: main

Conversation

robscott
Member

Initial WIP attempt at some content for our docs.

@k8s-ci-robot k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Dec 21, 2024
@k8s-ci-robot k8s-ci-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Dec 21, 2024
@kfswain
Collaborator

kfswain commented Dec 21, 2024

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Dec 21, 2024
Gateway API has [more than 25
implementations](https://gateway-api.sigs.k8s.io/implementations/). As this
pattern stabilizes, we expect a wide set of these implementations to support
Contributor

It's not clear what pattern this refers to; perhaps we should use the word "pattern" in the Composable Layers section intro so that the reader can connect the dots.

## Composable Layers

This project aims to develop an ecosystem of implementations that are fully
Contributor

What do we mean by "ecosystem of implementations"? This project aims to "define specifications to enable a compatible ecosystem for extending the Gateway API with custom endpoint selection algorithms"?

Contributor

+1 on ^ description from @ahg-g


As part of this project, we're building an initial reference extension that is
focused on routing to LoRA workloads. Over time, we hope to see a wide variety
Contributor

I prefer not to give the impression that this is centered on LoRA; in reality, LoRA is just one of multiple criteria for selection.

[vLLM](https://github.com/vllm-project/vllm) and
[Triton](https://github.com/triton-inference-server/server), and will be open to
other integrations as they are requested.
Contributor

@liu-cong will create a page for the model server protocol. @liu-cong we can start with the narrowest set of requirements: kv-cache, active adapters, and queue length, and the metric type of each in both Prometheus and ORCA formats.
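As a sketch of what that narrowest set of requirements could look like on a model server's metrics endpoint, here is a hypothetical Prometheus exposition fragment. The metric names below are illustrative placeholders, not the finalized protocol:

```text
# Hypothetical metric names; the actual protocol page may differ.
# TYPE kv_cache_usage_percent gauge
kv_cache_usage_percent 0.42
# TYPE active_lora_adapters gauge
active_lora_adapters{adapter="sql-expert"} 1
# TYPE request_queue_length gauge
request_queue_length 7
```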

@ahg-g
Contributor

ahg-g commented Dec 21, 2024

Part of #72

@k8s-ci-robot k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Dec 30, 2024
@k8s-ci-robot
Contributor

New changes are detected. LGTM label has been removed.

@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Dec 30, 2024
@k8s-ci-robot k8s-ci-robot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Dec 31, 2024
Contributor

@danehans left a comment


A few nits, otherwise /LGTM.

processor:
ignoreTypes:
- "(InferencePool|InferenceModel)List$"
# RE2 regular expressions describing type fields that should be excluded from the generated documentation.
Contributor

RE2?


render:
# Version of Kubernetes to use when generating links to Kubernetes API documentation.
kubernetesVersion: 1.31
Contributor

Can this value be an environment variable? I ask b/c it would be nice to have all version dependencies centrally located, e.g. Makefile.


* InferencePool is supported as a backend type
* Implementations forward requests to the configured extension for an
InferencePool using the protocol specified by this project
Contributor

using the protocol specified by this project

Can you clarify? By "protocol", are you referring to the xRoute types supported by the implementation?
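One way to read "InferencePool is supported as a backend type" is that an HTTPRoute can reference an InferencePool in its backendRefs, the way Gateway API allows custom backend kinds. A hypothetical sketch, with the API group and resource names assumed rather than confirmed by this PR:

```yaml
apiVersion: gateway.networking.k8s.io/v1
kind: HTTPRoute
metadata:
  name: llm-route
spec:
  parentRefs:
    - name: inference-gateway
  rules:
    - backendRefs:
        # Hypothetical group/kind/name; assumptions for illustration only.
        - group: inference.networking.x-k8s.io
          kind: InferencePool
          name: my-pool
```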


The overall resource model focuses on 2 new inference-focused
[personas](/concepts/roles-and-personas) and corresponding resources that
Contributor

Note that roles-and-personas.md is in a TODO state.

replace a Kubernetes Service with an InferencePool. This resource has some
similarities to Service (a way to select Pods and specify a port), but has some
unique capabilities. With InferenceModel, you can configure a routing extension
Contributor

but has some unique capabilities.

It would be helpful to talk about what the unique capabilities are or provide an example of a unique capability.

With InferenceModel, you can configure a routing extension as well as inference-specific routing optimizations.

Why is InferenceModel being highlighted here? Are you trying to describe how an InferencePool combined with an InferenceModel provides the unique capabilities described above?
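For context on the "similarities to Service" point, a minimal InferencePool might look something like the sketch below. All field names here are illustrative assumptions, not the finalized API:

```yaml
# Illustrative sketch only; field names are assumptions, not the final API.
apiVersion: inference.networking.x-k8s.io/v1alpha1
kind: InferencePool
metadata:
  name: llm-pool
spec:
  # Like a Service: select Pods and specify a port...
  selector:
    app: vllm
  targetPort: 8000
  # ...unlike a Service: delegate endpoint selection to an extension.
  extensionRef:
    name: endpoint-picker
```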

Comment on lines +29 to +30
with that model. This resource enables you to configure the relative criticality
Contributor

s/configuration associated with that model./its associated configuration./


metrics probing may happen asynchronously, depending on the extension.

4. The extension will instruct the Gateway which endpoint should be routed to.
Contributor

s/which endpoint should be routed to./which endpoint the request should be routed to./

@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: danehans, robscott
Once this PR has been reviewed and has the lgtm label, please ask for approval from kfswain. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

5 participants