Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Table of Contents, update metadata page, add name note, fix a few names and typos #495

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

nkmcalli
Copy link
Collaborator

Description

Update Table of Contents, update metadata page, add name note, fix a few names and typos

This pr replaces #463

@nkmcalli nkmcalli added the doc Improvements or additions to documentation label Feb 27, 2025
@nkmcalli nkmcalli self-assigned this Feb 27, 2025
@nkmcalli nkmcalli requested a review from a team as a code owner February 27, 2025 04:29
Copy link

copy-pr-bot bot commented Feb 27, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@nkmcalli
Copy link
Collaborator Author

@jdye64 I made some updates to the doc build config. Let me know if there are any problems.

@@ -19,6 +19,9 @@ limitations under the License.

The nv-ingest devcontainer is provided as a quick-to-set-up development and exploration environment for use with [Visual Studio Code](https://code.visualstudio.com) (Code). The devcontainer is a lightweight container which mounts-in a Conda environment with cached packages, alleviating long Conda download times on subsequent launches. It provides a simple framework for adding developer-centric [scripts](#development-scripts), and incorporates some helpful Code plugins.

> [!Note]
> NV-Ingest is also known as NVIDIA Ingest and NeMo Retriever Extraction.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Isabel emphasized we use "e" instead of "E" in Extraction.
So it would be NeMo Retriever extraction.

@@ -8,6 +8,9 @@ SPDX-License-Identifier: Apache-2.0

NVIDIA-Ingest is a scalable, performance-oriented document content and metadata extraction microservice. Including support for parsing PDFs, Word and PowerPoint documents, it uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images for use in downstream generative applications.

> [!Note]
> NVIDIA Ingest is also known as NV-Ingest and NeMo Retriever Extraction.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same here - NeMo Retriever extraction

@@ -8,6 +8,10 @@ SPDX-License-Identifier: Apache-2.0

NV-Ingest-Client is a tool designed for efficient ingestion and processing of large datasets. It provides both a Python API and a command-line interface to cater to various ingestion needs.

> [!Note]
> NV-Ingest is also known as NVIDIA Ingest and NeMo Retriever Extraction.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same here - NeMo Retriever extraction

@@ -3,7 +3,7 @@ hide:
- navigation
---

**NV-Ingest** is a scalable, performance-oriented document content and metadata extraction microservice. NV-Ingest uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images for use in downstream generative applications.. You can access NV-Ingest as a free community resource or learn more about getting an enterprise license for improved expert-level support at the [NV-Ingest homepage](https://www.nvidia.com).
NeMo Retriever Extraction (NV-Ingest) is a scalable, performance-oriented document content and metadata extraction microservice. NV-Ingest uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images for use in downstream generative applications.. You can access NV-Ingest as a free community resource or learn more about getting an enterprise license for improved expert-level support at the [NV-Ingest homepage](https://www.nvidia.com).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is the URL correct here? It says NV-Ingest homepage but takes the user to nvidia.com

@@ -1,43 +1,47 @@
# What is NVIDIA-Ingest?
# What is NVIDIA Ingest?
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thought we would use either NV-Ingest or NeMo Retriever Extraction. Is it necessary to use the 3rd alternative "NVIDIA Ingest" ? Maybe in this example it seems to make more sense to call it NeMo Retriever extraction?

NV-Ingest is a scalable, performance-oriented document content and metadata extraction microservice.
NV-Ingest uses specialized NVIDIA NIM microservices
NVIDIA Ingest is a scalable, performance-oriented document content and metadata extraction microservice.
NVIDIA Ingest uses specialized NVIDIA NIM microservices
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same here and in other places we have used "NVIDIA Ingest". We should replace with either NV-Ingest or NeMo Retriever extraction

@@ -225,7 +228,7 @@ pip install .

### Step 3: Ingesting Documents

You can submit jobs programmatically in Python or via the nv-ingest-cli tool.
You can submit jobs programmatically in Python or via the [NV-Ingest CLI](nv-ingest_cli.md).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Leave as nv-ingest-cli

@@ -130,7 +130,7 @@ pip install .
## Step 3: Ingesting Documents
You can submit jobs programmatically in Python or using the nv-ingest-cli tool.
You can submit jobs programmatically in Python or using the [NV-Ingest CLI](nv-ingest_cli.md).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Leave as nv-ingest-cli

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
doc Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants