Added new model support Cohere/Command-R #154

quic-akuruvil · 2024-10-14T04:17:50Z

Added the model architecture changes.
Added a new input parameter, inputs_embeds, which replaces input_ids so that the embeddings are calculated on the host.
Adjusted the code flow to integrate the new inputs_embeds input.
Continuous batching (CB) mode is enabled for this model.
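
As a rough illustration of what "calculating embeddings on the host" could mean, the sketch below maps token ids to embedding rows before anything is handed to the compiled model. The function name `embed_on_host` and the toy table are hypothetical and are not taken from this PR; the real change presumably reuses the model's own embedding weights.

```python
from typing import List, Sequence


def embed_on_host(
    input_ids: Sequence[int],
    embedding_table: Sequence[Sequence[float]],
) -> List[List[float]]:
    """Return one embedding row per token id (hypothetical helper, not the PR's code)."""
    vocab_size = len(embedding_table)
    rows: List[List[float]] = []
    for token_id in input_ids:
        # Reject ids outside the vocabulary instead of silently wrapping around.
        if not 0 <= token_id < vocab_size:
            raise IndexError(f"token id {token_id} outside vocabulary of size {vocab_size}")
        rows.append(list(embedding_table[token_id]))
    return rows


# Toy 3-token vocabulary with 2-dimensional embeddings (illustrative values only).
table = [[0.0, 0.0], [0.1, 0.2], [0.3, 0.4]]
inputs_embeds = embed_on_host([2, 1], table)
```

Under this assumption, the exported graph would take `inputs_embeds` (one vector per token) where it previously took `input_ids`.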

anujgupt-github · 2024-10-14T08:01:53Z

QEfficient/transformers/models/cohere/modeling_cohere.py

+#
+# Copyright (c) 2024 Qualcomm Innovation Center, Inc. All rights reserved.
+# SPDX-License-Identifier: BSD-3-Clause
+#


Can you add a delimiter here?

anujgupt-github · 2024-10-14T08:11:36Z

Update this also:
https://github.com/quic/efficient-transformers/blob/main/docs/source/validate.md

anujgupt-github · 2024-10-22T05:40:37Z

@quic-akuruvil can you please add accuracy evaluation run details?

quic-akuruvil · 2024-10-22T05:47:32Z

> @quic-akuruvil can you please add accuracy evaluation run details?

Yes, I tried running the perplexity script, but it broke. I’ll look into it and get the numbers.

vbaddi · 2024-10-22T13:52:55Z

QEfficient/cloud/infer.py

@@ -72,6 +72,7 @@ def main(
        cache_dir=cache_dir,
        hf_token=hf_token,
    )
+    embeds, config = get_embeddings(model_name, hf_token, cache_dir, local_model_dir)


Do we want to generalize this? I don't think this is accessible for all the model categories.

Yes, right. We can make this conditional and fetch embeddings only for Cohere.
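
One way the conditional fetch discussed above might look is sketched here. `maybe_fetch_embeddings` and `fake_fetch` are hypothetical names; in the PR the real work is done by `get_embeddings(model_name, hf_token, cache_dir, local_model_dir)`, which is stubbed out so the example runs without downloading a model.

```python
from typing import Any, Callable, List, Optional, Tuple


def maybe_fetch_embeddings(
    is_cohere: bool,
    fetch: Callable[[], Tuple[Any, Any]],
) -> Tuple[Optional[Any], Optional[Any]]:
    """Call `fetch` only for Cohere models; every other model gets (None, None)."""
    if not is_cohere:
        return None, None
    return fetch()


calls: List[str] = []


def fake_fetch() -> Tuple[str, str]:
    # Stand-in for get_embeddings(...); records that it was invoked.
    calls.append("fetched")
    return "embeds", "config"


non_cohere_result = maybe_fetch_embeddings(False, fake_fetch)
cohere_result = maybe_fetch_embeddings(True, fake_fetch)
```

Passing the fetch step as a callable keeps the expensive lookup off the path for model categories that do not need it.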

vbaddi · 2024-10-22T13:54:17Z

QEfficient/exporter/export_hf_to_cloud_ai_100.py

@@ -204,18 +204,23 @@ def export_kvstyle_transformed_model_to_onnx(
        raise ValueError(f"Need seq_len to be greater than zero, got seq_len={seq_len}")

    # Preprocess inputs
+    embeds = None
+    if model_name == "CohereForAI/c4ai-command-r-v01":


If we want to do this for all the versions of Cohere and not just this specific model_name, then we need to do it at the torch level. It is not good practice to do it at the model_name level.

Okay, sure. I have changed it to apply to architectures that have the CohereCausalLM head, so it's more generic.
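
A minimal sketch of an architecture-based check, as opposed to comparing against one model name, is given below. It assumes the decision is driven by the `architectures` list of a Hugging Face style config and that the relevant entry is `CohereForCausalLM` (the thread says "CohereCausalLM"; the exact string used in the PR is not shown here). `needs_host_embeddings` is a hypothetical helper.

```python
from types import SimpleNamespace
from typing import Iterable, Optional

# Assumption: architecture names for which embeddings are computed on the host.
HOST_EMBEDDING_ARCHITECTURES = frozenset({"CohereForCausalLM"})


def needs_host_embeddings(architectures: Optional[Iterable[str]]) -> bool:
    """True if any listed architecture is in the host-embedding set."""
    return any(name in HOST_EMBEDDING_ARCHITECTURES for name in (architectures or ()))


# A stand-in for a loaded model config; real configs expose `architectures` similarly.
config = SimpleNamespace(architectures=["CohereForCausalLM"])
embeds = None
if needs_host_embeddings(config.architectures):
    embeds = "fetch embeddings here"  # placeholder for the real lookup
```

Keying on the architecture means any checkpoint that shares the same head would take this path, not only `CohereForAI/c4ai-command-r-v01`.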

quic-akuruvil · 2024-11-05T17:40:39Z

> @quic-akuruvil can you please add accuracy evaluation run details?

The perplexity matches for ONNX and QPC.
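
For readers unfamiliar with the check, perplexity is the exponential of the negative mean per-token log-likelihood, and "matching" would typically mean the two runtimes agree within a small tolerance. The sketch below uses made-up toy log-probabilities, not numbers from this PR, and the helper names and tolerance are assumptions.

```python
import math
from typing import Sequence


def perplexity(token_log_probs: Sequence[float]) -> float:
    """exp(-mean(log p)) over the natural-log probabilities of the target tokens."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


def perplexities_match(a: float, b: float, rel_tol: float = 1e-2) -> bool:
    """Compare two perplexities with a relative tolerance (the tolerance is an assumption)."""
    return math.isclose(a, b, rel_tol=rel_tol)


# Toy values standing in for per-token log-probs from two runtimes.
onnx_log_probs = [math.log(0.5), math.log(0.25)]
qpc_log_probs = [math.log(0.5), math.log(0.25)]

onnx_ppl = perplexity(onnx_log_probs)
qpc_ppl = perplexity(qpc_log_probs)
same = perplexities_match(onnx_ppl, qpc_ppl)
```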

quic-akuruvil · 2024-11-06T04:02:25Z

> @quic-akuruvil can you please add accuracy evaluation run details?

The perplexity matches for ONNX and QPC.

For CL=1024, the perplexity results are as below:

quic-akuruvil · 2024-11-11T03:47:10Z

> @quic-akuruvil can you please add accuracy evaluation run details?

Done

quic-rishinr · 2024-11-11T05:14:32Z

@quic-akuruvil Can you please resolve the conflicts?

Signed-off-by: Ann <[email protected]>

quic-akuruvil requested review from anujgupt-github, vbaddi and irajagop October 14, 2024 04:17

quic-akuruvil self-assigned this Oct 14, 2024

quic-akuruvil requested review from quic-rishinr and ochougul as code owners October 14, 2024 04:17

quic-akuruvil added the wip and enhancement labels and removed the wip label Oct 14, 2024

anujgupt-github reviewed Oct 14, 2024


vbaddi requested changes Oct 22, 2024


quic-rishinr added the in-review Review process is ongoing label Nov 6, 2024

quic-akuruvil added the wip Work in progress label Nov 13, 2024

quic-akuruvil added 2 commits November 13, 2024 16:04

Added new model support Cohere/Command-R

d5c622b

Signed-off-by: Ann <[email protected]>

Formatting done

5eaf4bb

Signed-off-by: Ann <[email protected]>

quic-akuruvil force-pushed the cohere branch from 61ca907 to 5eaf4bb Compare November 14, 2024 03:58

quic-akuruvil added 2 commits November 14, 2024 04:17

Rebase with main

6290a49

Signed-off-by: Ann <[email protected]>

Formatting files

83d0900

Signed-off-by: Ann <[email protected]>
