-
Notifications
You must be signed in to change notification settings - Fork 136
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Benchmarking january results. (#2189) * Benchmarking january results. * Update to add MFE job definition files. * Fix phi-2 paths. * Update phi-2 model directory. * Fix boolq phi-2 results path. --------- Co-authored-by: Alex Kalita <[email protected]> * Model card updated for whisper large (#2202) * fix credential-less blob check (#2188) * fix credential-less blob check * add spec_version_upgrader * update component versions * add header and doc string. * add more UT for spec version upgrader * remove trailing whitespace * add missing param. * add null check for client_secret for adlsgen2 datastore --------- Co-authored-by: Richard Li <[email protected]> * upgrading the environment to latest pkgs (#2204) * removing NC series from computes allow list (#2211) * updating model specific defaults and finetune config for mistral model (#2209) * add rai qa quality and safety eval flow (#2208) * add rai qa quality and safety eval flow * add test_config for rai qa quality & safety flow * Check if secrets exist (#2217) * Check if secrets exists * update * Update * add batch allowlist for mistral base model (#2201) * add batch allowlist for mistral base model * format * Fix olive-optimizer vul Jan new (#2200) * Vulnerability fixes for python-sdk-v2 and model-management environment (#2216) * sdk v2 * sdk v2 * sdk v2 * sdk v2 * sdk v2 * sdk v2 * sdk v2 * sdk v2 * sdk v2 * new acpt env for torch2.1 and cuda12.1 (#2186) * new env for cuda12.1 * updated * update rai qa safety flow output format (#2226) * update rai qa safety flow output format * update rai qa quality&saftey flow output format * bump up component version and use azureml-rag 0.2.24.2 in environment (#2225) * Update DBCopilot version (#2220) * Preprocessor custom scipt fix (#2219) * Replaced os.system with subprocess.check_output in dataset_preprocessor method that is used to run custom script. * Replaced os.system with subprocess.check_output in dataset_preprocessor method that is used to run custom script. * Fix llama-2-7b results for truthful-qa (#2229) * stable diffusion XL base model support (#2233) * basexl update * wrapper updates * format update * Make sure we recover details * Upgrade AML Benchmark components (#2236) Co-authored-by: Sarthak Singhal <[email protected]> * add gsq e2e test (#2231) * Update inputs (#2239) * Remove acs stuff in faiss pipeline (#2240) * Add two promptflow models: count-cars and detect-defects (#2070) * Add two promptflow models: count-cars and detect-defects * Add ci test configs for count-cars and detect-defects * Put "connection" into "inputs" for Azure OpenAI GPT-4 Turbo with Vision tool --------- Co-authored-by: Zhi Zhou <[email protected]> * Update DBCopilot promptflow (#2242) * SystemLog: prefix logging * Adding more detailed logging * Ccozianu/rm bug fix (#2247) * add more logs * fix stdout logs * Make sure we recover details * Make sure we recover details (#2238) * SystemLog: prefix logging * Ccozi/temp logging fix (#2246) * Make sure we recover details * SystemLog: prefix logging * Adding more detailed logging --------- Co-authored-by: svaruag <[email protected]> * Fixing typo --------- Co-authored-by: arun-rajora <[email protected]> Co-authored-by: Alex Kalita <[email protected]> Co-authored-by: HrishikeshGeedMS <[email protected]> Co-authored-by: Richard Li <[email protected]> Co-authored-by: Richard Li <[email protected]> Co-authored-by: pmanoj <[email protected]> Co-authored-by: qusongms <[email protected]> Co-authored-by: Ayush Mishra <[email protected]> Co-authored-by: ym11369 <[email protected]> Co-authored-by: savitamittal1 <[email protected]> Co-authored-by: jingyizhu99 <[email protected]> Co-authored-by: XiangRao <[email protected]> Co-authored-by: Nivedita Mishra <[email protected]> Co-authored-by: Ramu Vadthyavath <[email protected]> Co-authored-by: sarthaks95 <[email protected]> Co-authored-by: Sarthak Singhal <[email protected]> Co-authored-by: Ilya Matiach <[email protected]> Co-authored-by: jinzhaochang <[email protected]> Co-authored-by: Zhi Zhou <[email protected]> Co-authored-by: Zhi Zhou <[email protected]> Co-authored-by: svaruag <[email protected]>
- Loading branch information
1 parent
20ffbd7
commit 2d747f0
Showing
503 changed files
with
34,373 additions
and
702 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/aml-benchmark/components/batch-benchmark-score/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/aml-benchmark/components/batch-output-formatter/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/evaluation_results/boolq_gpt_35_turbo_0301_question_answering/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/evaluation_results/boolq_gpt_35_turbo_0613_question_answering/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/evaluation_results/boolq_gpt_4_0314_question_answering/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/evaluation_results/boolq_gpt_4_0613_question_answering/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/evaluation_results/boolq_gpt_4_32k_0314_question_answering/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/evaluation_results/boolq_gpt_4_32k_0613_question_answering/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/evaluation_results/boolq_llama_2_13b_chat_question_answering/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/evaluation_results/boolq_llama_2_13b_question_answering/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/evaluation_results/boolq_llama_2_70b_chat_question_answering/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/evaluation_results/boolq_llama_2_70b_question_answering/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/evaluation_results/boolq_llama_2_7b_chat_question_answering/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
2 changes: 1 addition & 1 deletion
2
assets/evaluation_results/boolq_llama_2_7b_question_answering/spec.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
3 changes: 3 additions & 0 deletions
3
assets/evaluation_results/boolq_microsoft_phi_2_question_answering/asset.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
type: evaluationresult | ||
spec: spec.yaml | ||
categories: ["EvaluationResult"] |
Oops, something went wrong.