Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Windows] Support CPU shared memory (Client/Frontend) #7048

Open
wants to merge 3,302 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
3302 commits
Select commit Hold shift + click to select a range
a58246f
Fix default instance name (#6097)
kthui Jul 24, 2023
1d02ea6
Removing unused tests (#6085)
oandreeva-nv Jul 25, 2023
4b3c278
Update post-23.07 release (#6103)
mc-nv Jul 28, 2023
46859db
Improve asyncio testing (#6122)
Tabrizian Jul 28, 2023
4de6d8a
Reduce instance count to 1 for python bls model loading test (#6130)
krishung5 Aug 1, 2023
28da6a6
Fix queue test to expect exact number of failures (#6133)
tanmayv25 Aug 4, 2023
01c70b9
Add CPU & GPU metrics in Grafana dashboard.json for K8s op prem deplo…
yeahdongcn Aug 4, 2023
a129063
Adding the support tracing of child models invoked from a BLS model (…
oandreeva-nv Aug 7, 2023
775d700
Test python environments 3.8-3.11 (#6109)
jbkyang-nvi Aug 8, 2023
1d97247
Improve L0_backend_python debugging (#6157)
krishung5 Aug 8, 2023
8a7c14c
Add unreachable output test for reporting source of disconnectivity (…
GuanLuo Aug 8, 2023
2591c87
Update 'main' to track development of 2.38.0 / 23.09 (#6163)
mc-nv Aug 8, 2023
1370ec7
Fix the versions in the doc (#6164)
tanmayv25 Aug 9, 2023
39fafff
Update docs with NVAIE messaging (#6162)
dzier Aug 9, 2023
e87478e
Add sanity tests for parallel instance loading (#6126)
rmccorm4 Aug 10, 2023
da392f4
Remove extra whitespace (#6174)
dyastremsky Aug 10, 2023
d126623
Remove a test case that sanity checks input value of --shape CLI flag…
nv-hwoo Aug 10, 2023
874ff6d
Add test when unload/load requests for same model is received at the …
kthui Aug 14, 2023
e6735e8
Make assert failures clearer in L0_trt_plugin (#6166)
dyastremsky Aug 14, 2023
1bb0131
Add end-to-end CI test for decoupled model support (#6131) (#6184)
matthewkotila Aug 15, 2023
d8fd125
Test preserve_ordering for oldest strategy sequence batcher (#6185)
rmccorm4 Aug 15, 2023
c968045
added debugging guide (#5924)
tanayvarshney Aug 15, 2023
ecfe017
Add deadlock gdb section to debug guide (#6193)
rmccorm4 Aug 15, 2023
8cca875
Fix character escape in model repository documentation (#6197)
GuanLuo Aug 16, 2023
5d2ea14
Fix docs test (#6192)
matthewkotila Aug 16, 2023
c527cfd
Add utility functions for array manipulation (#6203)
dyastremsky Aug 17, 2023
0cf0681
Add gc collect to make sure gpu tensor is deallocated (#6205)
krishung5 Aug 17, 2023
d8931b0
Check for log error on failing to find explicit load model (#6204)
rmccorm4 Aug 18, 2023
1423260
Set default shm size to 1MB for Python backend (#6209)
Tabrizian Aug 22, 2023
abd53c9
Trace Model Name Validation (#6199)
fpetrini15 Aug 23, 2023
e43aa5d
Fix the check for error message (#6226)
krishung5 Aug 23, 2023
2007bd8
Fix copyright for debugging guide (#6225)
krishung5 Aug 23, 2023
1f9db58
Add watts units to GPU power metric descriptions (#6242)
rmccorm4 Aug 28, 2023
6ec125b
Update post-23.08 release (#6234)
mc-nv Aug 30, 2023
4b45fa6
Add tests/docs for queue size (pending request count) metric (#6233)
rmccorm4 Aug 30, 2023
dd696b1
Adding safe string to number conversions (#6173)
oandreeva-nv Aug 30, 2023
38bdc53
Test Python execute() to return Triton error code (#6228)
kthui Sep 2, 2023
ed76ffb
Add docs on IPv6 (#6262)
rmccorm4 Sep 2, 2023
1ea7765
Add test for TensorRT version-compatible model support (#6255)
dyastremsky Sep 5, 2023
7429cf8
Upgrade protobuf version (#6268)
krishung5 Sep 6, 2023
9ea1108
Add testing for retrieving shape and datatype in backend API (#6231)
fpetrini15 Sep 6, 2023
a3cb238
Update 'main' to track development of 2.39.0 / 23.10 (#6277)
mc-nv Sep 8, 2023
edb95cb
Apply UCX workaround (#6254)
Tabrizian Sep 11, 2023
d622ed4
Add ensemble parameter forwarding test (#6284)
rmccorm4 Sep 11, 2023
38a1eab
Exclude extra TRT version-compatible models from tests (#6294)
dyastremsky Sep 11, 2023
0f576d2
Adding installation of docker and docker-buildx (#6299)
mc-nv Sep 12, 2023
f5371a6
Use targetmodel from header as model name in SageMaker (#6147)
nikhil-sk Sep 13, 2023
4e6d7be
Add more error messages, return codes, and refactor HTTP server (#6297)
rmccorm4 Sep 14, 2023
bce13c8
Fix typo (#6318)
krishung5 Sep 18, 2023
f8e83f9
Update the request re-use example (#6283)
Tabrizian Sep 18, 2023
415495a
Disable developer tools build for In-process API + JavaCPP tests (#6296)
jbkyang-nvi Sep 19, 2023
2aec8a6
Add Python binding build. Add L0_python_api to test Python binding (#…
GuanLuo Sep 19, 2023
fdd6f6a
Increase network timeout for valgrind (#6324)
krishung5 Sep 19, 2023
d329df4
Tests and docs for ability to specify subdirectory to download for Lo…
oandreeva-nv Sep 20, 2023
605c15b
Refactor HandleInfer into more readable chunks (#6332)
rmccorm4 Sep 21, 2023
2275845
Refactor model generation scripts (#6336)
Tabrizian Sep 25, 2023
f88b4fb
Add testing for scalar I/O in ORT backend (#6343)
Tabrizian Sep 26, 2023
60d6690
Update post-23.09 release (#6367)
mc-nv Sep 29, 2023
e7ec69d
Add security policy (#6376)
dyastremsky Oct 3, 2023
cfa5f72
Adding client-side request cancellation support and testing (#6383)
tanmayv25 Oct 4, 2023
9a8bcf4
Add missing copyright (#6388)
tanmayv25 Oct 4, 2023
dd5efac
Add basic generate endpoints for LLM tasks (#6366)
GuanLuo Oct 6, 2023
d67b725
Add Python backend request cancellation test (#6364)
kthui Oct 6, 2023
6959419
Add TRT-LLM backend build to Triton (#6365) (#6392)
krishung5 Oct 7, 2023
103ee32
Add Python backend when vLLM backend built (#6397)
dyastremsky Oct 8, 2023
42addb6
Update build.py to build vLLM backend (#6394)
dyastremsky Oct 8, 2023
93da69d
Support parameters object in generate route
nnshah1 Oct 9, 2023
f54ddd6
Update 'main' to track development of 2.40.0 / 23.11 (#6400)
mc-nv Oct 9, 2023
2ffbf85
Fix L0_sdk (#6387)
tanmayv25 Oct 11, 2023
f8ba585
Add documentation on request cancellation (#6403)
tanmayv25 Oct 11, 2023
9fe8569
Fixes in request cancellation doc (#6409)
tanmayv25 Oct 11, 2023
1992892
Document generate HTTP endpoint (#6412)
GuanLuo Oct 12, 2023
f849724
Update SECURITY.md to not display commented copyright (#6426)
dyastremsky Oct 12, 2023
9bf792d
Fix missing library in L0_data_compression (#6424)
GuanLuo Oct 12, 2023
cefbfee
Add Javacpp-presets repo location as env variable in Java tests(#6385)
jbkyang-nvi Oct 13, 2023
fb296a8
TRT-LLM backend build changes (#6406)
krishung5 Oct 13, 2023
0956f95
Add gRPC AsyncIO request cancellation tests (#6408)
kthui Oct 13, 2023
ccbae03
Fix L0_implicit_state (#6427)
Tabrizian Oct 16, 2023
c112666
Fixing vllm build (#6433)
oandreeva-nv Oct 16, 2023
3aba5f4
Switch Jetson model TensorRT models generation to container (#6378)
mc-nv Oct 17, 2023
46f93e9
Bumped vllm version (#6444)
oandreeva-nv Oct 18, 2023
cf85998
Adjust test_concurrent_same_model_load_unload_stress (#6436)
kthui Oct 18, 2023
e29c89b
Adding emergency vllm latest release (#6454)
oandreeva-nv Oct 19, 2023
b792c32
Fix notify state destruction and inflight states tracking (#6451)
tanmayv25 Oct 19, 2023
566facd
Update TRT-LLM backend url (#6455)
krishung5 Oct 20, 2023
7e7ee88
Added docs on python based backends (#6429)
oandreeva-nv Oct 20, 2023
383850d
L0_model_config Fix (#6472)
fpetrini15 Oct 24, 2023
8c37608
Add test for Python model parameters (#6452)
kthui Oct 24, 2023
2f7f396
Test Python BLS with different sizes of CUDA memory pool (#6276)
krishung5 Oct 25, 2023
5d6a60a
Add documentation for K8s-onprem StartupProbe (#5257)
okdimok Oct 26, 2023
bd99799
Update `main` post-23.10 release (#6484)
mc-nv Oct 27, 2023
9f04d6d
Adding structure reference to the new document (#6493)
mc-nv Oct 28, 2023
dee479d
Improve L0_backend_python test stability (ensemble / gpu_tensor_lifec…
rmccorm4 Oct 30, 2023
ab4d03a
Add L0_generative_sequence test (#6475)
GuanLuo Nov 1, 2023
1f8507e
Downgrade vcpkg version (#6503)
GuanLuo Nov 1, 2023
a4286b5
Collecting sub dir artifacts in GitLab yaml. Removing collect functio…
krishung5 Nov 2, 2023
11ac9f0
Use post build function for TRT-LLM backend (#6476)
krishung5 Nov 2, 2023
0d8059b
Enhanced python_backend autocomplete (#6504)
oandreeva-nv Nov 2, 2023
fa8c2b6
Parse reuse-grpc-port and reuse-http-port as booleans (#6511)
dyastremsky Nov 2, 2023
aa473f1
Fixing L0_io (#6510)
oandreeva-nv Nov 3, 2023
e552ef2
Add Python-based backends CI (#6466)
pskiran1 Nov 3, 2023
e9677ec
Enabling option to restrict access to HTTP APIs based on header value…
nnshah1 Nov 3, 2023
659611c
Upgrade DCGM from 2.4.7 to 3.2.6 (#6515)
kthui Nov 6, 2023
f1465b9
Enhance GCS credentials documentations (#6526)
kthui Nov 7, 2023
2ad2786
Test file override outside of model directory (#6516)
rmccorm4 Nov 7, 2023
e12d06c
Update ORT version to 1.16.2 (#6531)
mc-nv Nov 7, 2023
21a007a
Adjusting expected error msg (#6517)
oandreeva-nv Nov 8, 2023
e33752b
Update 'main' to track development of 2.41.0 / 23.12 (#6543)
mc-nv Nov 8, 2023
0a0ea18
Enhance testing for pending request count (#6532)
tanmayv25 Nov 8, 2023
ce9652b
Add testing for Python backend request rescheduling (#6509)
krishung5 Nov 9, 2023
cf40588
Check that the wget is installed (#6556)
mc-nv Nov 9, 2023
26ef299
secure deployment considerations guide (#6533)
nnshah1 Nov 10, 2023
705f810
Fix typo and change the command line order (#6557)
mc-nv Nov 10, 2023
806fed4
Add error during rescheduling test to L0_generative_sequence (#6550)
GuanLuo Nov 13, 2023
1897add
changing references to concrete instances
nnshah1 Nov 15, 2023
246564a
Add testing for implicit state enhancements (#6524)
Tabrizian Nov 15, 2023
4be58a3
Fix copyright (#6584)
Tabrizian Nov 15, 2023
e455ed7
Mention TRT LLM backend supports request cancellation (#6585)
kthui Nov 16, 2023
42a6873
update model repository generation for onnx models for protobuf (#6575)
jbkyang-nvi Nov 16, 2023
ea1d87d
Fix L0_sagemaker (#6587)
rmccorm4 Nov 16, 2023
0b1694e
Add C++ server wrapper to the doc (#6592)
krishung5 Nov 16, 2023
624f24b
Add timeout to client apis and tests (#6546)
jbkyang-nvi Nov 16, 2023
68d5bb4
Change name generative -> iterative (#6601)
nnshah1 Nov 19, 2023
926330e
Add documentation on generative sequence (#6595)
GuanLuo Nov 20, 2023
65bc558
Updated description of iterative sequences
nnshah1 Nov 20, 2023
96933b1
Restricted HTTP API documentation
nnshah1 Nov 20, 2023
fe10097
Add request cancellation and debugging guide to generated docs (#6617)
rmccorm4 Nov 21, 2023
7c93b8e
Support for http request cancellation. Includes fix for seg fault in …
nnshah1 Nov 21, 2023
2bb61e9
Bumped vLLM version to v0.2.2 (#6623)
oandreeva-nv Nov 21, 2023
da3ecc1
Upgrade ORT version (#6618)
tanmayv25 Nov 21, 2023
17d2de7
Use compliant preprocessor (#6626)
tanmayv25 Nov 22, 2023
e0d00e6
Update README.md (#6627)
dzier Nov 22, 2023
0556b7c
Extend request objects lifetime and fixes possible segmentation fault…
tanmayv25 Nov 22, 2023
b2dd98b
Update protobuf after python update for testing (#6638)
jbkyang-nvi Nov 29, 2023
7c0ed09
Update post-23.11 release (#6653)
mc-nv Dec 4, 2023
0e721e4
PYBE Timeout Tests (#6483)
fpetrini15 Dec 4, 2023
9e7f237
Add note on lack of ensemble support (#6648)
rmccorm4 Dec 5, 2023
df9d6e8
Added request id to span attributes (#6667)
oandreeva-nv Dec 6, 2023
88b8843
Add test for optional internal tensor within an ensemble (#6663)
GuanLuo Dec 8, 2023
afaf6b0
Set CMake version to 3.27.7 (#6675)
mc-nv Dec 10, 2023
c329f48
restore typo (#6680)
mc-nv Dec 11, 2023
d496f95
Update 'main' to track development of 2.42.0 / 24.01 (#6673)
mc-nv Dec 12, 2023
d191ef9
iGPU build refactor (#6684) (#6691)
kthui Dec 12, 2023
0b63e0d
Mlflow Plugin Fix (#6685)
fpetrini15 Dec 13, 2023
ebd6a3e
Fix extra content-type headers in HTTP server (#6678)
rmccorm4 Dec 13, 2023
6ca4764
Fix iGPU CMakeFile tags (#6695)
kthui Dec 13, 2023
57b49b4
adding default value for TRITON_IGPU_BUILD=OFF (#6705)
nv-kmcgill53 Dec 14, 2023
b0bbabb
Add test case for decoupled model raising exception (#6686)
krishung5 Dec 15, 2023
8f5f515
Escape special characters in general docs (#6697)
nealvaidya Dec 15, 2023
5b46d0e
vLLM Benchmarking Test (#6631)
pskiran1 Dec 15, 2023
e2e3837
Allow configuring GRPC max connection age and max connection age grac…
siweili11 Dec 15, 2023
a4b8162
Modify HTTP frontend to return error code reflecting Triton error. Ad…
GuanLuo Dec 19, 2023
06df7a8
Update `main` branch post-23.12 release (#6726)
mc-nv Dec 26, 2023
45278d7
trtllm CI fixes (#6687)
mc-nv Dec 28, 2023
cb0c2e5
Remove double unit test (#6714)
kthui Jan 2, 2024
14f70b6
Update RE2 package location (#6750)
Tabrizian Jan 3, 2024
6bc5625
Add testing for zero tensors in PyTorch backend (#6760)
Tabrizian Jan 3, 2024
fb3747a
Fix L0_lifecycle on insufficient hardware concurrency (#6762)
kthui Jan 4, 2024
1ea633a
Simplify cmake install command (#6725)
Tabrizian Jan 5, 2024
b48aa57
Add cmdline option to set model load retry. Add test (#6764)
GuanLuo Jan 5, 2024
8af13e9
Increase timeout (#6774)
fpetrini15 Jan 8, 2024
b5f1f7d
Move from jfrog artifactory to archives.boost.io to fix boost downloa…
rmccorm4 Jan 8, 2024
310c38c
Add Triton Inference Server In-Process Python API Tests
nnshah1 Jan 11, 2024
fa72cf2
Bring back Python backend based PyTorch backend (#6518)
kthui Jan 11, 2024
2782d30
Bump min cxx standard to 17 (#6742)
oandreeva-nv Jan 11, 2024
cf5fe35
Add tests/docs for Pinned Memory metrics (#6754)
pskiran1 Jan 12, 2024
c205451
Update 'main' to track development of 2.42.0 / 24.02 (#6786)
mc-nv Jan 12, 2024
4a719e4
Support for Context Propagation for OTel trace mode (#6785)
oandreeva-nv Jan 17, 2024
87165b2
Use current time when overwriting model configuration. (#6727)
whoisj Jan 17, 2024
7b06a37
Added docs for otel context propagation (#6804)
oandreeva-nv Jan 18, 2024
b6e017e
Fix typos in trace.md (#6808)
rmccorm4 Jan 18, 2024
3e79b2a
Fix test_model_config_overwite in L0_lifecycle (#6818)
GuanLuo Jan 19, 2024
7edeb9f
Improve L0_backend_python on shm reliability (#6803)
kthui Jan 19, 2024
3bff367
Remove boost::filesystem (#6810)
rmccorm4 Jan 22, 2024
bc71da0
Generate unittest xml reports from L0_python_api (#6822)
rmccorm4 Jan 23, 2024
6192c6e
Add unit test reports to L0_json, L0_metrics, L0_response_cache, L0_b…
rmccorm4 Jan 25, 2024
a514a05
Update trace summary script (#6758)
pskiran1 Jan 25, 2024
28f497c
Add gsutil upload retry helper function (#6817)
kthui Jan 25, 2024
ddfdb2a
Add test for shutdown while unloading in background (#6835)
kthui Jan 27, 2024
56e4232
Handle 0 dimension output for generate endpoint (#6833)
krishung5 Jan 29, 2024
d98a59c
tensorrt-llm benchmarking test (#6771)
pskiran1 Jan 29, 2024
2309bce
Update README.md and versions post-24.01 (#6847)
mc-nv Jan 30, 2024
d0e2653
Use libmamba solver for L0_backend_python env test. Fix pytest not fo…
krishung5 Jan 30, 2024
f92732d
Add test for shutdown while loading model (#6837)
kthui Jan 31, 2024
776e641
Adding OpenTelemetry Batch Span Processor (#6842)
oandreeva-nv Feb 1, 2024
b0a495a
Support Double-Type Inference Request/Response Parameters (#6755)
fpetrini15 Feb 1, 2024
508929a
Updating vllm version to 0.3.0 (#6858)
oandreeva-nv Feb 7, 2024
738c98f
Python Backend Windows Support (#6830)
fpetrini15 Feb 8, 2024
3d79568
Add support for Oracle Cloud in deploy (#6850)
bruno-garbaccio Feb 9, 2024
1df73dc
Add link to TRTLLM metrics docs (#6874)
rmccorm4 Feb 13, 2024
4294cc6
Add unit test reports to L0_dlpack_multi_gpu and L0_warmup (#6873)
krishung5 Feb 14, 2024
f078bfb
Set OV version to 2023.3.0 (#6880)
kthui Feb 14, 2024
80fc56c
Fixing StringTo uint32_t used only by tracing (#6883)
oandreeva-nv Feb 14, 2024
8a2a229
Update 'main' to track development of 2.44.0 / 24.03 (#6892)
mc-nv Feb 16, 2024
59e267f
Add response statistics (#6869)
kthui Feb 17, 2024
21a7fc5
Fix busyop test for L0_memory_growth (#6900)
krishung5 Feb 22, 2024
60872b9
Add cancellation into response statistics (#6904)
kthui Feb 23, 2024
8d8b607
Install required pip pkgs (#6906)
krishung5 Feb 24, 2024
adafa4f
Match forward headers case insensitively. (#6889)
yinggeh Feb 27, 2024
551978b
Add note on --cache-config spacing and fix typos (#6929)
rmccorm4 Mar 1, 2024
246f46c
Remove ignore files that are not in use by repository (#6893)
mc-nv Mar 2, 2024
1dcf2cf
Update README and versions for 2.43.0 / 24.02 (#6886)
mc-nv Feb 15, 2024
9be77f1
Set ONNX Runtime version 1.17.2
mc-nv Mar 1, 2024
19b02a2
Expose tritonserver args in values.yaml (#5582)
okyspace Mar 4, 2024
d0f332b
Parameterize git repository (#6934)
nv-kmcgill53 Mar 6, 2024
c2299d5
Enhance bound check for shm offset (#6914)
kthui Mar 8, 2024
110251b
Allow non-decoupled model to send response and FINAL flag separately …
GuanLuo Mar 8, 2024
25266a5
Add test for max queue delay timeout prompt response (#6938)
kthui Mar 8, 2024
b012bd0
Test improved input validation errors (#6933)
indrajit96 Mar 9, 2024
52a1cd2
Update Dockerfile.sdk with OpenAI support (#6941)
tgerdesnv Mar 11, 2024
b2e6e7e
Test Correlation Id string support for BLS (#6963)
pskiran1 Mar 11, 2024
9786e40
Update 'main' to track development of 2.45.0 / 24.04 (#6974)
mc-nv Mar 11, 2024
e92abf2
Add AsyncIO HTTP compression test (#6975)
kthui Mar 13, 2024
8139431
Install `genai-pa` into SDK container (#6942)
mc-nv Mar 13, 2024
5c6e487
extend existing tests with more parameters (#6951)
yf711 Mar 15, 2024
9f16eef
Exposing trace context to python backend (#6985)
oandreeva-nv Mar 15, 2024
8b36aa8
Add documentation for mapping between Triton Errors and HTTP status c…
Tabrizian Mar 19, 2024
afaa6f4
Remove hatch version (#7009)
tgerdesnv Mar 21, 2024
fdbfb27
Update vLLM to 0.3.2 for gemma support (#6918)
kebe7jun Mar 21, 2024
2be127b
Add missing copyright for L0_trace (#6996)
oandreeva-nv Mar 25, 2024
df753d7
fix sphinx warnings (#7030)
yinggeh Mar 25, 2024
a844eda
Add meetup invite banner (#7049)
rmccorm4 Mar 27, 2024
8a208d7
Update 'main' post-24.03 (#7051)
mc-nv Apr 1, 2024
1dfa33d
Fix incorrect version updates (#7073)
Tabrizian Apr 4, 2024
879a505
Update compose.py and remove mention of tensorflow1 in documentation …
jbkyang-nvi Apr 4, 2024
e9e3648
Add testing for iterative scheduler backlogged requests (#7059)
Tabrizian Apr 5, 2024
dbeb198
Remove conda package manager (#7069)
mc-nv Apr 5, 2024
e1d58c7
fix link (#7044)
yinggeh Apr 5, 2024
74660f1
Add Documentation from Additional Repositories to nvidia.docs.com (#7…
yinggeh Apr 5, 2024
2150fc2
Fix html image rendering in sphinx documentation (#7084)
tanmayv25 Apr 8, 2024
cbd6967
Remove obsolete mention of image tags (#7085)
tanmayv25 Apr 9, 2024
aff4b93
HTTP live connections on server shutdown (#6986)
kthui Apr 9, 2024
10f1c8d
Enable autodocs for python client library API documentation (#7082)
tanmayv25 Apr 9, 2024
5e20ef6
Updated vllm version (#7095)
oandreeva-nv Apr 10, 2024
52f97b5
Disable Dynamic Log File (#7092)
yinggeh Apr 11, 2024
b1663e9
Initial commit
fpetrini15 Mar 15, 2024
14dacb3
Fix GPU case
fpetrini15 Mar 19, 2024
a8914c4
Validate offset
fpetrini15 Mar 19, 2024
3de58c9
Open shm file, don't create
fpetrini15 Mar 20, 2024
be0d10e
Intermmediate commit: Major test restructuring.
fpetrini15 Mar 26, 2024
6d99076
Formatting
fpetrini15 Mar 27, 2024
ef2c735
Gitbot Fixes
fpetrini15 Mar 27, 2024
35aa5cc
Major software bloat refactor. Opaque shm file handle
fpetrini15 Apr 4, 2024
9df15bd
Fixes for Unix and handle-agnostic get restructure
fpetrini15 Apr 6, 2024
76f9f2b
Review comments. Passing ShmFile pointer instead of void
fpetrini15 Apr 11, 2024
159b060
Validate system shared memory region size when registering a region (…
rmccorm4 Apr 11, 2024
196caf0
Decoupled Async Execute (#7062)
kthui Apr 11, 2024
5b739db
Add trace mode and trace config entries in trace settings API (#7050)
indrajit96 Apr 11, 2024
2d206e6
Merge remote-tracking branch 'origin/main' into fpetrini-win-cpu-shm
fpetrini15 Apr 11, 2024
06d1e6b
Open backing file to validate shared memory
fpetrini15 Apr 13, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
6 changes: 4 additions & 2 deletions .clang-format
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,8 @@
BasedOnStyle: Google

IndentWidth: 2
ContinuationIndentWidth: 2
ColumnLimit: 80
ContinuationIndentWidth: 4
UseTab: Never
MaxEmptyLinesToKeep: 2

Expand Down Expand Up @@ -34,4 +35,5 @@ BinPackArguments: true
BinPackParameters: true
ConstructorInitializerAllOnOneLineOrOnePerLine: false

IndentCaseLabels: true
IndentCaseLabels: true

24 changes: 24 additions & 0 deletions .github/ISSUE_TEMPLATE/bug_report.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,24 @@
---
name: Bug report
about: Create a report to help us improve
title: ''
labels: ''
assignees: ''

---

**Description**
A clear and concise description of what the bug is.

**Triton Information**
What version of Triton are you using?

Are you using the Triton container or did you build it yourself?

**To Reproduce**
Steps to reproduce the behavior.

Describe the models (framework, inputs, outputs), ideally include the model configuration file (if using an ensemble include the model configuration file for that as well).

**Expected behavior**
A clear and concise description of what you expected to happen.
20 changes: 20 additions & 0 deletions .github/ISSUE_TEMPLATE/feature_request.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
---
name: Feature request
about: Suggest an idea for this project
title: ''
labels: ''
assignees: ''

---

**Is your feature request related to a problem? Please describe.**
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

**Describe the solution you'd like**
A clear and concise description of what you want to happen.

**Describe alternatives you've considered**
A clear and concise description of any alternative solutions or features you've considered.

**Additional context**
Add any other context or screenshots about the feature request here.
84 changes: 84 additions & 0 deletions .github/workflows/codeql.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,84 @@
# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# Redistribution and use in source and binary forms, with or without
# modification, are permitted provided that the following conditions
# are met:
# * Redistributions of source code must retain the above copyright
# notice, this list of conditions and the following disclaimer.
# * Redistributions in binary form must reproduce the above copyright
# notice, this list of conditions and the following disclaimer in the
# documentation and/or other materials provided with the distribution.
# * Neither the name of NVIDIA CORPORATION nor the names of its
# contributors may be used to endorse or promote products derived
# from this software without specific prior written permission.
#
# THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS ``AS IS'' AND ANY
# EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
# IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
# PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR
# CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL,
# EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO,
# PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
# PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY
# OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
# (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
# OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

name: "CodeQL"

on:
pull_request:

jobs:
analyze:
name: Analyze
runs-on: ubuntu-latest
permissions:
actions: read
contents: read
security-events: write

strategy:
fail-fast: false
matrix:
language: [ 'python' ]
# CodeQL supports [ 'cpp', 'csharp', 'go', 'java', 'javascript', 'python', 'ruby' ]
# Learn more about CodeQL language support at https://aka.ms/codeql-docs/language-support

steps:
- name: Checkout repository
uses: actions/checkout@v3

# Initializes the CodeQL tools for scanning.
- name: Initialize CodeQL
uses: github/codeql-action/init@v2
with:
languages: ${{ matrix.language }}
# If you wish to specify custom queries, you can do so here or in a config file.
# By default, queries listed here will override any specified in a config file.
# Prefix the list here with "+" to use these queries and those in the config file.

# Details on CodeQL's query packs refer to:
# https://docs.github.com/en/code-security/code-scanning/automatically-scanning-your-code-for-vulnerabilities-and-errors/configuring-code-scanning#using-queries-in-ql-packs
queries: +security-and-quality


# Autobuild attempts to build any compiled languages (C/C++, C#, Go, or Java).
# If this step fails, then you should remove it and run the build manually (see below)
- name: Autobuild
uses: github/codeql-action/autobuild@v2

# Command-line programs to run using the OS shell.
# See https://docs.github.com/en/actions/using-workflows/workflow-syntax-for-github-actions#jobsjob_idstepsrun

# If the Autobuild fails above, remove it and uncomment the following three lines.
# modify them (or add more) to build your code if your project, please refer to the EXAMPLE below for guidance.

# - run: |
# echo "Run, Build Application using script"
# ./location_of_script_within_repo/buildscript.sh

- name: Perform CodeQL Analysis
uses: github/codeql-action/analyze@v2
with:
category: "/language:${{matrix.language}}"
39 changes: 39 additions & 0 deletions .github/workflows/pre-commit.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,39 @@
# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# Redistribution and use in source and binary forms, with or without
# modification, are permitted provided that the following conditions
# are met:
# * Redistributions of source code must retain the above copyright
# notice, this list of conditions and the following disclaimer.
# * Redistributions in binary form must reproduce the above copyright
# notice, this list of conditions and the following disclaimer in the
# documentation and/or other materials provided with the distribution.
# * Neither the name of NVIDIA CORPORATION nor the names of its
# contributors may be used to endorse or promote products derived
# from this software without specific prior written permission.
#
# THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS ``AS IS'' AND ANY
# EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
# IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
# PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR
# CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL,
# EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO,
# PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
# PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY
# OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
# (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
# OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

name: pre-commit

on:
pull_request:

jobs:
pre-commit:
runs-on: ubuntu-22.04
steps:
- uses: actions/checkout@v3
- uses: actions/setup-python@v3
- uses: pre-commit/[email protected]

19 changes: 8 additions & 11 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -1,11 +1,8 @@
/bazel-bin
/bazel-ci_build-cache
/bazel-genfiles
/bazel-trtserver
/bazel-out
/bazel-serving
/bazel-tensorflow
/bazel-tensorflow_serving
/bazel-testlogs
/bazel-tf
/bazel-workspace
/build
/builddir
/.vscode
*.so
__pycache__
tmp
*.log
test_results.txt
74 changes: 74 additions & 0 deletions .pre-commit-config.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,74 @@
# Copyright 2023, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# Redistribution and use in source and binary forms, with or without
# modification, are permitted provided that the following conditions
# are met:
# * Redistributions of source code must retain the above copyright
# notice, this list of conditions and the following disclaimer.
# * Redistributions in binary form must reproduce the above copyright
# notice, this list of conditions and the following disclaimer in the
# documentation and/or other materials provided with the distribution.
# * Neither the name of NVIDIA CORPORATION nor the names of its
# contributors may be used to endorse or promote products derived
# from this software without specific prior written permission.
#
# THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS ``AS IS'' AND ANY
# EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
# IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
# PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR
# CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL,
# EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO,
# PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR
# PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY
# OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
# (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
# OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

repos:
- repo: https://github.com/timothycrosley/isort
rev: 5.12.0
hooks:
- id: isort
additional_dependencies: [toml]
- repo: https://github.com/psf/black
rev: 23.1.0
hooks:
- id: black
types_or: [python, cython]
- repo: https://github.com/PyCQA/flake8
rev: 5.0.4
hooks:
- id: flake8
args: [--max-line-length=88, --select=C,E,F,W,B,B950, --extend-ignore = E203,E501]
types_or: [python, cython]
- repo: https://github.com/pre-commit/mirrors-clang-format
rev: v16.0.5
hooks:
- id: clang-format
types_or: [c, c++, cuda, proto, textproto, java]
args: ["-fallback-style=none", "-style=file", "-i"]
- repo: https://github.com/codespell-project/codespell
rev: v2.2.4
hooks:
- id: codespell
additional_dependencies: [tomli]
args: ["--toml", "pyproject.toml"]
exclude: (?x)^(.*stemmer.*|.*stop_words.*|^CHANGELOG.md$)
# More details about these pre-commit hooks here:
# https://pre-commit.com/hooks.html
- repo: https://github.com/pre-commit/pre-commit-hooks
rev: v4.4.0
hooks:
- id: check-case-conflict
- id: check-executables-have-shebangs
- id: check-merge-conflict
- id: check-json
- id: check-toml
- id: check-yaml
exclude: ^deploy(\/[^\/]+)*\/templates\/.*$
- id: check-shebang-scripts-are-executable
- id: end-of-file-fixer
types_or: [c, c++, cuda, proto, textproto, java, python]
- id: mixed-line-ending
- id: requirements-txt-fixer
- id: trailing-whitespace
7 changes: 7 additions & 0 deletions CITATION.cff
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
cff-version: 1.2.0
message: "If you use this software, please cite it as below."
title: "Triton Inference Server: An Optimized Cloud and Edge Inferencing Solution."
url: https://github.com/triton-inference-server
repository-code: https://github.com/triton-inference-server/server
authors:
- name: "NVIDIA Corporation"
Loading