453 commits
4316421
Move ConvParams methods directly on struct (#89062)
ezyang Nov 15, 2022
d96dd8f
Add int64_t, SymInt overloads for all binary operators in C++ (#89063)
ezyang Nov 15, 2022
9f0b2c7
Revert "[Inductor] Build FX Linear + Permute Vertical Fusion in Induc…
pytorchmergebot Nov 16, 2022
46ba015
Increase slow grad check timeout (#89079)
huydhn Nov 16, 2022
397f100
[FSDP] Test `named_parameters()` in forward (`use_orig_params=True`) …
Nov 15, 2022
b291c12
Create native function for determining which implementation of SDP to…
drisspg Nov 16, 2022
ce2f870
Symintify numel(), infer_size, prims.elementwise_meta (#88956)
SherlockNoMad Nov 15, 2022
8ebbd5a
Easier to understand event_dim computation (#81396)
johannespitz Nov 16, 2022
e2f0648
Add an option to include actual license terms to the output (#85624)
kmaehashi Nov 16, 2022
7e66d1d
[Inductor] Support Shape Padding for aten.mm in Inductor (#89086)
jiawenliu64 Nov 16, 2022
59ba15f
Upload CSV test reports from inductor (#89112)
huydhn Nov 16, 2022
370fc5c
[dtensor] PART 1: move DeviceMesh and placement to core distributed (…
wanchaol Nov 15, 2022
4b94596
[dtensor] PART 2: move DTensor abstraction and APIs to core distribut…
wanchaol Nov 15, 2022
2dcf097
[dtensor] PART 3: move most DTensor ops to core distributed (#88177)
wanchaol Nov 15, 2022
1b88476
[dtensor] PART 4: move remaining DTensor ops to core distributed (#88…
wanchaol Nov 15, 2022
527c5bd
[dtensor] PART 5: move DTensor basic tests to core distributed (#88178)
wanchaol Nov 15, 2022
550a019
[dtensor] PART 6: move DTensor op tests to core distributed (#88551)
wanchaol Nov 15, 2022
0230e52
[dtensor] PART 7: move remaining DTensor tests to core distributed (#…
wanchaol Nov 15, 2022
f20b3f2
[dtensor] PART 8: move tensor parallel api and tests to core distribu…
wanchaol Nov 15, 2022
9d2f5a2
[dynamo] Support if cond on NNModuleVariable (#89095)
anijain2305 Nov 16, 2022
9d28775
Revert "Rewrite assert statement with torch._assert under config (#88…
pytorchmergebot Nov 16, 2022
5270122
[Inductor] Build FX Linear + Permute Vertical Fusion in Inductor (#89…
jiawenliu64 Nov 16, 2022
dc40d3f
Add meta impl for grid_sampler_2d_backward (#88745)
anjali411 Nov 15, 2022
57af0c8
Bug fix: make sure `copy_impl` doesn't read out of bounds (#88544)
nkaretnikov Nov 16, 2022
5e0c013
SymIntArrayRef type caster (#89074)
ezyang Nov 15, 2022
09ed8b6
SymIntify convolution backend calculation (#89069)
ezyang Nov 15, 2022
37d5423
Towards unifying symbolic and non symbolic fake tensor (#89038)
ezyang Nov 16, 2022
9fe36a0
[ONNX] Extra support for bernoulli export (#88655)
mindest Nov 16, 2022
abe41ae
[ONNX] Support custom Op with onnx-script local function (#86906)
titaiwangms Nov 16, 2022
cf4b4b1
Fix python types in pybind function signatures (#89115)
AngelJA Nov 16, 2022
90db86b
Revert "SymIntify convolution backend calculation (#89069)"
pytorchmergebot Nov 16, 2022
fe276ea
[UCC] Add pre & post processing for CPU collectives (#89030)
kirteshpatil Nov 16, 2022
cf6003f
Revert "Towards unifying symbolic and non symbolic fake tensor (#89038)"
pytorchmergebot Nov 16, 2022
7f55db4
add quantize_decomposed_dynamic to op lib (#88855)
JacobSzwejbka Nov 16, 2022
d2d22d8
test_unary_ufuncs few tests enabled on rocm which are passing (#89007)
bmedishe Nov 16, 2022
e1ecf53
Simplify linspace decomp and increase its tolerance (#87203)
lezcano Nov 16, 2022
3320915
Fix decomp for embedding_backward and simplify the decomposition of e…
lezcano Nov 16, 2022
58ebf92
Add bfloat16 support to torch.prod to align with torch.cumprod (#87205)
lezcano Nov 16, 2022
7b0adc2
Run tests from test/inductor in inductor CI job (#88957)
peterbell10 Nov 16, 2022
a6ef2c7
Support test-config filter logic for rocm (#89046)
huydhn Nov 16, 2022
6180179
[Quant][bc-breaking] Remove overwrite_output_observer (#88620)
andrewor14 Nov 15, 2022
848e724
[Dynamo] Add a dummy profiler to avoid activating real profiler (#88930)
yanboliang Nov 16, 2022
0581331
[ONNX] Document ONNX diagnostics (#88371)
BowenBao Nov 14, 2022
6b521bb
Prevent module full_backward_hook from erroring in double backward (#…
soulitzer Nov 16, 2022
aee96bb
[PT-D][Checkpointing] Move distributed checkpointing from torch.distr…
wz337 Nov 16, 2022
45c62a3
[ao] making _is_activation_post_process private (#87520)
HDCharles Nov 16, 2022
4908a12
Reland "SymIntify convolution backend calculation (#89069)"" (#89142)
ezyang Nov 16, 2022
305b9b1
Fix XLASymNode.str() no str() attribute error (#89093)
Krovatkin Nov 16, 2022
640af8d
More dynamo dashboard improvements (#89155)
williamwen42 Nov 16, 2022
e70f446
[Dynamo] Fix bug in NamedTupleVariable (#89110)
yanboliang Nov 16, 2022
ee1d375
[FSDP] Add fast path for `NO_SHARD` `clip_grad_norm_()` (#89137)
Nov 16, 2022
5848704
Removed unecessary check in `select_nested` (#89150)
mikaylagawarecki Nov 16, 2022
ec61951
Fix inaccuracy in nt constructor documentation + broken rendering (#8…
mikaylagawarecki Nov 16, 2022
8ba62bd
add test_c10d_spawn_ucc.py (#86508)
Fuzzkatt Nov 16, 2022
f920bfa
Use torchrun for dynamo/distributed.py (#89149)
wconstab Nov 16, 2022
98379a3
[ONNX] Add onnx-script test cases (#86907)
titaiwangms Nov 16, 2022
0c835e2
Fix nightly build binary errors (#89153)
atalman Nov 17, 2022
8506b30
handle scatter(Scalar) overload in inductor (#88894)
Krovatkin Nov 17, 2022
cfd5525
Use the Python frame safely in _pythonCallstack (#88993)
charlie-wt Nov 17, 2022
3af5cf4
doc(typo): memroy -> memory (#89126)
rmax Nov 17, 2022
80b6761
Update README.md (#85534)
jmdetloff Nov 17, 2022
0d87a4f
Fix typo in Dispatcher.h (#89045)
keineahnung2345 Nov 17, 2022
251fdda
Add pytest-flakefinder as a test dependency (#89103)
huydhn Nov 17, 2022
716f70f
Added conv constraint that infers layouts (#89031)
Chillee Nov 16, 2022
088f2fa
Fix typos in messages under test (#89121)
kiszk Nov 17, 2022
f5e2cb5
Add comprehensive minifier tests (#88022)
williamwen42 Nov 17, 2022
30d9fb9
[dynamo][reland] API Support for nn.Module (#89113)
anijain2305 Nov 17, 2022
ac0a6f3
[dtensor] disable op db tests for now (#89162)
wanchaol Nov 16, 2022
f73d9a7
[torch][fx] Fix PassManager to not use a class variable mutable list …
dulinriley Nov 17, 2022
f3af5ba
[WIP] Composable API: `replicate` and `DistributedState` (#87649)
yhcharles Nov 15, 2022
c3acb9c
[ONNX] Add Internal Utils: onnx_proto_utils.py for onnx/onnx-script/o…
titaiwangms Nov 16, 2022
fce6d6b
Redefine the simdlen semantic: (#88482)
EikanWang Nov 16, 2022
573eaf1
Analyze and upload disabled tests rerun to S3 (#89083)
huydhn Nov 17, 2022
a5f04e9
Fix typos in .md and .rst files (#88962)
kiszk Nov 17, 2022
1adb7b9
[nn][utils] Preserve requires_grad from original weight and bias in f…
jerryzh168 Nov 16, 2022
366f1b2
[xnnpack][lite-int] Freeze/Inline module to remove reference to self …
mcr229 Nov 16, 2022
d1f48f0
[xnnpack][Bug Fix] Pass serialized model by reference (#89089)
mcr229 Nov 16, 2022
1cd6ebe
Fix typos in messages under torch (#89049)
kiszk Nov 17, 2022
24b9890
[torchrec] [composable] update ShardedEmbeddingBagCollection to be us…
colin2328 Nov 17, 2022
637e764
[xnnpack][executorch] Pass xnnexecutor pointer to compileModel() (#89…
mcr229 Nov 16, 2022
44c9185
Fix empty input issue of convolution for channels last memory format …
CaoE Nov 17, 2022
81a8fdc
[MPS] Add binary operations dtype precedence test case (#87545)
lhoenig Nov 17, 2022
4e1d19c
Revert "Redefine the simdlen semantic: (#88482)"
pytorchmergebot Nov 17, 2022
54fca6a
Fix: prefer .is_none() over .is(py::none()) for pybind11 in caffe2 (#…
Skylion007 Nov 17, 2022
70fb673
Use software approach to catch overflow ( `c10/utils/safe_numerics.h`…
Rachel030219 Nov 17, 2022
a41f706
Round out rad2deg sparse support (#88442)
amjames Nov 17, 2022
74610a1
[dynamo][benchmarks] HF - Fix seq len and batch sizes (#89165)
anijain2305 Nov 17, 2022
126e441
[ONNX] Add onnx-script into ONNX docs (#89078)
titaiwangms Nov 17, 2022
b72f5b9
[Dynamo] Support typing.Mapping & Support function as argument (#88963)
yanboliang Nov 17, 2022
37c85cf
Add warning if tensor cores are not used (#88844)
msaroufim Nov 17, 2022
3beccbc
Add BFloat16 support and optimization for mish, hardtanh backward, an…
CaoE Nov 17, 2022
bdc9911
Fix typo in dist_util.py (#89167)
wconstab Nov 16, 2022
e686b8c
Reland "Towards unifying symbolic and non symbolic fake tensor (#8903…
ezyang Nov 17, 2022
2b131b1
Support masked_fill (#88736)
EikanWang Nov 17, 2022
cd81a70
Fix buffer overflow from AddressSanitizer checks due to inaccurate bf…
Nov 17, 2022
8e4c982
Revert "Reland "Towards unifying symbolic and non symbolic fake tenso…
pytorchmergebot Nov 17, 2022
706f791
Revert "Support masked_fill (#88736)"
pytorchmergebot Nov 17, 2022
af448e8
Fix bug in dynamo dashboard summary stats diff (#89226)
williamwen42 Nov 17, 2022
04169c5
Rewrite assert statement with torch._assert under config (#88246)
tugsbayasgalan Nov 15, 2022
e856a4d
Add an env var to skip cudnn version compatibility check (#89184)
xwang233 Nov 17, 2022
f057a45
reland "support running test_mobile_profiler with buck1/buck2 and OSS…
Nov 17, 2022
fbbf368
Fix distributed test paths when running periodic multigpu job (#89225)
huydhn Nov 17, 2022
767f6aa
[JIT][Security] Do not blindly eval input string (#89189)
malfet Nov 17, 2022
a695fcf
Add tests for replicate multiple modules (#89099)
yhcharles Nov 17, 2022
e2229a8
Fix typo in aten/src/README.md (#89175)
keineahnung2345 Nov 17, 2022
3d8a853
[DataPipe] Add container template for _Fork and _Demux (#89216)
ejguan Nov 17, 2022
31b10e7
Enable inductor CI for TorchBench (#87465)
desertfire Nov 17, 2022
2b3ac87
feat: adding view_copy_batch_rule and opinfo for view_copy (#88150)
srossross Nov 17, 2022
57e05e8
Issue 68576 prefetch factor (#88972)
blankstatic Nov 18, 2022
177621a
Use pytest-flakefinder to rerun tests multiple times (#89106)
huydhn Nov 18, 2022
b652fbc
Fix torch.nn.functional.gelu docstring formatting (#89061)
cherrywoods Nov 18, 2022
9fd00f1
Fix the kineto daemon build condition (#89174)
dzhulgakov Nov 18, 2022
fd0efb0
[MPS] Support for median with dim (#88807)
Raman-Kumar Nov 18, 2022
92f9214
add -Wnarrowing as error to cmake builds (#89207)
Nov 17, 2022
65bcd1f
Add previously deleted circleci readme back to repo (#85598)
jmdetloff Nov 18, 2022
3c2676d
[LTC] Restore GetPythonFrames (#89122)
alanwaketan Nov 18, 2022
6ed14c7
[vision hash update] update the pinned vision hash (#89102)
pytorchmergebot Nov 18, 2022
f4efc5e
[quant][be] Move some helper functions to the top level to reduce fun…
jerryzh168 Nov 18, 2022
6f4f69f
[Executorch] [Quantization] New pattern for dynamic dequant (#89236)
JacobSzwejbka Nov 18, 2022
6f609dd
docs: conv2d `padding` attribute- add `int` option (#85004)
YoniChechik Nov 18, 2022
ba5e39e
Fix tol for test_nvfuser_correctness__softmax_backward_data_cuda (#89…
SherlockNoMad Nov 17, 2022
f1fb586
Symintify repeat_interleave.self_int (#89111)
SherlockNoMad Nov 17, 2022
3bc7829
Fix consistentcy of histc on CPU and CUDA (#87832)
Aidyn-A Nov 18, 2022
ab75982
Always retry curl downloads (#89157)
ZainRizvi Nov 18, 2022
7beb151
[xnnpack][executorch] remove unordered_set from xnn_compiler (#89231)
mcr229 Nov 17, 2022
fc1c0cd
Add support trace on MPS backend (#87910)
pumetu Nov 18, 2022
6a964c1
[flaky] relax tolerance conv1d_vs_scipy (#89193)
kshitij12345 Nov 18, 2022
afdc48f
Gate CUDA-only inductor tests by HAS_CUDA (#89251)
huydhn Nov 18, 2022
30c3e5a
Disable tracing `zero_grad()` (#88731)
mlazos Nov 18, 2022
c5fafb4
[ao] maintain BC for is_activation_post_process (#89260)
HDCharles Nov 18, 2022
2dcacc6
[LTC] Upstream short_metrics (#89186)
alanwaketan Nov 18, 2022
4c67249
[PT-D][Checkpoint] Update import and update docstring for distributed…
wz337 Nov 18, 2022
5654fed
Export c10/[macros|util] headers to be used by internal inductor buil…
malfet Nov 18, 2022
2e358cc
Add platform markers for linux only extra_install_requires (#88826)
JacobHayes Nov 18, 2022
ce0e22a
Fix names of some reference functions (#88115)
lezcano Nov 18, 2022
6741443
Simplify maybe_resize_out (#88116)
lezcano Nov 18, 2022
154e58c
Add most in-place references/decompositions (#88117)
lezcano Nov 18, 2022
55e55d9
Update torch.distributed.DistBackendError type (#89235)
H-Huang Nov 18, 2022
1f7c0ff
[inductor] Temporarily disable functorch_dp_cifar10 test in TorchBenc…
desertfire Nov 18, 2022
19fcb80
[inductor] Skip DALLE2_pytorch in torchbench (#89288)
desertfire Nov 18, 2022
19e66fc
[Quant] Allow setting fixed qparams for inner LSTM ops (#88456)
andrewor14 Nov 17, 2022
12a9744
[xplat] remove -weak_framework (#89233)
rmaz Nov 18, 2022
c219b55
Use standard __func__ macro in symbolic shape. (#89264)
zhxchen17 Nov 18, 2022
38ccd08
[quant][fx][be] Refactor replace observer with q/dq op code (#89247)
jerryzh168 Nov 18, 2022
8a419cb
Added partial decomposition of conv_backward and grad_bias computatio…
Chillee Nov 18, 2022
81a4aea
[Dynamo] Support Tensor.nelement & torch.cuda.is_available (#89164)
yanboliang Nov 18, 2022
7ec8a4d
Vectorized horizontal flip implementation (#88989)
vfdev-5 Nov 18, 2022
ee2ce3f
Set make max load when building libtorch (#89237)
huydhn Nov 18, 2022
837ca8f
Remove --retry-all-errors from environment with old curl (#89298)
ZainRizvi Nov 18, 2022
e04dc35
Symintify obeys_layout_contract (#89138)
ezyang Nov 18, 2022
ba605c3
Don't trace when we track_tensor_tree (#89139)
ezyang Nov 18, 2022
304b5de
Re-enable test_hf_bert_fsdp (#89223)
davidberard98 Nov 17, 2022
bfffc8d
[DDP][Docs] Add warning that `no_sync()` should include forward (#89244)
Nov 17, 2022
35d5fc5
[Profiler] Don't raise SOFT_ASSERT in debug builds. (#89240)
Nov 17, 2022
7551136
Add NVTX markers that dump additional information for nvprim_nvfuser …
blongnv Nov 18, 2022
ecfb4e0
[Inductor CI] Use string format for cuda-arch-list input to prevent 8…
weiwangmeta Nov 18, 2022
0e1fcc8
[FX] Add type annotation to `getitem` node before `split_module` (#88…
lequytra Nov 18, 2022
885f8a5
[BE] Print backtraces from coredumps (#89309)
malfet Nov 18, 2022
94b5c80
Detach fake tensors into val, so they aren't affected by metadata mut…
ezyang Nov 18, 2022
c3938bb
[functorch] introduce an experimental map() op. (#88767)
zhxchen17 Nov 19, 2022
ee90737
[small] Update error message (#89294)
H-Huang Nov 19, 2022
cad5772
[dashboard][huggingface] skip accuracy checks for really large models…
anijain2305 Nov 19, 2022
ea58955
Move bazel to c++17 (#89297)
malfet Nov 19, 2022
85a87e6
[dynamo] mutable local caching to make dynamo faster at tracing mutat…
mlazos Nov 19, 2022
2e72ec7
Update sdp dispatch logic to enable fused backward (#89154)
drisspg Nov 19, 2022
8c0515d
cast C++ py-bound SymNode to SymInt correctly (#89295)
Krovatkin Nov 19, 2022
e6996ea
Don't redefine __STDC_FORMAT_MACROS (#89310)
ppwwyyxx Nov 19, 2022
631baec
Add --explain flag to bench (#89316)
voznesenskym Nov 19, 2022
7a2930b
add jvp test with non-contig inputs (#89131)
kshitij12345 Nov 19, 2022
419ef2c
Added utility to count memory reads/written in Inductor (#89203)
Chillee Nov 18, 2022
808bdba
Fix try/except flow where DataDependentOutputException is getting wra…
voznesenskym Nov 19, 2022
940959e
[quant][fix] Add quant_min/quant_max for default dynamic quantization…
jerryzh168 Nov 19, 2022
6daf60b
[ONNX] Add setType from user into InferredType and Reliable in Consta…
titaiwangms Nov 19, 2022
6b8c1b1
RM expectedFailure UnspecReproTests.test_batch_norm_act_unspec (#89340)
voznesenskym Nov 19, 2022
6afe341
[PT-D][1/N] Sync TP Beta change to prod (#89242)
fduwjj Nov 19, 2022
5582001
Reland 2 "Towards unifying symbolic and non symbolic fake tensor (#89…
ezyang Nov 19, 2022
8ac58bc
Add nullptr_t overload to c10::intrusive_ptr (#89196)
kvathupo Nov 19, 2022
8ad3953
Revert "Symintify numel(), infer_size, prims.elementwise_meta (#88956)"
pytorchmergebot Nov 19, 2022
7c811ef
Add support for dynamic kwarg to torch._dynamo.optimize (#89290)
ezyang Nov 19, 2022
caf3d53
Symintify numel(), infer_size, prims.elementwise_meta (#88956)
SherlockNoMad Nov 19, 2022
dbeacf1
Fix cat striding in PrimTorch (#89332)
ezyang Nov 20, 2022
7b0d577
Set INTERFACE_LINK_DIRECTORIES on caffe2::mkl (#89359)
ezyang Nov 20, 2022
c099296
Also include MKL_THREAD_LIB in link libraries for caffe2::mkl (#89378)
ezyang Nov 20, 2022
e1d58b1
Revert "Update sdp dispatch logic to enable fused backward (#89154)"
pytorchmergebot Nov 20, 2022
1856fa5
Temporary increase ASAN shard 5 to 4xlarge (#89387)
huydhn Nov 20, 2022
51e961d
use std/libdevice erf in inductor (#89388)
Nov 21, 2022
1db5ce0
[vision hash update] update the pinned vision hash (#89287)
pytorchmergebot Nov 21, 2022
e0251de
[Easy] Use prepend arg to register forward hooks in quantize.py (#89391)
mrshenli Nov 20, 2022
79770d3
TorchDynamo: enable conv+relu6 fusion (#89265)
XiaobingSuper Nov 21, 2022
bc71638
Redefine the simdlen semantic (#89263)
EikanWang Nov 21, 2022
31708a7
TorchDynamo: enable conv+silu fusion (#89278)
XiaobingSuper Nov 21, 2022
a80e5e7
Update ideep for future performance improvement (#87966)
yanbing-j Nov 21, 2022
c2cf0bd
Move the OpInfo same-storage error to the autograd test (#88306)
lezcano Nov 21, 2022
6796979
[Inductor] Limit the number of compile threads to the available cpu c…
Nov 21, 2022
f3db036
Revert "[ao] maintain BC for is_activation_post_process (#89260)"
pytorchmergebot Nov 21, 2022
9d209e7
Revert "[ao] making _is_activation_post_process private (#87520)"
pytorchmergebot Nov 21, 2022
e4d9dbd
Port torchdynamo's torchbench script to userbenchmark (#89239)
xuzhao9 Nov 21, 2022
cf94765
update kineto pinned commit (#89435)
Nov 21, 2022
1d9e1fc
Update sdp dispatch logic to enable fused backward (#89154)
drisspg Nov 21, 2022
3d247a8
Fix unconvertible_ops as per #89261 (#89299)
kevalmorabia97 Nov 21, 2022
1267dcf
[inductor] Fix nan handling for aten.sign (#88937)
peterbell10 Nov 20, 2022
c068fa9
[inductor] Misc division lowering fixes (#88603)
peterbell10 Nov 20, 2022
047e542
[tools] expose selective build library (#89351)
larryliu0820 Nov 21, 2022
deae450
[1/n] Thread PG: add test for allgather (#89439)
wanchaol Nov 21, 2022
3876f94
[2/n] Thread PG: add test for broadcast (#89440)
wanchaol Nov 21, 2022
3e99d4d
[3/n] Thread PG: add scatter to threaded pg (#89441)
wanchaol Nov 21, 2022
821ba6b
[4/n] Thread PG: add reduce_scatter to threaded pg (#89442)
wanchaol Nov 21, 2022
186192b
[Dynamo] Fix bugs when calling tensor.data and tensor.layout (#89257)
yanboliang Nov 21, 2022
fa4980c
Add commit hash to dynamo dashboard (#89462)
williamwen42 Nov 21, 2022
ea50549
Suppress guards when creating fake tensors (#89349)
ezyang Nov 21, 2022
dbc354b
Mitigate flaky test_ops_fwd_gradients on macOS (#89410)
huydhn Nov 22, 2022
b189a74
[fix] tril & tril : out of bound check (#89384)
khushi-411 Nov 22, 2022
57ed948
Bind DispatchKey.Functionalonalize in pybind11 (#89452)
ezyang Nov 21, 2022
7174572
Add torchvis support to dist bench (#89324)
wconstab Nov 21, 2022
58a74f3
[17/N] Add _reduce_scatter_base custom op with CPU/CUDA implementatio…
H-Huang Nov 21, 2022
06dffb3
dont clone symints, dont clobber symint proxies (#88230)
ezyang Nov 21, 2022
120d200
Revert "Added conv constraint that infers layouts (#89031)" (#89451)
anijain2305 Nov 22, 2022
496c8ae
[xnnpack][lite-int] Handle Constant Data (#89445)
mcr229 Nov 21, 2022
82713a1
[inductor][compilation time] Fallback when kernel size for avg/max po…
anijain2305 Nov 22, 2022
00b9473
[PT-D][Tensor Parallelism][2/N] Sync TP API change to PT prod (#89467)
fduwjj Nov 22, 2022
338f619
[vision hash update] update the pinned vision hash (#89471)
pytorchmergebot Nov 22, 2022
ce342ed
Fix retrying logic for successful unittest tests under --rerun-disabl…
huydhn Nov 22, 2022
1dae59b
[Checkpoint][2D][1/N] Add dedup_tensors for distributed checkpoint to…
wz337 Nov 22, 2022
e545caa
dynamo/torchxla integration: trace on xla rather than eager (#88904)
shunting314 Nov 22, 2022
40cf214
Support masked_fill to address the GPT2 performance issue (#89274)
EikanWang Nov 21, 2022
f2cf1b0
Revert submodule updates introduced by #89157 (#89449)
malfet Nov 22, 2022
7b0650d
Back out "[static-runtime] change the backend for permute_copy" (#89463)
Nov 22, 2022
6b085d5
[Checkpoint][2D][2/N] Add traverse for distributed checkpoint to core…
wz337 Nov 22, 2022
1d6a188
Reland Dispatch torch.norm to linalg.vector_norm and linalg.matrix_no…
lezcano Nov 19, 2022
0f7dca1
Vectorized CPU code implementing right shift operator. (#88990)
alexsamardzic Nov 21, 2022
2d94fd3
[Vulkan][TCC] Fix quantized shaders (#89456)
manuelcandales Nov 22, 2022
d9cbe77
Make aten.copy preserve strides (hf_Longformer) (#89464)
ezyang Nov 22, 2022
be22b5d
[18/N] Add allgather_coalesced custom op with CPU/CUDA implementation…
H-Huang Nov 21, 2022
5797f74
[19/N] Add monitored_barrier custom op with CPU implementation (#89318)
H-Huang Nov 21, 2022
2823fc5
[inductor] generate nan in the cpp backend (#89289)
desertfire Nov 22, 2022
c4e0838
[quant][fx] Support producing reference quantized patterns for dynami…
jerryzh168 Nov 21, 2022
9c0bf93
Meta impl for linalg_cholesky and linalg_cholesky_ex (#89430)
anjali411 Nov 22, 2022
f4898da
Add cached conda env file for Buck CI workflow (#89422)
PratsBhatt Nov 22, 2022
7c0bb61
Force numpy prod to use 64 bit integers on Windows in some tests (#88…
mantaionut Nov 22, 2022
f281f43
Fix benchmarks - xla tensor test (#89509)
anijain2305 Nov 22, 2022
ef8b91f
enable previously failing UCC distributed_test.py tests (#89023)
Fuzzkatt Nov 22, 2022
c2ce79f
Fix dev-discuss link in the maintainer docs (#89493)
qqaatw Nov 22, 2022
d053d51
(Further) limit world size in test_fsdp_pure_fp16 (#86280)
Flamefire Nov 22, 2022
ac30047
Relax tolerance for test_out_addbmm_cpu_float32 (#86365)
Flamefire Nov 22, 2022
177baf3
Fix vectorized trigonometric functions for VSX (#86453)
Flamefire Nov 22, 2022
77d7f2c
[dashboard] Add commit date & fix date related issues (#89517)
williamwen42 Nov 22, 2022
00b7d8e
Shard windows periodic job more (#89455)
clee2000 Nov 22, 2022
74e62a1
[ROCm] Optimize layer norm backward kernel for ROCm (#87635)
hubertlu-tw Nov 22, 2022
d17ddf0
Enable maxpool_2d in NNC
Guobing-Chen Jul 18, 2022
aadf974
fix maxpool2d output buf dtype
Guobing-Chen Aug 29, 2022
2 changes: 1 addition & 1 deletion .bazelrc
@@ -1,4 +1,4 @@
-build --cxxopt=--std=c++14
+build --cxxopt=--std=c++17
 build --copt=-I.
 # Bazel does not support including its cc_library targets as system
 # headers. We work around this for generated code
468 changes: 468 additions & 0 deletions .circleci/README.md

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion .circleci/docker/build.sh
@@ -81,7 +81,7 @@ fi

 TRAVIS_DL_URL_PREFIX="https://s3.amazonaws.com/travis-python-archives/binaries/ubuntu/14.04/x86_64"
 _UCX_COMMIT=31e74cac7bee0ef66bef2af72e7d86d9c282e5ab
-_UCC_COMMIT=12944da33f911daf505d9bbc51411233d0ed85e1
+_UCC_COMMIT=1c7a7127186e7836f73aafbd7697bbc274a77eee

 # It's annoying to rename jobs every time you want to rewrite a
 # configuration, so we hardcode everything here rather than do it
4 changes: 2 additions & 2 deletions .circleci/docker/common/install_cudnn.sh
@@ -6,9 +6,9 @@ if [[ ${CUDNN_VERSION} == 8 ]]; then
 CUDNN_NAME="cudnn-linux-x86_64-8.3.2.44_cuda11.5-archive"
 if [[ ${CUDA_VERSION:0:4} == "11.7" ]]; then
 CUDNN_NAME="cudnn-linux-x86_64-8.5.0.96_cuda11-archive"
-curl -OLs https://ossci-linux.s3.amazonaws.com/${CUDNN_NAME}.tar.xz
+curl --retry 3 -OLs https://ossci-linux.s3.amazonaws.com/${CUDNN_NAME}.tar.xz
 else
-curl -OLs https://developer.download.nvidia.com/compute/redist/cudnn/v8.3.2/local_installers/11.5/${CUDNN_NAME}.tar.xz
+curl --retry 3 -OLs https://developer.download.nvidia.com/compute/redist/cudnn/v8.3.2/local_installers/11.5/${CUDNN_NAME}.tar.xz
 fi

 tar xf ${CUDNN_NAME}.tar.xz
4 changes: 2 additions & 2 deletions .circleci/docker/common/install_docs_reqs.sh
@@ -7,10 +7,10 @@ if [ -n "$KATEX" ]; then
 # Ignore error if gpg-agent doesn't exist (for Ubuntu 16.04)
 apt-get install -y gpg-agent || :

-curl -sL https://deb.nodesource.com/setup_12.x | sudo -E bash -
+curl --retry 3 -sL https://deb.nodesource.com/setup_12.x | sudo -E bash -
 sudo apt-get install -y nodejs

-curl -sS https://dl.yarnpkg.com/debian/pubkey.gpg | sudo apt-key add -
+curl --retry 3 -sS https://dl.yarnpkg.com/debian/pubkey.gpg | sudo apt-key add -
 echo "deb https://dl.yarnpkg.com/debian/ stable main" | sudo tee /etc/apt/sources.list.d/yarn.list

 apt-get update
2 changes: 1 addition & 1 deletion .circleci/docker/common/install_protobuf.sh
@@ -12,7 +12,7 @@ install_protobuf_317() {
 # g++: error: ./../lib64/crti.o: No such file or directory
 ln -s /usr/lib64 "$pb_dir/lib64"

-curl -LO "https://github.com/protocolbuffers/protobuf/releases/download/v3.17.3/protobuf-all-3.17.3.tar.gz"
+curl -LO "https://github.com/protocolbuffers/protobuf/releases/download/v3.17.3/protobuf-all-3.17.3.tar.gz" --retry 3
 tar -xvz -C "$pb_dir" --strip-components 1 -f protobuf-all-3.17.3.tar.gz
 # -j6 to balance memory usage and speed.
 # naked `-j` seems to use too much memory.
7 changes: 6 additions & 1 deletion .circleci/docker/requirements-ci.txt
@@ -159,8 +159,13 @@ pytest-shard
 #Pinned versions:
 #test that import:

+pytest-flakefinder==1.1.0
+#Description: plugin for rerunning tests a fixed number of times in pytest
+#Pinned versions: 1.1.0
+#test that import:
+
 pytest-rerunfailures
-#Description: plugin for rerunning tests in pytest
+#Description: plugin for rerunning failure tests in pytest
 #Pinned versions:
 #test that import:

4 changes: 2 additions & 2 deletions .circleci/scripts/binary_install_miniconda.sh
@@ -31,9 +31,9 @@ fi

 conda_sh="$workdir/install_miniconda.sh"
 if [[ "$(uname)" == Darwin ]]; then
-curl --retry 3 -o "$conda_sh" https://repo.anaconda.com/miniconda/Miniconda3-latest-MacOSX-x86_64.sh
+curl --retry 3 --retry-all-errors -o "$conda_sh" https://repo.anaconda.com/miniconda/Miniconda3-latest-MacOSX-x86_64.sh
 else
-curl --retry 3 -o "$conda_sh" https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
+curl --retry 3 --retry-all-errors -o "$conda_sh" https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
 fi
 chmod +x "$conda_sh"
 "$conda_sh" -b -p "$MINICONDA_ROOT"
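A note on the `--retry-all-errors` flag added in these scripts: curl only understands it from version 7.71.0 onward, which is presumably why a later commit in this range removes it from environments with an old curl. The following is a minimal sketch, not code from this PR, of how a script could choose its retry flags from the curl version. The helper names `version_ge` and `curl_retry_flags` are hypothetical, and `sort -V` is assumed to be available.

```shell
# Hypothetical helpers (not part of the PR): pick curl retry flags by version.
# Assumption: `sort -V` (version sort) is available on the runner.

# version_ge A B: succeeds when dotted version A is >= dotted version B.
version_ge() {
  [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n1)" = "$2" ]
}

# curl_retry_flags VERSION: prints the retry flags suitable for that curl version.
# --retry-all-errors was introduced in curl 7.71.0; older builds reject it.
curl_retry_flags() {
  if version_ge "$1" "7.71.0"; then
    echo "--retry 3 --retry-all-errors"
  else
    echo "--retry 3"
  fi
}
```

A script might then call `curl $(curl_retry_flags "$(curl --version | awk 'NR==1 {print $2}')") -o "$conda_sh" "$url"`, leaving the substitution unquoted on purpose so the flags split into separate words; `$url` here is a placeholder.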
2 changes: 1 addition & 1 deletion .circleci/scripts/binary_ios_build.sh
@@ -8,7 +8,7 @@ PROJ_ROOT=/Users/distiller/project
 export TCLLIBPATH="/usr/local/lib"

 # Install conda
-curl --retry 3 -o ~/conda.sh https://repo.anaconda.com/miniconda/Miniconda3-latest-MacOSX-x86_64.sh
+curl --retry 3 --retry-all-errors -o ~/conda.sh https://repo.anaconda.com/miniconda/Miniconda3-latest-MacOSX-x86_64.sh
 chmod +x ~/conda.sh
 /bin/bash ~/conda.sh -b -p ~/anaconda
 export PATH="~/anaconda/bin:${PATH}"
2 changes: 1 addition & 1 deletion .circleci/scripts/binary_ios_upload.sh
@@ -47,7 +47,7 @@ echo "${IOS_NIGHTLY_BUILD_VERSION}" > version.txt
 zip -r ${ZIPFILE} install src version.txt LICENSE
 # upload to aws
 # Install conda then 'conda install' awscli
-curl --retry 3 -o ~/conda.sh https://repo.anaconda.com/miniconda/Miniconda3-latest-MacOSX-x86_64.sh
+curl --retry 3 --retry-all-errors -o ~/conda.sh https://repo.anaconda.com/miniconda/Miniconda3-latest-MacOSX-x86_64.sh
 chmod +x ~/conda.sh
 /bin/bash ~/conda.sh -b -p ~/anaconda
 export PATH="~/anaconda/bin:${PATH}"
2 changes: 1 addition & 1 deletion .circleci/scripts/driver_update.bat
@@ -1,5 +1,5 @@
 set "DRIVER_DOWNLOAD_LINK=https://s3.amazonaws.com/ossci-windows/452.39-data-center-tesla-desktop-win10-64bit-international.exe"
-curl --retry 3 -kL %DRIVER_DOWNLOAD_LINK% --output 452.39-data-center-tesla-desktop-win10-64bit-international.exe
+curl --retry 3 --retry-all-errors -kL %DRIVER_DOWNLOAD_LINK% --output 452.39-data-center-tesla-desktop-win10-64bit-international.exe
 if errorlevel 1 exit /b 1

 start /wait 452.39-data-center-tesla-desktop-win10-64bit-international.exe -s -noreboot
4 changes: 2 additions & 2 deletions .circleci/scripts/setup_ci_environment.sh
@@ -40,8 +40,8 @@ if [ -n "${USE_CUDA_DOCKER_RUNTIME:-}" ]; then
 # Taken directly from https://github.com/NVIDIA/nvidia-docker
 # Add the package repositories
 distribution=$(. /etc/os-release;echo "$ID$VERSION_ID")
-curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add -
-curl -s -L "https://nvidia.github.io/nvidia-docker/${distribution}/nvidia-docker.list" | sudo tee /etc/apt/sources.list.d/nvidia-docker.list
+curl -s -L --retry 3 --retry-all-errors https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add -
+curl -s -L --retry 3 --retry-all-errors "https://nvidia.github.io/nvidia-docker/${distribution}/nvidia-docker.list" | sudo tee /etc/apt/sources.list.d/nvidia-docker.list

 retry sudo apt-get update -qq
 # Necessary to get the `--gpus` flag to function within docker
2 changes: 1 addition & 1 deletion .circleci/scripts/setup_linux_system_environment.sh
@@ -2,7 +2,7 @@
 set -eux -o pipefail

 # Set up CircleCI GPG keys for apt, if needed
-curl --retry 3 -s -L https://packagecloud.io/circleci/trusty/gpgkey | sudo apt-key add -
+curl --retry 3 --retry-all-errors -s -L https://packagecloud.io/circleci/trusty/gpgkey | sudo apt-key add -

 # Stop background apt updates. Hypothetically, the kill should not
 # be necessary, because stop is supposed to send a kill signal to
2 changes: 1 addition & 1 deletion .circleci/scripts/vs_install.ps1
@@ -29,7 +29,7 @@ if (Test-Path "${env:ProgramFiles(x86)}\Microsoft Visual Studio\Installer\vswher
 }

 echo "Downloading VS installer from S3."
-curl.exe --retry 3 -kL $VS_DOWNLOAD_LINK --output vs_installer.exe
+curl.exe --retry 3 --retry-all-errors -kL $VS_DOWNLOAD_LINK --output vs_installer.exe
 if ($LASTEXITCODE -ne 0) {
 echo "Download of the VS 2019 Version ${env:VS_VERSION} installer failed"
 exit 1
2 changes: 1 addition & 1 deletion .circleci/scripts/vs_install_cmath.ps1
@@ -1,5 +1,5 @@
 $CMATH_DOWNLOAD_LINK = "https://raw.githubusercontent.com/microsoft/STL/12c684bba78f9b032050526abdebf14f58ca26a3/stl/inc/cmath"
 $VC14_28_INSTALL_PATH="C:\Program Files (x86)\Microsoft Visual Studio\2019\Community\VC\Tools\MSVC\14.28.29910\include"

-curl.exe --retry 3 -kL $CMATH_DOWNLOAD_LINK --output "$home\cmath"
+curl.exe --retry 3 --retry-all-errors -kL $CMATH_DOWNLOAD_LINK --output "$home\cmath"
 Move-Item -Path "$home\cmath" -Destination "$VC14_28_INSTALL_PATH" -Force
2 changes: 1 addition & 1 deletion .circleci/scripts/windows_cudnn_install.sh
Original file line number Diff line number Diff line change
Expand Up @@ -36,7 +36,7 @@ else
tmp_dir=$(mktemp -d)
(
pushd "${tmp_dir}"
curl --retry 3 -o "${cudnn_installer_name}" "$cudnn_installer_link"
curl --retry 3 --retry-all-errors -o "${cudnn_installer_name}" "$cudnn_installer_link"
7z x "${cudnn_installer_name}" -ocudnn
# Use '${var:?}/*' to avoid potentially expanding to '/*'
# Remove all of the directories before attempting to copy files
7 changes: 6 additions & 1 deletion .github/actions/setup-rocm/action.yml
@@ -36,7 +36,12 @@ runs:
run: |
ngpu=$(rocminfo | grep -c -E 'Name:.*\sgfx')
if [[ "x$ngpu" != "x2" && "x$ngpu" != "x4" ]]; then
echo "Failed to detect GPUs on the runner"
if [[ $ngpu -eq 0 ]]; then
echo "Error: Failed to detect any GPUs on the runner"
else
echo "Error: Detected $ngpu GPUs on the runner, when only 2 or 4 were expected"
fi
echo "Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified"
exit 1
fi

11 changes: 7 additions & 4 deletions .github/actions/upload-test-artifacts/action.yml
@@ -34,7 +34,7 @@ runs:
run: |
# Remove any previous test reports if they exist
rm -f test-reports-*.zip
zip -r "test-reports-${FILE_SUFFIX}.zip" test -i '*.xml'
zip -r "test-reports-${FILE_SUFFIX}.zip" test -i '*.xml' -i '*.csv'

- name: Zip usage log for upload
if: runner.os != 'Windows' && !inputs.use-gha
@@ -67,7 +67,7 @@ runs:
FILE_SUFFIX: ${{ inputs.file-suffix }}
run: |
# -ir => recursive include all files in pattern
7z a "test-reports-$Env:FILE_SUFFIX.zip" -ir'!test\*.xml'
7z a "test-reports-$Env:FILE_SUFFIX.zip" -ir'!test\*.xml' -ir'!test\*.csv'

- name: Zip usage log for upload
if: runner.os == 'Windows' && !inputs.use-gha
@@ -127,8 +127,11 @@ runs:
# Add the run attempt, see [Artifact run attempt]
name: test-reports-runattempt${{ github.run_attempt }}-${{ inputs.file-suffix }}.zip
retention-days: 14
if-no-files-found: error
path: test/**/*.xml
# Don't want to fail the workflow here because not all workflows have csv files
if-no-files-found: ignore
path: |
test/**/*.xml
test/**/*.csv

- name: Store Usage Logs on Github
uses: actions/upload-artifact@v3
1 change: 1 addition & 0 deletions .github/ci_commit_pins/text.txt
@@ -0,0 +1 @@
5b78d074bd303eb230d30567646fcf0358ee2dd4
2 changes: 1 addition & 1 deletion .github/ci_commit_pins/vision.txt
@@ -1 +1 @@
bf58902b2fd881c760cd2eeacfae2d7c468ebf1f
4a310f26049371959617921d0eb9b001f4d262c6
2 changes: 1 addition & 1 deletion .github/ci_commit_pins/xla.txt
@@ -1 +1 @@
7889d2d3be16675943d84e4a4133ed7c245a623f
50855d7babfa7970cba18528c659989b91c83824
2 changes: 2 additions & 0 deletions .github/requirements/README.md
@@ -17,6 +17,8 @@ The list of support files are as follows:
test jobs to setup the conda environment
* conda-env-macOS-X64. This is use by MacOS (x86-64) build and test
jobs to setup the conda environment
* conda-env-Linux-X64. This is used by Linux buck build and test jobs
to setup the conda environment
* Pip:
* pip-requirements-macOS.txt. This is used by MacOS build and test jobs to
setup the pip environment
10 changes: 10 additions & 0 deletions .github/requirements/conda-env-Linux-X64
@@ -0,0 +1,10 @@
cffi=1.15.1
cmake=3.22.1
mkl=2022.1.0
mkl-include=2022.1.0
ninja=1.10.2
numpy=1.23.3
pyyaml=6.0
requests=2.28.1
setuptools=65.5.0
typing_extensions=4.3.0
1 change: 1 addition & 0 deletions .github/requirements/pip-requirements-macOS.txt
@@ -14,6 +14,7 @@ pygments==2.12.0
pytest==7.2.0
pytest-xdist==3.0.2
pytest-rerunfailures==10.2
pytest-flakefinder==1.1.0
pytest-shard==0.1.2
scipy==1.9.0
sympy==1.11.1
2 changes: 1 addition & 1 deletion .github/scripts/README.md
@@ -3,7 +3,7 @@
> NOTE: This README contains information for the `.github` directory but cannot be located there because it will overwrite the
repo README.

This directory contains workflows and scripts to support our CI infrastructure that runs on Github Actions.
This directory contains workflows and scripts to support our CI infrastructure that runs on GitHub Actions.

## Workflows

29 changes: 27 additions & 2 deletions .github/scripts/filter_test_configs.py
@@ -24,7 +24,9 @@
"functorch",
"inductor",
"inductor_distributed",
"inductor_huggingface",
"inductor_timm",
"inductor_torchbench",
"jit_legacy",
"multigpu",
"nogpu_AVX512",
@@ -34,6 +36,13 @@
"xla",
}}

# Supported modes when running periodically
SUPPORTED_PERIODICAL_MODES = {
    "mem_leak_check",
    "rerun_disabled_tests",
}


def parse_args() -> Any:
from argparse import ArgumentParser
parser = ArgumentParser("Filter all test configurations and keep only requested ones")
@@ -109,6 +118,23 @@ def filter(test_matrix: Dict[str, List[Any]], labels: Set[str]) -> Dict[str, Lis
return filtered_test_matrix


def set_periodic_modes(test_matrix: Dict[str, List[Any]]) -> Dict[str, List[Any]]:
    """
    Apply all periodic modes when running under a schedule
    """
    scheduled_test_matrix: Dict[str, List[Any]] = {
        "include": [],
    }

    for config in test_matrix.get("include", []):
        for mode in SUPPORTED_PERIODICAL_MODES:
            cfg = config.copy()
            cfg[mode] = mode
            scheduled_test_matrix["include"].append(cfg)

    return scheduled_test_matrix


def set_output(name: str, val: Any) -> None:
if os.getenv("GITHUB_OUTPUT"):
with open(str(os.getenv("GITHUB_OUTPUT")), "a") as env:
@@ -163,8 +189,7 @@ def main() -> None:
filtered_test_matrix = test_matrix

if args.event_name == "schedule":
for config in filtered_test_matrix.get("include", []):
config["mem_leak_check"] = "mem_leak_check"
filtered_test_matrix = set_periodic_modes(filtered_test_matrix)

# Set the filtered test matrix as the output
set_output("test-matrix", json.dumps(filtered_test_matrix))
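
Note on the schedule path in this file: the removed loop tagged each existing entry in place with `mem_leak_check`, while `set_periodic_modes` emits one copy of every entry per mode, so a scheduled run gets `len(SUPPORTED_PERIODICAL_MODES)` times as many jobs. A minimal sketch of that expansion follows. The function and constant are copied from the diff; the two-entry input matrix and its `config`/`shard` keys are hypothetical and only there for illustration.

```python
from typing import Any, Dict, List

# Copied from the diff above
SUPPORTED_PERIODICAL_MODES = {
    "mem_leak_check",
    "rerun_disabled_tests",
}


def set_periodic_modes(test_matrix: Dict[str, List[Any]]) -> Dict[str, List[Any]]:
    """Apply all periodic modes when running under a schedule."""
    scheduled_test_matrix: Dict[str, List[Any]] = {"include": []}

    for config in test_matrix.get("include", []):
        for mode in SUPPORTED_PERIODICAL_MODES:
            # Shallow copy, so the input entry does not gain the mode key
            cfg = config.copy()
            cfg[mode] = mode
            scheduled_test_matrix["include"].append(cfg)

    return scheduled_test_matrix


# Hypothetical input matrix (not taken from any real workflow)
example_matrix = {
    "include": [
        {"config": "default", "shard": 1},
        {"config": "distributed", "shard": 1},
    ],
}

expanded = set_periodic_modes(example_matrix)

# Each input entry appears once per mode; the order of modes within an
# entry's group follows set iteration order and should not be relied on.
for entry in expanded["include"]:
    print(entry)
```

Because the modes live in a set, the relative order of the generated entries is not stable between runs; anything that consumes the matrix should treat it as unordered.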
6 changes: 3 additions & 3 deletions .github/scripts/generate_binary_build_matrix.py
@@ -219,9 +219,9 @@ def generate_wheels_matrix(os: str,
"container_image": WHEEL_CONTAINER_IMAGES[arch_version],
"package_type": package_type,
"pytorch_extra_install_requirements":
"nvidia-cuda-runtime-cu11;"
"nvidia-cudnn-cu11==8.5.0.96;"
"nvidia-cublas-cu11==11.10.3.66",
"nvidia-cuda-runtime-cu11; platform_system == 'Linux' | "
"nvidia-cudnn-cu11==8.5.0.96; platform_system == 'Linux' | "
"nvidia-cublas-cu11==11.10.3.66; platform_system == 'Linux'",
"build_name":
f"{package_type}-py{python_version}-{gpu_arch_type}{gpu_arch_version}-with-pypi-cudnn"
.replace(
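
Note on the separator change in this hunk: in PEP 508 requirement strings a `;` introduces an environment marker, so once each entry carries `platform_system == 'Linux'` the `;` can no longer also separate the entries, which is presumably why the new value uses ` | `. The code that consumes `pytorch_extra_install_requirements` is not part of this section, so the sketch below assumes it splits on `|` and strips whitespace; `split_requirements` is a hypothetical helper, not a function from the repository.

```python
from typing import List

# Value as it appears in the new side of the diff
extra_install_requirements = (
    "nvidia-cuda-runtime-cu11; platform_system == 'Linux' | "
    "nvidia-cudnn-cu11==8.5.0.96; platform_system == 'Linux' | "
    "nvidia-cublas-cu11==11.10.3.66; platform_system == 'Linux'"
)


def split_requirements(value: str) -> List[str]:
    """Hypothetical consumer: split on '|' and drop surrounding whitespace."""
    return [part.strip() for part in value.split("|") if part.strip()]


requirements = split_requirements(extra_install_requirements)

for req in requirements:
    # In a PEP 508 string, ';' separates the requirement from its marker
    spec, _, marker = req.partition(";")
    print(f"{spec.strip():35s} marker: {marker.strip()}")
```

With the markers in place, each entry is only installed on Linux, which is where these CUDA wheels are expected to be available.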
4 changes: 2 additions & 2 deletions .github/scripts/generate_ci_workflows.py
@@ -154,7 +154,7 @@ class OperatingSystem:
package_type="libtorch",
abi_version=generate_binary_build_matrix.PRE_CXX11_ABI,
build_configs=generate_binary_build_matrix.generate_libtorch_matrix(
OperatingSystem.LINUX, generate_binary_build_matrix.CXX11_ABI,
OperatingSystem.LINUX, generate_binary_build_matrix.PRE_CXX11_ABI,
arches=["cpu"],
libtorch_variants=["shared-with-deps"],
),
@@ -277,7 +277,7 @@ class OperatingSystem:
BinaryBuildWorkflow(
os=OperatingSystem.MACOS_ARM64,
package_type="wheel",
build_configs=generate_binary_build_matrix.generate_wheels_matrix(OperatingSystem.MACOS),
build_configs=generate_binary_build_matrix.generate_wheels_matrix(OperatingSystem.MACOS_ARM64),
cross_compile_arm64=True,
ciflow_config=CIFlowConfig(
labels={LABEL_CIFLOW_BINARIES, LABEL_CIFLOW_BINARIES_WHEEL},