This project is mirrored from https://github.com/llvm/llvm-project.git.
Pull mirroring failed .
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer or owner.
Last successful update .
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer or owner.
Last successful update .
- Mar 25, 2022
-
-
Argyrios Kyrtzidis authored
BLAKE3 is a cryptographic hash function that is secure and very performant. The C implementation originates from https://github.com/BLAKE3-team/BLAKE3/tree/1.3.1/c License is at https://github.com/BLAKE3-team/BLAKE3/blob/1.3.1/LICENSE This patch adds: * `llvm/include/llvm-c/blake3.h`: The BLAKE3 C API * `llvm/include/llvm/Support/BLAKE3.h`: C++ wrapper of the C API * `llvm/lib/Support/BLAKE3`: Directory containing the BLAKE3 C implementation files, including the `LICENSE` file * `llvm/unittests/Support/BLAKE3Test.cpp`: unit tests for the BLAKE3 C++ wrapper This initial patch contains the pristine BLAKE3 sources, a follow-up patch will introduce LLVM-specific prefixes to avoid conflicts if a client also links with its own BLAKE3 version. And here's some timings comparing BLAKE3 with LLVM's SHA1/SHA256/MD5. Timings include `AVX512`, `AVX2`, `neon`, and the generic/portable implementations. The table shows the speed-up multiplier of BLAKE3 for hashing 100 MBs: | Processor | SHA1 | SHA256 | MD5 | |-------------------------|-------|--------|------| | Intel Xeon W (AVX512) | 10.4x | 27x | 9.4x | | Intel Xeon W (AVX2) | 6.5x | 17x | 5.9x | | Intel Xeon W (portable) | 1.3x | 3.3x | 1.1x | | M1Pro (neon) | 2.1x | 4.7x | 2.8x | | M1Pro (portable) | 1.1x | 2.4x | 1.5x | Differential Revision: https://reviews.llvm.org/D121510
-
Shraiysh Vaishay authored
This patch adds the nowait parameter to `createSingle` in OpenMPIRBuilder and handling for IR generation from OpenMP Dialect. Also added tests for the same. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D122371
-
Simon Pilgrim authored
InstCombine can convert some cases to 'splat bit' sra(shl(x,y),bw-1) patterns which we need to handle as well
-
Michael Jones authored
The main code for the FILE struct is only enabled on platforms that it works on, but before this patch the tests were included unconditionally. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D122363
-
Louis Dionne authored
I audited all uses of _LIBCPP_ASSERT to make sure that we only used it for "basic assertions", i.e. assertions with constant-time conditions. I also audited all uses of _LIBCPP_DEBUG_ASSERT to make sure we used it only for debug-mode assertions, and in one case had to change for _LIBCPP_ASSERT instead. As a fly-by, I also changed a couple of tests against nullptr or 0 to be more explicit. After this patch, all uses of _LIBCPP_ASSERT should be with constant-time conditions, and all uses of _LIBCPP_DEBUG_ASSERT should be with conditions that we only want to check when the debug mode is enabled. Differential Revision: https://reviews.llvm.org/D122395
-
Shafik Yaghmour authored
[LLDB] Cleanup for Fixing DWARFExpression handling of ValueType::FileAddress case for DW_OP_deref_size Late review on https://reviews.llvm.org/D121408 spotted some nice quick clean-ups on this code.
-
Arthur Eubanks authored
As mentioned in D120782, the loop block order can be different depending on if LoopInfo is incrementally updated or freshly computed. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D122195
-
Dmitry Vyukov authored
Currently, we only print how threads involved in data race are created from their parent threads. Add a runtime flag 'print_full_thread_history' to print thread creation stacks for the threads involved in the data race and their ancestors up to the main thread. Reviewed By: dvyukov Differential Revision: https://reviews.llvm.org/D122131
-
Mike Rice authored
Adds basic parsing/sema/serialization support for the #pragma omp target parallel loop directive. Differential Revision: https://reviews.llvm.org/D122359
-
Momchil Velikov authored
Re-commit of 32e8b550 This patch rearranges emission of CFI instructions, so the resulting DWARF and `.eh_frame` information is precise at every instruction. The current state is that the unwind info is emitted only after the function prologue. This is fine for synchronous (e.g. C++) exceptions, but the information is generally incorrect when the program counter is at an instruction in the prologue or the epilogue, for example: ``` stp x29, x30, [sp, #-16]! // 16-byte Folded Spill mov x29, sp .cfi_def_cfa w29, 16 ... ``` after the `stp` is executed the (initial) rule for the CFA still says the CFA is in the `sp`, even though it's already offset by 16 bytes A correct unwind info could look like: ``` stp x29, x30, [sp, #-16]! // 16-byte Folded Spill .cfi_def_cfa_offset 16 mov x29, sp .cfi_def_cfa w29, 16 ... ``` Having this information precise up to an instruction is useful for sampling profilers that would like to get a stack backtrace. The end goal (towards this patch is just a step) is to have fully working `-fasynchronous-unwind-tables`. Reviewed By: danielkiss, MaskRay Differential Revision: https://reviews.llvm.org/D111411
-
Valentin Clement authored
This patch adds lowering for the `!$acc wait` directive from the PFT to OpenACC dialect. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D122399
-
- Mar 24, 2022
-
-
Craig Topper authored
The alignment needs to be part of the folding set hash. This is handled by getAssertAlign when nodes are created, but needs to repeated here. No test case as I found it as part of a very early experimental patch. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D122279
-
Alexander Belyaev authored
Asan complained about uninitialized bool `invalid-bool-load` llvm/lib/Target/X86/AsmParser/X86Operand.h:389:12: runtime error: load of value 171, which is not a valid value for type 'bool' Differential Revision: https://reviews.llvm.org/D122405
-
Aaron Ballman authored
This matches the behavior of cl.exe.
-
Jakob Koschel authored
This is the orignal patch + a check that LLVM_BUILD_EXAMPLES is enabled before adding a dependency on the 'Bye' example pass. Original summary: Add cli options for new passmanager plugin support to lld. Currently it is not possible to load dynamic NewPM plugins with lld. This is an incremental update to D76866. While that patch only added cli options for llvm-lto2, this adds them for lld as well. This is especially useful for running dynamic plugins on the linux kernel with LTO. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D120490
-
Florian Hahn authored
This patch moves pointer induction handling from VPWidenPHIRecipe to its own recipe. In the process, it adds all information required to generate code for pointer inductions without relying on Legal to access the list of induction phis. Alternatively VPWidenPHIRecipe could also take an optional pointer to InductionDescriptor. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D121615
-
Aaron Ballman authored
Naked functions have no prolog, so it's not valid to emit prolog code to evaluate the variably-modified type. This fixes Issue 50541.
-
Valentin Clement authored
This patch adds lowering for the `!$acc data` from the PFT to OpenACC dialect. This patch is part of the upstreaming effort from fir-dev branch. Depends on D122384 Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D122398
-
Sanjay Patel authored
Add the "(0 - X) --> (X * -1)" reverse identity to the list of alternate form binops. We need a little hack to make the existing logic work because it does not expect to move constants from op0 to op1, but the code comment hopefully makes that clear. I don't think there are any other identities like that. Fixes #54364 Differential Revision: https://reviews.llvm.org/D122390
-
Louis Dionne authored
Add missing commit SHAs in a few cases, and use "All Platforms" when the change affected all platforms instead of explicitly listing the platforms, since there's too many to list exhaustively.
-
Kadir Cetinkaya authored
Clangd ignores fixits if the diagnsotics is outside the main file (e.g. a note on a declaration from a header), but the fix might still be inside the main file (e.g. change the function call). This patch changes the logic to retain fixes that touch main file, if the diagnostic owning them is still inside main file, even if they are attached to a note outside the main file. Differential Revision: https://reviews.llvm.org/D122315
-
Valentin Clement authored
This patch adds lowering for the `!$acc update` from the PFT to OpenACC dialect. This patch is part of the upstreaming effort from fir-dev branch. Depends on D122387 Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D122396
-
Valentin Clement authored
This patch adds lowering for the `!$acc init` and `!$acc shutdown` from the PFT to OpenACC dialect. This patch is part of the upstreaming effort from fir-dev branch. Depends on D122384 Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D122387
-
Valentin Clement authored
This patch adds lowering for the `!$acc exit data` directive from the PFT to OpenACC dialect. This patch is part of the upstreaming effort from fir-dev branch. Depends on D122384 Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D122386
-
Valentin Clement authored
This patch adds lowering for the `!$acc enter data` directive from the PFT to OpenACC dialect. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D122384
-
Valentin Clement authored
This patches adds lowering tests for some array expressions use cases. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D122380 Co-authored-by:
Jean Perier <jperier@nvidia.com> Co-authored-by:
Eric Schweitz <eschweitz@nvidia.com>
-
Louis Dionne authored
-
Sven van Haastregt authored
This simplifies completeness comparisons against OpenCLBuiltins.td and also makes the header no longer "claim" the identifiers "x", "y" and "z". Continues the direction set out in D119560.
-
LLVM GN Syncbot authored
-
Haojian Wu authored
Parse forest is the output of the GLR parser, it is a tree-like DAG which presents all possible parse trees without duplicating subparse structures. This is a patch split from https://reviews.llvm.org/D121150. Differential Revision: https://reviews.llvm.org/D122139
-
Djordje Todorovic authored
After removing the MaybePromotable, some comments became stale. This improves them. Differential Revision: https://reviews.llvm.org/D122319
-
Haojian Wu authored
Reductions need to be performed in a careful order in GLR parser, to make sure we gather all alternatives before creating an ambigous forest node. This patch encodes the nonterminal order into the rule id, so that we can efficiently to determinal ordering of reductions in GLR parser. This patch also abstracts to a TestGrammar, which is shared among tests. This is a part of the GLR parser, https://reviews.llvm.org/D121368, https://reviews.llvm.org/D121150 Differential Revision: https://reviews.llvm.org/D122303
-
Qiu Chaofan authored
This change formats PowerPC intrinsics wrapper headers into LLVM style, and add extra prefix '__' to all variables to prevent conflict with user code.
-
Nico Weber authored
-
Alexey Bataev authored
-
Nathan Sidwell authored
Reformat some misindentation that is coincidentally close to a piece being worked on. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D122314
-
Aaron Ballman authored
-
Martin Storsjö authored
There's no other references to this variable name, but other files refer to the same with the same name but with double Fs.
-
Jakob Johnson authored
Update the response schema of the TraceGetState packet and add Intel PT specific response structure that contains the TSC conversion, if it exists. The IntelPTCollector loads the TSC conversion and caches it to prevent unnecessary calls to perf_event_open. Move the TSC conversion calculation from Perf.h to TraceIntelPTGDBRemotePackets.h to remove dependency on Linux specific headers. Differential Revision: https://reviews.llvm.org/D122246
-
LLVM GN Syncbot authored
-