Commits · 0e0b0feff194358e5e68bf36f5a563d269fa8e88 · Panda / LLVM project

This project is mirrored from https://github.com/llvm/llvm-project.git. Pull mirroring failed 2 years ago.

Apr 08, 2022

[clang-tidy] Make performance-inefficient-vector-operation work on members · 0e0b0fef

Nathan James authored 2 years ago

Fixes https://llvm.org/PR50157

Adds support for when the container being read from in a range-for is a member of a struct.

Reviewed By: aaron.ballman

Differential Revision: https://reviews.llvm.org/D101624
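
As a rough illustration of the case described above (not taken from the patch or its tests), the sketch below reads from a container that is a struct member inside a range-for. `Batch`, `doubledSlow`, and `doubledReserved` are hypothetical names, and the `reserve` call shows the kind of rewrite the check is generally understood to suggest, which is an assumption here rather than the tool's exact output.

```cpp
#include <vector>

// Hypothetical struct: the container being read from is a member.
struct Batch {
  std::vector<int> values;
};

// Pattern the check is meant to notice: push_back inside a range-for over a
// member container, with no reserve() call beforehand.
std::vector<int> doubledSlow(const Batch &batch) {
  std::vector<int> out;
  for (int v : batch.values)
    out.push_back(v * 2);
  return out;
}

// Same loop with the capacity reserved up front, so the vector does not
// have to reallocate while it grows.
std::vector<int> doubledReserved(const Batch &batch) {
  std::vector<int> out;
  out.reserve(batch.values.size());
  for (int v : batch.values)
    out.push_back(v * 2);
  return out;
}
```

Both functions return the same elements; only the number of reallocations during the loop differs.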

0e0b0fef

[mlir][Linalg] Add pooling_nchw_sum op. · b20719dc

Vivek Khandelwal authored 2 years ago

This commit adds pooling_nchw_sum as a yaml op.

Reviewed By: cathyzhyi, gysit

Differential Revision: https://reviews.llvm.org/D123013

b20719dc

[flang][NFC] rename isAbsent to isStaticallyAbsent in IntrinsicCall.cpp · f1cfa461

Jean Perier authored 2 years ago

isAbsent/isPresent helpers only give information about static presence
of intrinsic arguments. For many intrinsic arguments, optionality is dynamic
(an absent dummy can legally be passed to these intrinsics). This
requires different handling (like `handleDynamicOptional`).

Rename the helpers to avoid misleading coders and readers into thinking all
optionality cases are covered by them.

Differential Revision: https://reviews.llvm.org/D123378

f1cfa461

[VP] Explicitly map from VP intrinsic to ISD opcode · 18106b99

Fraser Cormack authored 2 years ago

This patch aims to overcome an issue in these mappings where, when an ISD
node was registered with BEGIN_REGISTER_VP_SDNODE but outside the scope
of a pair of BEGIN_REGISTER_VP_INTRINSIC/END_REGISTER_VP_INTRINSIC
macros, the switch cases fell apart. This in particular happened with
VP_SETCC, where we'd end up with something along the lines of:

  case Intrinsic::vp_fcmp:
    break;
  case Intrinsic::vp_icmp:
    break;
    ResOpc = ISD::VP_SETCC;
  case Intrinsic::vp_store:
    ...

To remedy this, we introduce a special-purpose mapping macro which can
map any number of VP intrinsic opcodes to an ISD opcode.

As a result, we no longer need to special-case the mapping from vp.icmp
and vp.fcmp to VP_SETCC, as the new helper macro does it for us.

Thanks to @craig.topper for noticing this and to @rogfer01 for the idea.

Reviewed By: rogfer01

Differential Revision: https://reviews.llvm.org/D123324
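
The idea of an explicit many-to-one mapping can be sketched with a small X-macro table. Everything below is a toy: `ToyIntrinsic`, `ToyISD`, `TOY_VP_INTRINSIC_TO_ISD`, and `toyISDForVPIntrinsic` are hypothetical names chosen for illustration and are not the macros or enums used in LLVM.

```cpp
#include <optional>

// Hypothetical stand-ins for the intrinsic IDs and ISD opcodes.
enum class ToyIntrinsic { vp_add, vp_icmp, vp_fcmp, vp_store, not_vp };
enum class ToyISD { VP_ADD, VP_SETCC, VP_STORE };

// Each row maps exactly one intrinsic to one opcode. Several rows may name
// the same opcode, so vp_icmp and vp_fcmp need no special-casing.
#define TOY_VP_INTRINSIC_TO_ISD(MAP)                                           \
  MAP(vp_add, VP_ADD)                                                          \
  MAP(vp_icmp, VP_SETCC)                                                       \
  MAP(vp_fcmp, VP_SETCC)                                                       \
  MAP(vp_store, VP_STORE)

std::optional<ToyISD> toyISDForVPIntrinsic(ToyIntrinsic id) {
  switch (id) {
// Every generated case is complete on its own line, so no statement can end
// up stranded after a break.
#define TOY_CASE(INTRIN, OPC)                                                  \
  case ToyIntrinsic::INTRIN:                                                   \
    return ToyISD::OPC;
    TOY_VP_INTRINSIC_TO_ISD(TOY_CASE)
#undef TOY_CASE
  default:
    return std::nullopt;
  }
}
```

Because each table row expands to a self-contained `case ... return ...;` pair, the generated switch cannot produce the misplaced assignment shown in the message above.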

18106b99

[gn build] Port 08920cc0 · c8084fd9
LLVM GN Syncbot authored 2 years ago

c8084fd9

[AArch64] Remove always true Perfect cost check. NFC · a93607c4

David Green authored 2 years ago

Perfect shuffle costs are always encoded as less than 4, and shouldn't
really have a cost more than 3, so it makes no sense to check it when
generating shuffles. The perfect shuffle is likely always better than a
tbl too (although that may depend on whether it is in a loop).

a93607c4

Fix Sphinx build · 33ab88ef
Aaron Ballman authored 2 years ago

33ab88ef
[OpenCL] Add generic addrspace guards for get_fence · 1331ad22
Sven van Haastregt authored 2 years ago

Align guards of these builtins with opencl-c.h.

1331ad22
[gn build] (manually) port bf2dc4b3 · 26b3a1ea
Nico Weber authored 2 years ago

26b3a1ea
[AMDGPU] Use GCNPat in the buffer atomic pattern multiclasses · b536f24d
Abinav Puthan Purayil authored 2 years ago

b536f24d

Disambiguate conversion cast for GCC · 932f27dc

Benjamin Kramer authored 2 years ago

GCC 9 has problems with this.

mlir/include/mlir/IR/OperationSupport.h: In member function ‘mlir::Value mlir::MutableOperandRange::operator[](unsigned int) const’:
mlir/include/mlir/IR/OperationSupport.h:912:43: error: call of overloaded ‘OperandRange(const mlir::MutableOperandRange&)’ is ambiguous
  912 |     return static_cast<OperandRange>(*this)[index];
      |
mlir/include/mlir/IR/OperationSupport.h:789:21: note: candidate: mlir::OperandRange::OperandRange(const llvm::iterator_range<llvm::detail::indexed_accessor_range_base<mlir::OperandRange, mlir::OpOperand*, mlir::Value, mlir::Value, mlir::Value>::iterator>&)
   using RangeBaseT::RangeBaseT;
                     ^~~~~~~~~~
mlir/include/mlir/IR/OperationSupport.h:786:7: note: candidate: constexpr mlir::OperandRange::OperandRange(const mlir::OperandRange&)
 class OperandRange final : public llvm::detail::indexed_accessor_range_base<
       ^~~~~~~~~~~~
mlir/include/mlir/IR/OperationSupport.h:786:7: note: candidate: constexpr mlir::OperandRange::OperandRange(mlir::OperandRange&&)

932f27dc

[AMDGPU] Increase detection range for s_mov, v_cmpx transformation. · 6d97ca69

Thomas Symalla authored 2 years ago

We found that it might be beneficial to have the SIOptimizeExecMasking
pass detect more cases where v_cmp, s_and_saveexec patterns can be
transformed to s_mov, v_cmpx patterns. Currently, the search range
for finding a fitting v_cmp instruction is 5; this patch doubles it
to 10.

Reviewed By: foad

Differential Revision: https://reviews.llvm.org/D123367

6d97ca69

[libc++] Add __is_callable type trait and begin granularizing type_traits · 08920cc0

Nikolas Klauser authored 2 years ago

`__is_callable` is required to ensure that the classic algorithms are only called with functions or functors. I also begin to granularize `<type_traits>`.

Reviewed By: ldionne, #libc

Spies: libcxx-commits, mgorny

Differential Revision: https://reviews.llvm.org/D123114
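
A trait of this kind can be sketched with the standard detection idiom. This is a minimal illustration assuming C++17, not libc++'s actual definition; `toy_is_callable_impl` and `toy_is_callable_v` are hypothetical names. Unlike `std::is_invocable`, the sketch only tests plain call syntax `f(args...)`, so pointers to members do not qualify.

```cpp
#include <type_traits>
#include <utility>

// Primary template: chosen when the call expression below is ill-formed.
template <class Void, class Func, class... Args>
struct toy_is_callable_impl : std::false_type {};

// Partial specialization: chosen only if `func(args...)` compiles.
template <class Func, class... Args>
struct toy_is_callable_impl<
    std::void_t<decltype(std::declval<Func>()(std::declval<Args>()...))>,
    Func, Args...> : std::true_type {};

// True when an object of type Func can be called with arguments of Args.
template <class Func, class... Args>
inline constexpr bool toy_is_callable_v =
    toy_is_callable_impl<void, Func, Args...>::value;
```

An algorithm could then guard its predicate parameter with `static_assert(toy_is_callable_v<Pred, T>, "...")` to produce an early, readable diagnostic.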

08920cc0

[libc++] Add tests for std::string default constructor and destructor · 628fcfd5

Nikolas Klauser authored 2 years ago

Reviewed By: ldionne, var-const, #libc, nilayvaish

Spies: nilayvaish, libcxx-commits

Differential Revision: https://reviews.llvm.org/D123129

628fcfd5

compiler-rt/lib/builtins/udivmodei5.c: Fix missing macro argument · 492c5c05
Matthias Gehre authored 2 years ago

492c5c05
[InstCombine] Add various other modulo-by-constant tests for Issue #22303 · 5b45c0b6
Simon Pilgrim authored 2 years ago

5b45c0b6

[mlir][tensor] Fix verifier and bufferization of collapse_shape · d7a9bf91

Matthias Springer authored 2 years ago

Insert a buffer copy unless the dims are guaranteed to be collapsible. In the verifier, accept collapses unless they are guaranteed to be non-collapsible.

Differential Revision: https://reviews.llvm.org/D123316

d7a9bf91

[mlir][bufferize] Do not insert useless casts for newly allocated buffers · d2608adf
Matthias Springer authored 2 years ago

Differential Revision: https://reviews.llvm.org/D123369

d2608adf

[mlir][arith][bufferize] Fix tensors with different layouts after bufferization · 8b091419

Matthias Springer authored 2 years ago

Insert a cast if the two tensors with identical layout (that are passed to `arith.select`) have different layout maps after bufferization.

Differential Revision: https://reviews.llvm.org/D123321

8b091419

[X86] Fix SLM scheduler model for PMULLD (PR37059) · 5626bd42

Simon Pilgrim authored 2 years ago

Adjust the PMULLD entry to match the Intel AoM numbers - PMULLD is a uop nightmare on SLM and we should model it as such.

We had reports of internal regressions the last time this was attempted (rG13a0f83a05ff), but no public repros, and tests I did last year when I had access to a SLM box failed to see anything. My hunch is that the more aggressive PMULLD -> PMADDWD folds we now perform might have helped. We can revisit this again if we ever receive an actual repro.

Fixes #36407

5626bd42

[spirv] Make header self-contained. NFC. · 656f0b82
Benjamin Kramer authored 2 years ago

656f0b82
[X86] Add additional test for PR54369 (NFC) · 8ae33cb3
Nikita Popov authored 2 years ago

From this comment: https://reviews.llvm.org/D123014#3436522

8ae33cb3

[gold] Remove support for legacy pass manager · 6ec8c6fc

Nikita Popov authored 2 years ago

This removes support for performing LTO using the legacy pass
manager in LLVMgold.so. Explicitly enabling the new pass manager
is retained as a no-op.

Differential Revision: https://reviews.llvm.org/D123294

6ec8c6fc

Revert "Reland "[RISCV][NFC] Moving RVV intrinsic type related util to llvm/Support"" · f922dbb7
Kito Cheng authored 2 years ago

This reverts commit fc2d8326.

f922dbb7

[analyzer] Don't track function calls as control dependencies · fd8e5762

Kristóf Umann authored 3 years ago

I recently evaluated ~150 bug reports on open source projects relating to my
GSoC'19 project, which was about tracking control dependencies that were
relevant to a bug report.

Here is what I found: when the condition is a function call, the extra notes
were almost always unimportant, and oftentimes intrusive:

void f(int *x) {
  x = nullptr;
  if (alwaysTrue()) // We don't need a whole lot of explanation
                    // here, the function name is good enough.
    *x = 5;
}

It almost always boiled down to a few "Returning null pointer, which participates
in a condition later", or similar notes. I struggled to find a single case
where the notes revealed anything interesting or some previously hidden
correlation, which is kind of the point of condition tracking.

This patch checks whether the condition is a function call, and if so, bails
out.

The argument against the patch is the popular feedback we hear from some of our
users, namely that they can never have too much information. I was specifically
fishing for examples that display best that my contribution did more good than
harm, so admittedly I set the bar high, and one can argue that there can be
non-trivial trickery inside functions, and function names may not be that
descriptive.

My argument for the patch is all those reports that got longer without any
notable improvement in the report intelligibility. I think the few exceptional
cases where this patch would remove notable information are an acceptable
sacrifice in favor of more reports being leaner.

Differential Revision: https://reviews.llvm.org/D116597

fd8e5762

[MemoryBuiltins] Remove unnecessary lambda capture (NFC) · 4e85b427
Nikita Popov authored 2 years ago

4e85b427
[SafeStack] Move test to X86 directory · f38d9388
Nikita Popov authored 2 years ago

This test requires the X86 target to be available.

f38d9388
[LICM] Pass MemorySSAUpdater by reference (NFC) · c8c63625
Nikita Popov authored 2 years ago

Make it clearer that this is a required dependency.

c8c63625

[C++20][Modules] Adjust handling of exports of namespaces and using-decls. · f60dc3ca

Iain Sandoe authored 3 years ago

This adjusts the handling for:

export module  M;

export namespace {};

export namespace N {};
export using namespace N;

In the first case, we were allowing empty anonymous namespaces
as part of an extension allowing empty top-level entities, but that seems
inappropriate in this case, since the linkage would be internal for the
anonymous namespace.  We now report an error for this.

The second case was producing a warning diagnostic that this was
accepted as an extension - however the C++20 standard does allow this
as well-formed.

In the third case we keep the current practice that this is accepted with a
warning (as an extension). The C++20 standard says it's an error.

We also ensure that using decls are only applied to items with external linkage.

This adjusts error messages for exports involving redeclarations in modules to
be more specific about the reason that the decl has been rejected.

Differential Revision: https://reviews.llvm.org/D122119

f60dc3ca

[mlir][Vector] Fold extractelement splat. · e79b7f50

jacquesguan authored 2 years ago

This revision adds support for folding vector.extractelement (splat X) -> X.

Reviewed By: ftynse

Differential Revision: https://reviews.llvm.org/D122960

e79b7f50

[LoopSink] Require MemorySSA · 5cefe7d9

Nikita Popov authored 2 years ago

This makes MemorySSA in LoopSink required, and removes the AST-based
implementation, as well as the related support code in LICM.

Differential Revision: https://reviews.llvm.org/D123288

5cefe7d9

[SafeStack] Don't create SCEV min between pointer and integer (PR54784) · a5a272a4

Nikita Popov authored 2 years ago

Rather than rewriting the alloca pointer to zero, use
removePointerBase() to drop the base pointer. This will simply bail
if the base pointer is not the alloca. We could try doing something
more fancy here (like dropping the sources not based on the alloca
on the premise that they aren't SafeStack-relevant), but I don't
think that's worthwhile.

Fixes https://github.com/llvm/llvm-project/issues/54784.

Differential Revision: https://reviews.llvm.org/D123309

a5a272a4

[mlir][Arithmetic] Add constant folder for negf. · 088d3889
jacquesguan authored 2 years ago

Reviewed By: ftynse

Differential Revision: https://reviews.llvm.org/D123293

088d3889

[Clang][Fortify] drop inline decls when redeclared · 301e0d91

serge-sans-paille authored 2 years ago

When an inline builtin declaration is shadowed by an actual declaration, we must
reference the actual declaration, even if it's not the last, following GCC
behavior.

This fixes #54715

Differential Revision: https://reviews.llvm.org/D123308

301e0d91

[builtin_object_size] Basic support for posix_memalign · aa15ea47

serge-sans-paille authored 3 years ago

It actually implements support for seeing through loads, using alias analysis to
refine the result.

This is rather limited, but I didn't want to rely on more than the analysis
available at that point (to be gentle with compilation time), and it does seem to
catch common scenarios, as showcased by the included tests.

Differential Revision: https://reviews.llvm.org/D122431

aa15ea47

[clang][deps] Ensure deterministic filename case · b672638d

Jan Svoboda authored 2 years ago

The dependency scanner can reuse a single FileManager instance across multiple translation units. This may lead to non-deterministic output depending on which TU gets processed first.

One of the problems is that Clang uses DirectoryEntry::getName in the header search algorithm. This function returns the path that was first used to construct the (shared) entry in FileManager. Using DirectoryEntryRef::getName instead preserves the case as it was spelled out for the current "get directory entry" request.

rdar://90647508

Reviewed By: dexonsmith

Differential Revision: https://reviews.llvm.org/D123229
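
The difference between the two `getName` flavours can be modelled with a toy cache. This is only a sketch of the behaviour described above, assuming a case-insensitive lookup; `ToyDirEntry`, `ToyDirEntryRef`, and `ToyFileManager` are hypothetical and do not mirror Clang's real classes or signatures.

```cpp
#include <cctype>
#include <map>
#include <string>

// Shared entry: remembers only the spelling used by the first request.
struct ToyDirEntry {
  std::string firstSpelling;
};

// Per-request handle: also remembers how this request spelled the path.
struct ToyDirEntryRef {
  const ToyDirEntry *entry;
  std::string requestedSpelling;
};

class ToyFileManager {
  // Keyed by lower-cased path to imitate a case-insensitive file system.
  std::map<std::string, ToyDirEntry> cache;

  static std::string lowered(std::string path) {
    for (char &c : path)
      c = static_cast<char>(std::tolower(static_cast<unsigned char>(c)));
    return path;
  }

public:
  ToyDirEntryRef getDirectory(const std::string &path) {
    // try_emplace keeps the existing entry if the key is already present,
    // so the first spelling wins for the shared entry.
    auto it = cache.try_emplace(lowered(path), ToyDirEntry{path}).first;
    return ToyDirEntryRef{&it->second, path};
  }
};
```

In this model, reading `entry->firstSpelling` depends on which request happened to arrive first, while `requestedSpelling` is the same regardless of processing order.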

b672638d

Reland "[RISCV][NFC] Moving RVV intrinsic type related util to llvm/Support" · fc2d8326

Kito Cheng authored 2 years ago

Reland Note: We've resolved the circular dependency issue between llvm/lib/Support and
llvm/TableGen.

Differential Revision: https://reviews.llvm.org/D121984

fc2d8326

Bump minimum toolchain version · 4c72deb6

Tobias Hieta authored 2 years ago

RFC: https://discourse.llvm.org/t/rfc-increasing-the-gcc-and-clang-requirements-to-support-c-17-in-llvm

Following the policy here: https://llvm.org/docs/DeveloperPolicy.html#toolchain

This forum post here will be updated with the timeline and status: https://discourse.llvm.org/t/important-new-toolchain-requirements-to-build-llvm-will-most-likely-be-landing-within-a-week-prepare-your-buildbots/61447

Reviewed By: mehdi_amini, jyknight, jhenderson, cor3ntin, MaskRay

Differential Revision: https://reviews.llvm.org/D122976

4c72deb6

Introduce branchless sorting functions for sort3, sort4 and sort5. · 194d1965

Marco Gelmi authored 2 years ago

We are introducing branchless variants for sort3, sort4 and sort5.
These sorting functions have been generated using Reinforcement
Learning and aim to replace __sort3, __sort4 and __sort5 variants
for integral types.

The libc++ benchmarks were run on isolated machines for Skylake, ARM and
AMD architectures and achieve statistically significant improvement in
sorting random integers on test cases from sort1 to sort262144 for
uint32 and uint64.

A full performance overview for Intel Skylake, AMD and Arm can be
found here: https://bit.ly/3AtesYf

Reviewed By: ldionne, #libc, philnik

Spies: daniel.mankowitz, mgrang, Quuxplusone, andreamichi, philnik, libcxx-commits, nilayvaish, kristof.beyls

Differential Revision: https://reviews.llvm.org/D118029
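
For readers unfamiliar with the term, a branchless three-element sort can be sketched as a fixed sequence of compare-exchange steps. The code below is a generic sorting-network illustration, not the generated routines from this patch; `condSwap` and `sort3Branchless` are hypothetical names, and whether a compiler lowers the ternaries to conditional moves is an assumption that depends on the target and optimization level.

```cpp
#include <type_traits>

// Compare-exchange: afterwards lo <= hi. Written with selects rather than
// an if-statement so that compilers can emit conditional moves.
template <class T>
void condSwap(T &lo, T &hi) {
  const bool outOfOrder = hi < lo;
  const T smaller = outOfOrder ? hi : lo;
  const T larger = outOfOrder ? lo : hi;
  lo = smaller;
  hi = larger;
}

// Three-element sorting network: the same three compare-exchange steps run
// regardless of the input values.
template <class T>
void sort3Branchless(T &a, T &b, T &c) {
  static_assert(std::is_integral_v<T>, "sketch restricted to integral types");
  condSwap(b, c); // now b <= c
  condSwap(a, c); // now c holds the maximum
  condSwap(a, b); // now a <= b <= c
}
```

The control flow never depends on the data, which is the property that avoids branch mispredictions on random input.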

194d1965

compiler-rt: Add udivmodei5 to builtins and add bitint library · bf2dc4b3

Matthias Gehre authored 3 years ago

According to the RFC [0], this review contains the compiler-rt parts of large integer division for _BitInt.

It adds the functions
```
/// Computes the unsigned division of a / b for two large integers
/// composed of n significant words.
/// Writes the quotient to quo and the remainder to rem.
///
/// \param quo The quotient represented by n words. Must be non-null.
/// \param rem The remainder represented by n words. Must be non-null.
/// \param a The dividend represented by n + 1 words. Must be non-null.
/// \param b The divisor represented by n words. Must be non-null.
///
/// \note The word order is in host endianness.
/// \note Might modify a and b.
/// \note The storage of 'a' needs to hold n + 1 elements because some
///       implementations need extra scratch space in the most significant word.
///       The value of that word is ignored.
COMPILER_RT_ABI void __udivmodei5(su_int *quo, su_int *rem, su_int *a,
                                  su_int *b, unsigned int n);

/// Computes the signed division of a / b.
/// See __udivmodei5 for details.
COMPILER_RT_ABI void __divmodei5(su_int *quo, su_int *rem, su_int *a, su_int *b,
                                 unsigned int words);
```
into builtins.
In addition it introduces a new "bitint" library containing only those new functions,
which is meant as a way to provide those when using libgcc as runtime.

[0] https://discourse.llvm.org/t/rfc-add-support-for-division-of-large-bitint-builtins-selectiondag-globalisel-clang/60329

Differential Revision: https://reviews.llvm.org/D120327
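
To make the contract of a word-array division concrete, here is a minimal shift-subtract sketch. It is not the algorithm used in compiler-rt. `toyUDivMod` and `Word` are hypothetical names, words are assumed to be stored least significant first, the dividend is taken as n words (without the extra scratch word), the inputs are left unmodified, and the divisor must be non-zero.

```cpp
#include <cstdint>

using Word = std::uint32_t; // hypothetical 32-bit word type

// Computes quo = a / b and rem = a % b for n-word unsigned integers stored
// least-significant word first. Precondition: b is non-zero.
void toyUDivMod(Word *quo, Word *rem, const Word *a, const Word *b,
                unsigned n) {
  for (unsigned i = 0; i < n; ++i)
    quo[i] = rem[i] = 0;

  // Classic binary long division, from the most significant bit down.
  for (unsigned bit = n * 32; bit-- > 0;) {
    // rem = (rem << 1) | next dividend bit. Nothing is lost off the top,
    // because rem never exceeds the dividend bits consumed so far.
    Word in = (a[bit / 32] >> (bit % 32)) & 1u;
    for (unsigned i = 0; i < n; ++i) {
      const Word out = rem[i] >> 31;
      rem[i] = (rem[i] << 1) | in;
      in = out;
    }

    // Compare rem with b, most significant word first.
    bool remGeB = true;
    for (unsigned i = n; i-- > 0;) {
      if (rem[i] != b[i]) {
        remGeB = rem[i] > b[i];
        break;
      }
    }

    if (remGeB) {
      // rem -= b with borrow propagation, then record a 1 in the quotient.
      Word borrow = 0;
      for (unsigned i = 0; i < n; ++i) {
        const std::uint64_t diff = std::uint64_t(rem[i]) - b[i] - borrow;
        rem[i] = static_cast<Word>(diff);
        borrow = static_cast<Word>(diff >> 63);
      }
      quo[bit / 32] |= Word(1) << (bit % 32);
    }
  }
}
```

This runs in time proportional to the square of the word count per quotient bit group, so it is useful as a reference for checking results rather than as a fast implementation.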

bf2dc4b3