Commits · 3077bc90de8df633d59fe30c2a2aa265d68fb987 · Panda / LLVM project

This project is mirrored from https://github.com/llvm/llvm-project.git. Pull mirroring failed 3 years ago.
Repository mirroring has been paused due to too many failed attempts. It can be resumed by a project maintainer or owner.
Last successful update 3 years ago.

Oct 01, 2021

[NFC] Restore magic and magicu to a globally visible location · 3077bc90

Christopher Tetreault authored 3 years ago

While these functions are only used in one location in upstream,
it has been reused in multiple downstreams. Restore this file to
a globally visibile location (outside of APInt.h) to eliminate
donwstream breakage and enable potential future reuse.

Additionally, this patch renames types and cleans up
clang-tidy issues.

3077bc90

add tsan shared library · 91bfccf8
ZijunZhao authored 3 years ago

91bfccf8
[NFC][sanitizer] Add const into method · 5c3568d0
Vitaly Buka authored 3 years ago

5c3568d0

BPF: implement isLegalAddressingMode() properly · 3562ad3e

Yonghong Song authored 3 years ago

Latest upstream llvm caused the kernel bpf selftest emitting the
following warnings:

  In file included from progs/profiler3.c:6:
  progs/profiler.inc.h:489:2: warning: loop not unrolled:
    the optimizer was unable to perform the requested transformation;
    the transformation might be disabled or specified as part of an unsupported
    transformation ordering [-Wpass-failed=transform-warning]
          for (int i = 0; i < MAX_PATH_DEPTH; i++) {
          ^

Further bisecting shows this SimplifyCFG patch ([1]) changed
the condition on how to fold branch to common dest. This caused
some unroll pragma is not honored in selftests/bpf.

The patch [1] test getUserCost() as the condition to
perform the certain basic block folding transformation.
For the above example, before the loop unroll pass, the control flow
looks like:
    cond_block:
       branch target: body_block, cleanup_block
    body_block:
       branch target: cleanup_block, end_block
    end_block:
       branch target: cleanup_block, end10_block
    end10_block:
       %add.ptr = getelementptr i8, i8* %payload.addr.0, i64 %call2
       %inc = add nuw nsw i32 %i.0, 1
       branch target: cond_block

In the above, %call2 is an unknown scalar.

Before patch [1], end10_block will be folded into end_block, forming
the code like
    cond_block:
       branch target: body_block, cleanup_block
    body_block:
       branch target: cleanup_block, end_block
    end_block:
       branch target: cleanup_block, cond_block
and the compiler is happy to perform unrolling.

With patch [1], getUserCost(), which calls getGEPCost(), which calls
isLegalAddressingMode() in TargetLoweringBase.cpp, considers IR
  %add.ptr = getelementptr i8, i8* %payload.addr.0, i64 %call2
is free, so the above basic block folding transformation is not performed
and unrolling does not happen.

For BPF target, the IR
  %add.ptr = getelementptr i8, i8* %payload.addr.0, i64 %call2
is not free and we don't have ld/st instruction address with 'r+r' mode.

This patch implemented a BPF hook for isLegalAddressingMode(), which is
identical to Mips isLegalAddressingMode() implementation where
the address pattern like 'r+r', 'r+r+i' or '2*r' are not allowed.
With testing kernel bpf selftests, all loop not unrolled warnings
are gone and all selftests run successfully.

  [1] https://reviews.llvm.org/D108837

Differential Revision: https://reviews.llvm.org/D110789

3562ad3e

[test] Add tests covering a missing opt in SCEV's isSCEVExprNeverPoison · bdb5aa65
Philip Reames authored 3 years ago

bdb5aa65

[libcxx][test] Use python specified by build rather than system default python · 9f641c96

Leonard Chan authored 3 years ago

As of e9564c36, libcxx/gdb/gdb_pretty_printer_test.sh.cpp
fails locally for me because the REQUIRES check for host-has-gdb-with-python
uses python, which for me expands to python 2.7.18. This failure does not seem
to be caught on any upstream builders, potentially because they don't have gdb,
python, or a version of python that makes the test UNSUPPORTED (like python3).

This updates the check to use the python specified by the build (which should
be the python that runs this code), rather than just python.

Differential Revision: https://reviews.llvm.org/D110887

9f641c96

[SCEV] Modernize code style of isSCEVExprNeverPoison [NFC] · c5e491e6
Philip Reames authored 3 years ago
```
Use for-range and all_of to make code easier to read in advance of other changes.
```
c5e491e6

[MemProf] Record accesses for all words touched in mem intrinsic · 0d8bdc17

Teresa Johnson authored 3 years ago

Previously for mem* intrinsics we only incremented the access count for
the first word in the range. However, after thinking it through I think
it makes more sense to record an access for every word in the range.
This better matches the behavior of inlined memory intrinsics, and also
allows better analysis of utilization at a future date.

Differential Revision: https://reviews.llvm.org/D110799

0d8bdc17

[MC] Fix buildbots with shared lib builds · c82f98ba

Rafael Auler authored 3 years ago

In D109412 I forgot to add a dependency on libObject. Fix that.

Reviewed By: maksfb

Differential Revision: https://reviews.llvm.org/D110886

c82f98ba

[GlobalISel] Extend CombinerHelper::matchConstantOp() to match constant splat vectors. · ca8316b7
Amara Emerson authored 3 years ago
```
This allows the "x op 0 -> x" fold to optimize vector constant RHSs.

Differential Revision: https://reviews.llvm.org/D110802
```
ca8316b7

[flang][NFC] Add debug dump method to evaluate::Expr and semantics::Symbol · fdcbb540

Jean Perier authored 3 years ago


Helps debugging when working with symbol/expression issue. The dump
method is easy to call in the debugger.

Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>

Differential Revision: https://reviews.llvm.org/D110856

fdcbb540

[RISCV] Remove Zbproposedc extension · a21c5579

Craig Topper authored 3 years ago

This consists of 3 compressed instructions, c.not, c.neg, and c.zext.w.
I believe these have been picked up by the Zce effort using different
encodings. I don't think it makes sense to keep them in bitmanip. It
will eventually cause a conflict if/when Zce is implemented in llvm.

Differential Revision: https://reviews.llvm.org/D110871

a21c5579

[flang] Take into account SubprogramDetails in GetInterfaceSymbol · 962e503c

Jean Perier authored 3 years ago

When the ProcRef is Symbol is a SubprogramDetails, the interface is
the SubprogramDetails. Do not return nullptr.

Differential Revision: https://reviews.llvm.org/D110853

962e503c

[openmp][docs] Describe how the internal components are found · 72e8a4c4

Jon Chesterfield authored 3 years ago

Add a FAQ entry about the names of openmp offloading components
and how they are searched for.

Reviewed By: jhuber6

Differential Revision: https://reviews.llvm.org/D109619

72e8a4c4

[flang][NFC] Fix header comments in some runtime headers · cf1f5fbd
Jean Perier authored 3 years ago
```
Differential Revision: https://reviews.llvm.org/D110850
```
cf1f5fbd
[CMake] Remove the LLD LTO check for Darwin · 0c4a75f1
Petr Hosek authored 3 years ago
```
LLD now supports LTO on Darwin.

Differential Revision: https://reviews.llvm.org/D110881
```
0c4a75f1

[compiler-rt] Add -fno-omit-frame-pointer check to builtins · 72e7e15a

Gwen Mittertreiner authored 3 years ago

rG210d72e9d6b4a8e7633921d0bd7186fd3c7a2c8c moved the check from
builtin-config-ix to config-ix so that the check would be made even when
the builtins are not built. However, now the check is no longer made
when the builtins are built standalone which causes the builtins to fail
to build.

Add the check back to builtins-config-ix so that the check gets
performed both when the builtins are not built, and when they are built
standalone.

Reviewed By: smeenai

Differential Revision: https://reviews.llvm.org/D110879

72e7e15a

[openmp] Add addrspacecast to getOrCreateIdent · 32473291

Jon Chesterfield authored 3 years ago

Fixes 51982. Adds a missing CreatePointerCast and allocates a global in
the correct address space.

Test case derived from https://github.com/ROCm-Developer-Tools/aomp/\
blob/aomp-dev/test/smoke/nest_call_par2/nest_call_par2.c by deleting
parts while checking the assertion failure still occurred.

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D110556

32473291

[libomptarget] Apply D110029 to amdgpu · b75a7481

Jon Chesterfield authored 3 years ago

Use enum for execution mode.

This is partly a port from ROCm and partly a port from D110029. Attempted to
make the same choices as ROCm as far as comments etc go to reduce the merge
conflicts.

There is some cleanup warranted here - in particular I like the cuda patch
factoring out the comparisons into named variables - but I'd like to leave
that for a follow up patch, keeping this one minimal.

Reviewed By: carlo.bertolli

Differential Revision: https://reviews.llvm.org/D110845

b75a7481

[cora async] Cleanup undefined llvm.coro.async.resume · 2df2b27d

Arnold Schwaighofer authored 3 years ago

In situations where the coroutine function is not split we can just
replace the async.resume by null.

rdar://82591919

Differential Revision: https://reviews.llvm.org/D110191

2df2b27d

[mlir][Linalg] Refactor comprehensive bufferize for external uses - NFC · b016bd12

Nicolas Vasilache authored 3 years ago

This revision exposes some minimal funcitonality to allow comprehensive
bufferization to interop with external projects.

Differential Revision: https://reviews.llvm.org/D110875

b016bd12

[AIX] Rename binder option for PGO support · 2443320d
Jinsong Ji authored 3 years ago
```
Update the binder option.
```
2443320d

Revert "Recommit "[SCEV] Look through single value PHIs." (take 2)" · 1fbdbb55

Florian Hahn authored 3 years ago

This reverts commit 764d9aa9.

This patch exposed a few additional cases where SCEV expressions are not
properly invalidated.

See PR52024, PR52023.

Unverified

1fbdbb55

[DFSan] Optimize code for writing to shadow. Move SetShadow to namespace. · d81723c9

Andrew Browne authored 3 years ago

Writing zeros to shadow (including checking for existing zero) is now ~2x
faster on one example.

Reviewed By: morehouse

Differential Revision: https://reviews.llvm.org/D110733

d81723c9

[gn build] Port 050edef8 · 0337e228
LLVM GN Syncbot authored 3 years ago

0337e228

[MC] Make MCDwarfLineStr class public · 050edef8

Maksim Panchenko authored 3 years ago

Add MCDwarfLineStr class to the public API.

Note that MCDwarfLineTableHeader::Emit(), takes MCDwarfLineStr as
an Optional<> parameter making it impossible to use the API if the class
is not publicly defined.

Reviewed By: alexander-shaposhnikov

Differential Revision: https://reviews.llvm.org/D109412

050edef8

[PowerPC] Improved codegen related to xscvdpsxws/xscvdpuxws · 4195ed99

Albion Fung authored 3 years ago

This patch removes the uneccessary mf/mtvsr generated in conjunction
with xscvdpsxws/xscvdpuxws.

Differential revision: https://reviews.llvm.org/D109902

4195ed99

[GlobalISel] Extend G_SELECT of known condition combine to vectors. · 80f4bb5c

Amara Emerson authored 3 years ago

Adds a new utility function: isConstantOrConstantSplatVector().

Differential Revision: https://reviews.llvm.org/D110786

80f4bb5c

[flang] Fold FINDLOC() · 82568675

Peter Klausler authored 3 years ago

Fold the transformational intrinsic function FINDLOC() for
all combinations of optional arguments and data types.

Differential Revision: https://reviews.llvm.org/D110757

82568675

[InstCombine] restrict shift-trunc-shift fold to opposite direction shifts · 3fcb00df

Sanjay Patel authored 3 years ago

This is NFCI because the pattern with 2 left-shifts should get
folded independently by smaller folds.

The motivation is to refine this block to avoid infinite loops
seen with D110170.

3fcb00df

[InstCombine] add tests for shift-trunc-shift; NFC · 66c069d7
Sanjay Patel authored 3 years ago

66c069d7
Reland "[clang-cl] Accept `#pragma warning(disable : N)` for some N" · e31899c7
Nico Weber authored 3 years ago
```
This reverts commit 0cd9d8a4 and
adds the changes described in https://reviews.llvm.org/D110668#3034461.
```
e31899c7

[BasicAA] Move more extension logic into ExtendedValue (NFC) · b989211d

Nikita Popov authored 3 years ago

Add methods to appropriately extend KnownBits/ConstantRange there,
same as with APInt. Also clean up the known bits handling by
actually doing that extension rather than checking ZExtBits. This
doesn't matter now, but becomes relevant once truncation is
involved.

b989211d

[mlir][sparse] Correcting a few typos · 21895486
wren romano authored 3 years ago
```
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D110773
```
21895486

[clang] Don't modify OptRemark if the argument is not relevant · 76902079

Arthur Eubanks authored 3 years ago

A followup to D110201.

 For example, we'd set OptimizationRemarkMissed's Regex to '.*' when
encountering -Rpass. Normally this doesn't actually affect remarks we
emit because in clang::ProcessWarningOptions() we'll separately look at
all -R arguments and turn on/off corresponding diagnostic groups.
However, this is reproducible with -round-trip-args.

Reviewed By: JamesNagurne

Differential Revision: https://reviews.llvm.org/D110673

76902079

[flang] Fix test regression from SQRT folding · 691814f9

peter klausler authored 3 years ago

The algorithm used to fold SQRT has some holes that
led to test failures; debug and add more tests.

Differential Revision: https://reviews.llvm.org/D110744

691814f9

[clang] Make crash reproducer work with clang-cl · 8dfbe9b0

Nico Weber authored 3 years ago

When clang crashes, it writes a standalone source file and shell script
to reproduce the crash.

The Driver used to set `Mode = CPPMode` in generateCompilationDiagnostics()
to force preprocessing mode. This has the side effect of making
IsCLMode() return false, which in turn meant Clang::AddClangCLArgs()
didn't get called when creating the standalone source file, which meant
the stand-alone file was preprocessed with the gcc driver's defaults
In particular, exceptions default to on with the gcc driver, but to
off with the cl driver. The .sh script did use the original command
line, so in the reproducer for a clang-cl crash, the standalone source
file could contain exception-using code after preprocessing that the
compiler invocation in the shell script would then complain about.

This patch removes the `Mode = CPPMode;` line and instead additionally
checks for `CCGenDiagnostics` in most places that check `CCCIsCPP().
This also matches the strategy Clang::ConstructJob() uses to add
-frewrite-includes for creating the standalone source file for a crash
report.

Fixes PR52007.

Differential Revision: https://reviews.llvm.org/D110783

8dfbe9b0

[clang] do not emit note for bad conversion when destination type qualifiers... · dbaa4083

Zequan Wu authored 3 years ago

[clang] do not emit note for bad conversion when destination type qualifiers are not compatibly include source type qualifiers

llvm.org/PR52014

Differential Revision: https://reviews.llvm.org/D110780

dbaa4083

[clang] Remove duplication in types::getCompilationPhases() · fa32fd3b

Nico Weber authored 3 years ago

Call Driver::getFinalPhase() instead of duplicating it.

https://reviews.llvm.org/D65993 added the duplication, then
02e35832 maded it more obviously a copy of getFinalPhase().

The only difference is that getCompilationPhases() used to use
LastPhase / IfsMerge where getFinalPhase() used Link. Adapt
getFinalPhase() to return IfsMerge when needed.

No intentional behavior change.

Differential Revision: https://reviews.llvm.org/D110770

fa32fd3b

[libc++abi][NFCI] Consistently group new_handler, unexpected_handler and terminate_handler · 6714e1ce

Louis Dionne authored 3 years ago

Previously, the definitions of __cxa_terminate_handler and __cxa_unexpected_handler
(and their set_xxx_handler functions) were grouped together, but the
definition of __cxa_new_handler wasn't. This commit simply moves those
to the same file to treat all handlers consistently.

6714e1ce