llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-10-10 12:55:00 +00:00

Author	SHA1	Message	Date
Dehao Chen	6e0c8446db	Invoke add-discriminator at -g0 -fsample-profile Summary: -fsample-profile needs discriminator, which will not be added if built with -g0. This patch makes sure the discriminator is added for sample-profile at -g0. A followup patch will be send out to update clang tests. Reviewers: davidxl, dblaikie, echristo, dnovillo Subscribers: mehdi_amini, probinson, llvm-commits Differential Revision: https://reviews.llvm.org/D25132 llvm-svn: 283565	2016-10-07 15:21:31 +00:00
Matthew Simpson	a371c14ffe	[LV] Don't mark multi-use branch conditions uniform Previously, we marked the branch conditions of latch blocks uniform after vectorization if they were instructions contained in the loop. However, if a condition instruction has users other than the branch, it may not remain uniform. This patch ensures the conditions we mark uniform are only used by the branch. This should fix PR30627. Reference: https://llvm.org/bugs/show_bug.cgi?id=30627 llvm-svn: 283563	2016-10-07 15:20:13 +00:00
Krzysztof Parzyszek	e513e17b23	Only track physical registers in LivePhysRegs llvm-svn: 283561	2016-10-07 14:50:49 +00:00
Sam Kolton	a3ec5c10e2	[AMDGPU] Assembler: support v_mac_f32 DPP and SDWA. Move getNamedOperandIdx to AMDGPUBaseInfo.h Reviewers: artem.tamazov, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D25084 llvm-svn: 283560	2016-10-07 14:46:06 +00:00
Simon Pilgrim	02f623e74c	[X86][SSE] Tidied up tests - use standard check prefixes llvm-svn: 283559	2016-10-07 14:42:22 +00:00
Konstantin Zhuravlyov	c09e2d7e46	[AMDGPU] AMDGPUCodeGenPrepare: remove extra ';' llvm-svn: 283558	2016-10-07 14:39:53 +00:00
Tom Stellard	17eb3413cd	[ValueTracking] Fix crash in GetPointerBaseWithConstantOffset() Summary: While walking defs of pointer operands we were assuming that the pointer size would remain constant. This is not true, because addresspacecast instructions may cast the pointer to an address space with a different pointer width. This partial reverts r282612, which was a more conservative solution to this problem. Reviewers: reames, sanjoy, apilipenko Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D24772 llvm-svn: 283557	2016-10-07 14:23:29 +00:00
Konstantin Zhuravlyov	f74fc60a7d	[AMDGPU] Promote uniform (i1, i16] operations to i32 Differential Revision: https://reviews.llvm.org/D25302 llvm-svn: 283555	2016-10-07 14:22:58 +00:00
Benjamin Kramer	d47feecc45	Remove spurious non-printable character from source file. NFC. llvm-svn: 283552	2016-10-07 13:46:38 +00:00
Javed Absar	9797989ca7	[ARM]: add missing switch case for cortex-r52 Adds a missing switch case for handling cortex-r52 in init-subtarget-features. llvm-svn: 283551	2016-10-07 13:41:55 +00:00
Martin Storsjo	04864f45b2	[ARM] Reapply: Use __rt_div functions for divrem on Windows Reapplying r283383 after revert in r283442. The additional fix is a getting rid of a stray space in a function name, in the refactoring part of the commit. This avoids falling back to calling out to the GCC rem functions (__moddi3, __umoddi3) when targeting Windows. The __rt_div functions have flipped the two arguments compared to the __aeabi_divmod functions. To match MSVC, we emit a check for division by zero before actually calling the library function (even if the library function itself also might do the same check). Not all calls to __rt_div functions for division are currently merged with calls to the same function with the same parameters for the remainder. This is more wasteful than a div + mls as before, but avoids calls to __moddi3. Differential Revision: https://reviews.llvm.org/D25332 llvm-svn: 283550	2016-10-07 13:28:53 +00:00
Javed Absar	fb4b6e8db9	[ARM]: Add Cortex-R52 target to LLVM This patch adds Cortex-R52, the new ARM real-time processor, to LLVM. Cortex-R52 implements the ARMv8-R architecture. llvm-svn: 283542	2016-10-07 12:06:40 +00:00
Simon Pilgrim	a5d019ee95	[X86][SSE] Update register class during MOVSD/MOVSS - BLENDPD/BLENDPS commutation MOVSD/MOVSS take a 128-bit register and a FR32/FR64 register input, the commutation code wasn't taking this into account leading to verification errors. This patch inserts a vreg copy mi to ensure that the registers are correct. Fix for PR30607 Differential Revision: https://reviews.llvm.org/D25280 llvm-svn: 283539	2016-10-07 11:18:38 +00:00
Alexey Bataev	6ad5da7c81	[SLPVectorizer] Fix for PR25748: reduction vectorization after loop unrolling. The next code is not vectorized by the SLPVectorizer: ``` int test(unsigned int *p) { int sum = 0; for (int i = 0; i < 8; i++) sum += p[i]; return sum; } ``` During optimization this loop is fully unrolled and SLPVectorizer is unable to vectorize it. Patch tries to fix this problem. Differential Revision: https://reviews.llvm.org/D24796 llvm-svn: 283535	2016-10-07 09:39:22 +00:00
Oliver Stannard	4df1cc0b00	[ARM] Don't convert switches to lookup tables of pointers with ROPI/RWPI With the ROPI and RWPI relocation models we can't always have pointers to global data or functions in constant data, so don't try to convert switches into lookup tables if any value in the lookup table would require a relocation. We can still safely emit lookup tables of other values, such as simple constants. Differential Revision: https://reviews.llvm.org/D24462 llvm-svn: 283530	2016-10-07 08:48:24 +00:00
Mehdi Amini	68c6c8cd78	Use StringRef in ARMELFStreamer (NFC) llvm-svn: 283529	2016-10-07 08:48:07 +00:00
Nicolai Haehnle	87bc4c218b	AMDGPU: Fix use-after-free in SIOptimizeExecMasking Summary: There was a bug with sequences like s_mov_b64 s[0:1], exec s_and_b64 s[2:3]<def>, s[0:1], s[2:3]<kill> ... s_mov_b64_term exec, s[2:3] because s[2:3] was defined and used in the same instruction, ending up with SaveExecInst inside OtherUseInsts. Note that the test case also exposes an unrelated bug. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98028 Reviewers: tstellarAMD, arsenm Subscribers: kzhuravl, wdng, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D25306 llvm-svn: 283528	2016-10-07 08:40:14 +00:00
Mehdi Amini	a0016ec95f	Use StringReg in TargetParser APIs (NFC) llvm-svn: 283527	2016-10-07 08:37:29 +00:00
Mehdi Amini	9ff8e87ca4	Revert "Revert "Add a static_assert to enforce that parameters to llvm::format() are not totally unsafe"" This reverts commit r283510 and reapply r283509, with updates to clang-tools-extra as well. llvm-svn: 283525	2016-10-07 08:25:42 +00:00
Craig Topper	948625633f	[X86] Fix patterns for VPMULLD and VPCMPEQQ to not require aligned loads. llvm-svn: 283524	2016-10-07 06:54:43 +00:00
Craig Topper	871da8ebea	[X86] Remove unused PatFrags. NFC llvm-svn: 283523	2016-10-07 06:54:39 +00:00
Dylan McKay	e5d89e8001	[AVR] Add the AVRMCInstLower class Summary: This class deals with the lowering of CodeGen `MachineInstr` objects to MC `MCInst` objects. Reviewers: kparzysz, arsenm Subscribers: wdng, beanz, japaric, mgorny Differential Revision: https://reviews.llvm.org/D25269 llvm-svn: 283522	2016-10-07 06:13:09 +00:00
Matt Arsenault	93401f4b5e	AMDGPU: Change check prefix in test llvm-svn: 283521	2016-10-07 03:55:04 +00:00
Hal Finkel	5d41f03215	[llvm-opt-report] Left justify unrolling counts, etc. In the left part of the reports, we have things like U<number>; if some of these numbers use more digits than others, we don't want a space in between the U and the start of the number. Instead, the space should come afterward. This way it is clear that the number goes with the U and not any other optimization indicator that might come later on the line. Tests committed in r283518. llvm-svn: 283519	2016-10-07 02:01:03 +00:00
Hal Finkel	bd5a172d9c	[llvm-opt-report] Left justify unrolling counts, etc. In the left part of the reports, we have things like U<number>; if some of these numbers use more digits than others, we don't want a space in between the U and the start of the number. Instead, the space should come afterward. This way it is clear that the number goes with the U and not any other optimization indicator that might come later on the line. llvm-svn: 283518	2016-10-07 01:57:06 +00:00
David Majnemer	8c03c1bade	[SimplifyCFG] Correctly test for unconditional branches in GetCaseResults GetCaseResults assumed that a terminator with one successor was an unconditional branch. This is not necessarily the case, it could be a cleanupret. Strengthen the check by querying whether or not the terminator is exceptional. llvm-svn: 283517	2016-10-07 01:38:35 +00:00
Hal Finkel	16d29e3111	[llvm-opt-report] Use -no-demangle to disable demangling As this is intended to be a user-facing option, -no-demangle seems much better than -demangle=0. Add testing for the option. llvm-svn: 283516	2016-10-07 01:30:59 +00:00
Peter Collingbourne	2261d78cd2	Target: Remove unused patterns and transforms. NFC. llvm-svn: 283515	2016-10-07 00:30:49 +00:00
Colin LeMahieu	8ed1aee9dd	[Hexagon] NFC Removing 'V4_' prefix from duplex instruction names. llvm-svn: 283514	2016-10-07 00:15:07 +00:00
Michael Kuperstein	5185b7dde3	[LV] Remove triples from target-independent vectorizer tests. NFC. Vectorizer tests in the target-independent directory should not have a target triple. If a test really needs to query a specific backend, it belongs in the right target subdirectory (which "REQUIRES" the right backend). Otherwise, it should not specify a triple. llvm-svn: 283512	2016-10-06 23:57:25 +00:00
Mehdi Amini	292f376934	Revert "Add a static_assert to enforce that parameters to llvm::format() are not totally unsafe" This reverts commit r283509, clang is hitting the assert. llvm-svn: 283510	2016-10-06 23:41:49 +00:00
Mehdi Amini	a7e893f638	Add a static_assert to enforce that parameters to llvm::format() are not totally unsafe Summary: I had for the second time today a bug where llvm::format("%s", Str) was called with Str being a StringRef. The Linux and MacOS bots were fine, but windows having different calling convention, it printed garbage. Instead we can catch this at compile-time: it is never expected to call a C vararg printf-like function with non scalar type I believe. Reviewers: bogner, Bigcheese, dexonsmith Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25266 llvm-svn: 283509	2016-10-06 23:26:29 +00:00
Colin LeMahieu	9675de5ba8	[Hexagon] NFC. Canonicalizing absolute address instruction names. llvm-svn: 283507	2016-10-06 23:02:11 +00:00
Vedant Kumar	7beb423765	Delete some dead code in SelectionDAG (NFC) Differential Revision: https://reviews.llvm.org/D24435 llvm-svn: 283505	2016-10-06 22:53:43 +00:00
Dan Gohman	2726b88c03	[WebAssemby] Implement block signatures. Per spec changes, this implements block signatures, and adds just enough logic to produce correct block signatures at the ends of functions. Differential Revision: https://reviews.llvm.org/D25144 llvm-svn: 283503	2016-10-06 22:29:32 +00:00
Dan Gohman	3a643e8d46	[WebAssembly] Remove loop's bottom label. Per spec changes, loop constructs no longer have a bottom label. https://reviews.llvm.org/D25118 llvm-svn: 283502	2016-10-06 22:10:23 +00:00
Dan Gohman	7f1bdb2e02	[WebAssembly] Remove the output operand from stores. Per spec changes, store instructions in WebAssembly no longer have a return value. Update the instruction descriptions. Differential Revision: https://reviews.llvm.org/D25122 llvm-svn: 283501	2016-10-06 22:08:28 +00:00
Wolfgang Pieb	e51bede1d8	Preserve the debug location when CodeGenPrepare sinks a compare instruction into the basic block of a user. Patch by Andrea DiBiagio. Differential Revision: https://reviews.llvm.org/D24632 llvm-svn: 283500	2016-10-06 21:43:45 +00:00
Pirama Arumuga Nainar	cc152ac794	Handle *_EXTEND_VECTOR_INREG during Integer Legalization Summary: These nodes need legalization for 3-element vectors. This commit handles the legalization and adds tests for zext and sext. This fixes PR30614. Reviewers: RKSimon, srhines Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25268 llvm-svn: 283496	2016-10-06 21:27:05 +00:00
Rong Xu	0e79f7d11d	[PGO] Create weak alias for the renamed Comdat function Add a weak alias to the renamed Comdat function in IR level instrumentation, using it's original name. This ensures the same behavior w/ and w/o IR instrumentation, even for non standard conforming code. Differential Revision: http://reviews.llvm.org/D25339 llvm-svn: 283490	2016-10-06 20:38:13 +00:00
Michael Kuperstein	e524e22846	[X86] Preserve BasePtr for LEA64_32r When replacing FrameIndex with BasePtr, we must preserve BasePtr for LEA64_32r since BasePtr is used later for stack adjustment if it is the same as StackPtr. Patch by H.J Lu <hjl.tools@gmail.com> Differential Revision: https://reviews.llvm.org/D23575 llvm-svn: 283486	2016-10-06 19:31:27 +00:00
Simon Pilgrim	bddb412896	[X86][SSE] Add f16/f80/f128 vector sitofp test cases As discussed on D23808 llvm-svn: 283485	2016-10-06 19:29:25 +00:00
Michael Kuperstein	7cc2123847	[DAG] Generalize build_vector -> vector_shuffle combine for more than 2 inputs This generalizes the build_vector -> vector_shuffle combine to support any number of inputs. The idea is to create a binary tree of shuffles, where the first layer performs pairwise shuffles of the input vectors placing each input element into the correct lane, and the rest of the tree blends these shuffles together. This doesn't try to be smart and create any sort of "optimal" shuffles. The assumption is that even a "poor" shuffle sequence is better than extracting and inserting the elements one by one. Differential Revision: https://reviews.llvm.org/D24683 llvm-svn: 283480	2016-10-06 18:58:24 +00:00
Michael Ilseman	6d6b4d87a3	Revert "Add -strip-nonlinetable-debuginfo capability" This reverts commit r283473. Reverted until review is completed. llvm-svn: 283478	2016-10-06 18:30:26 +00:00
Matt Arsenault	5e63a04e46	AMDGPU: Don't fold undef uses or copies with implicit uses llvm-svn: 283476	2016-10-06 18:12:13 +00:00
Matt Arsenault	c59a92387e	AMDGPU: Remove scheduling info from si_mask_branch llvm-svn: 283475	2016-10-06 18:12:07 +00:00
Michael Ilseman	d0a4db7632	Add -strip-nonlinetable-debuginfo capability This adds a new function to DebugInfo.cpp that takes an llvm::Module as input and removes all debug info metadata that is not directly needed for line tables, thus effectively stripping all type and variable information from the module. The primary motivation for this feature was the bitcode work flow (cf. http://lists.llvm.org/pipermail/llvm-dev/2016-June/100643.html for more background). This is not wired up yet, but will be in subsequent patches. For testing, the new functionality is exposed to opt with a -strip-nonlinetable-debuginfo option. The secondary use-case (and one that works right now!) is as a reduction pass in bugpoint. I added two new bugpoint options (-disable-strip-debuginfo and -disable-strip-debug-types) to control the new features. By default it will first attempt to remove all debug information, then only the type info, and then proceed to hack at any remaining MDNodes. llvm-svn: 283473	2016-10-06 17:58:38 +00:00
Matt Arsenault	c2ee42cd16	AMDGPU: Remove leftover implicit operands when folding immediates When constant folding an operation to a copy or an immediate mov, the implicit uses/defs of the old instruction were left behind, e.g. replacing v_or_b32 left the implicit exec use on the new copy. llvm-svn: 283471	2016-10-06 17:54:30 +00:00
Matt Arsenault	11f7402075	Reapply "AMDGPU: Support using tablegened MC pseudo expansions" Fix bad merge llvm-svn: 283470	2016-10-06 17:19:11 +00:00
Matt Arsenault	cbc879ee2f	Revert "AMDGPU: Support using tablegened MC pseudo expansions" llvm-svn: 283469	2016-10-06 17:08:01 +00:00
Matt Arsenault	d20a2dd7ac	AMDGPU: Support using tablegened MC pseudo expansions Make the necessary refactorings to make use of PseudoInstExpansion llvm-svn: 283467	2016-10-06 16:56:41 +00:00
Brian Gesiak	49f8c02eb7	[docs] Add PR to Lexicon Summary: The acronym PR could be ambiguous to some users, especially those who are used to interpreting it as GitHub's "pull request". Reviewers: ddunbar, jordan_rose, void, beanz Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25331 llvm-svn: 283465	2016-10-06 16:39:22 +00:00
Matt Arsenault	6bc43d8627	BranchRelaxation: Support expanding unconditional branches AMDGPU needs to expand unconditional branches in a new block with an indirect branch. llvm-svn: 283464	2016-10-06 16:20:41 +00:00
Krzysztof Parzyszek	d391d6f1c3	[Hexagon] Avoid replacing full regs with subregisters in tied operands Doing so will result in the two-address pass generating incorrect code. llvm-svn: 283463	2016-10-06 16:18:04 +00:00
Matt Arsenault	ef5bba0136	BranchRelaxation: Account for function alignment llvm-svn: 283462	2016-10-06 16:00:58 +00:00
Matt Arsenault	36919a4f7c	Move AArch64BranchRelaxation to generic code llvm-svn: 283459	2016-10-06 15:38:53 +00:00
Matt Arsenault	0a3ea89e85	AArch64: Move remaining target specific BranchRelaxation bits to TII llvm-svn: 283458	2016-10-06 15:38:09 +00:00
Nirav Dave	ee554e6155	[X86] Fix intel syntax push parsing bug Change erroneous parsing of push immediate instructions in intel syntax to default to pointer size by rewriting into the ATT style for matching. This fixes PR22028. Reviewers: majnemer, rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D25288 llvm-svn: 283457	2016-10-06 15:28:08 +00:00
Mehdi Amini	a5ee89863c	Revert "Use StringRef in LTOModule implementation (NFC)" This reverts commit r282997, a windows bot is asserting in one test apparently. llvm-svn: 283456	2016-10-06 15:12:22 +00:00
Rafael Espindola	d9525a166d	Centralize sh_entsize checking. llvm-svn: 283455	2016-10-06 15:08:10 +00:00
Rafael Espindola	c3befb2e39	Refactor to use getSectionContentsAsArray. This centralizes quite a bit of error checking. llvm-svn: 283454	2016-10-06 14:47:04 +00:00
Rafael Espindola	6bc2990d16	Refactor duplicated typedefs. NFC. llvm-svn: 283453	2016-10-06 14:07:26 +00:00
Tim Northover	fe6fec9f65	GlobalISel: fix misuse of using declaration in test. Clang didn't diagnose it before. Oops. llvm-svn: 283451	2016-10-06 13:57:31 +00:00
Sam Kolton	3381d7a216	[AMDGPU] Disassembler: print label names in branch instructions Summary: Add AMDGPUSymbolizer for finding names for labels from ELF symbol table. Initialize MCObjectFileInfo with some default values. Reviewers: vpykhtin, artem.tamazov, tstellarAMD Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, tony-tye Differential Revision: https://reviews.llvm.org/D24802 llvm-svn: 283450	2016-10-06 13:46:08 +00:00
Anna Thomas	488c05763c	[RS4GC] Fix comment to show TODO. NFC llvm-svn: 283449	2016-10-06 13:24:20 +00:00
Rafael Espindola	28c63d3ed8	Use range loop. NFC. llvm-svn: 283447	2016-10-06 13:11:12 +00:00
Krzysztof Parzyszek	459a1c9f2b	[RDF] Replace some expensive copies with references in range-based loops llvm-svn: 283446	2016-10-06 13:05:46 +00:00
Krzysztof Parzyszek	61d9032bf3	[RDF] Replace potentially unclear autos with real types llvm-svn: 283445	2016-10-06 13:05:13 +00:00
Hal Finkel	4d6f3088c3	[llvm-opt-report] Record VF, etc. correctly for multiple opts on one line When there are multiple optimizations on one line, record the vectorization factors, etc. correctly (instead of incorrectly substituting default values). llvm-svn: 283443	2016-10-06 11:58:52 +00:00
Diana Picus	6341e46cd1	Revert "[ARM] Use __rt_div functions for divrem on Windows" This reverts commit r283383 because it broke some of the bots: undefined reference to ` __aeabi_uldivmod' It affected (at least) clang-cmake-armv7-a15-selfhost, clang-cmake-armv7-a15-selfhost and clang-native-arm-lnt. llvm-svn: 283442	2016-10-06 11:24:29 +00:00
Hal Finkel	47faf3be89	[llvm-opt-report] Print line numbers starting from 1 Line numbers should start from 1, not 2. llvm-svn: 283440	2016-10-06 11:11:11 +00:00
Henric Karlsson	54a53bd303	Test commit access (NFC) llvm-svn: 283439	2016-10-06 10:58:41 +00:00
Matt Arsenault	10c17ca6c6	AMDGPU: Partially fix reported code size for some instructions These ones need to have the size on the pseudo instruction set for getInstSizeInBytes to work correctly. These also have a statically known size. llvm-svn: 283437	2016-10-06 10:13:23 +00:00
Zvi Rackover	08a37f46e3	Add test-cases which demontrate pr30561 llvm-svn: 283436	2016-10-06 10:04:00 +00:00
Bjorn Pettersson	3961603921	[ValueTracking] Teach computeKnownBits and ComputeNumSignBits to look through ExtractElement. Summary: The computeKnownBits and ComputeNumSignBits functions in ValueTracking can now do a simple look-through of ExtractElement. Reviewers: majnemer, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D24955 llvm-svn: 283434	2016-10-06 09:56:21 +00:00
Sagar Thakur	f9292220dc	[EfficiencySanitizer] Adds shadow memory parameters for 40-bit virtual memory address. Adding 40-bit shadow memory parameters because MIPS64 uses 40-bit virtual memory addresses. Reviewed by rengolin. Differential: https://reviews.llvm.org/D23801 llvm-svn: 283433	2016-10-06 09:52:06 +00:00
Nuno Lopes	d3f5af0fe4	fix build on cygwin Cygwin has dlfcn.h, but no Dl_info llvm-svn: 283427	2016-10-06 09:32:16 +00:00
James Molloy	6215fad0e9	[ARM] Constant pool promotion - fix alignment calculation Global variables are GlobalValues, so they have explicit alignment. Querying DataLayout for the alignment was incorrect. Testcase added. llvm-svn: 283423	2016-10-06 07:56:00 +00:00
James Molloy	78561c4917	[ARM] Improve testcase for r283323 We can work around a shortcoming of FileCheck by using {{\[}} to match a square bracket before a [[ sequence. Thanks to Eli Friedman for the heads up! llvm-svn: 283422	2016-10-06 07:44:05 +00:00
Petr Hosek	e023d62e76	[Triple] Add triple for Fuchsia Fuchsia is a new operating system. Differential Revision: https://reviews.llvm.org/D25116 llvm-svn: 283419	2016-10-06 05:17:26 +00:00
Kostya Serebryany	936b1e774f	[libFuzzer] be more careful with memory usage, print peak rss in status lines llvm-svn: 283418	2016-10-06 05:14:00 +00:00
Konstantin Zhuravlyov	b4eb5d5049	[AMDGPU] Promote uniform i16 bitreverse intrinsic to i32 Differential Revision: https://reviews.llvm.org/D25121 llvm-svn: 283415	2016-10-06 02:20:46 +00:00
Kostya Serebryany	3b564e9765	[libFuzzer] when re-running for lsan, don't look at the coverage llvm-svn: 283411	2016-10-05 23:31:01 +00:00
Sanjay Patel	edc2baddf8	[DAG] add tests to show missing checks for SDNode FMF The AVX attribute is added to remove noise caused by SSE's destructive insts. llvm-svn: 283410	2016-10-05 23:20:32 +00:00
Kostya Serebryany	1c73f1bf27	[libFuzzer] refactoring to make -shrink=1 work for value profile, added a test. llvm-svn: 283409	2016-10-05 22:56:21 +00:00
Hal Finkel	5d0fbbbca1	Fix tests for Windows We need to match file names with both forward and backward slashes. llvm-svn: 283407	2016-10-05 22:48:13 +00:00
Richard Smith	061a0bf8fd	Add missing #include from r283039. Found by modules build. llvm-svn: 283405	2016-10-05 22:40:54 +00:00
Reid Kleckner	bb96df602e	[codeview] Truncate records to maximum record size near 64KB If we don't truncate, LLVM asserts when the label difference doesn't fit in a 16 bit field. This patch truncates two kinds of data: trailing null terminated names in symbol records, and inline line tables. The inline line table test that I have is too large (many MB), so I'm not checking it in. Hopefully fixes PR28264. llvm-svn: 283403	2016-10-05 22:36:07 +00:00
Hal Finkel	5aa0248059	[llvm-opt-report] Distinguish inlined contexts when optimizations differ How code is optimized sometimes, perhaps often, depends on the context into which it was inlined. This change allows llvm-opt-report to track the differences between the optimizations performed, or not, in different contexts, and when these differ, display those differences. For example, this code: $ cat /tmp/q.cpp void bar(); void foo(int n) { for (int i = 0; i < n; ++i) bar(); } void quack() { foo(4); } void quack2() { foo(4); } will now produce this report: < /home/hfinkel/src/llvm/test/tools/llvm-opt-report/Inputs/q.cpp 2 \| void bar(); 3 \| void foo(int n) { [[ > foo(int): 4 \| for (int i = 0; i < n; ++i) > quack(), quack2(): 4 U4 \| for (int i = 0; i < n; ++i) ]] 5 \| bar(); 6 \| } 7 \| 8 \| void quack() { 9 I \| foo(4); 10 \| } 11 \| 12 \| void quack2() { 13 I \| foo(4); 14 \| } 15 \| Note that the tool has demangled the function names, and grouped the reports associated with line 4. This shows that the loop on line 4 was unrolled by a factor of 4 when inlined into the functions quack() and quack2(), but not in the function foo(int) itself. llvm-svn: 283402	2016-10-05 22:25:33 +00:00
Adrian Prantl	b3510afcd1	Verifier: Reject any unknown named MD nodes in the llvm.dbg namespace. This came out of a discussion in https://reviews.llvm.org/D25285. There used to be various other llvm.dbg.* nodes, but we don't support upgrading them and we want to reserve the namespace for future uses. This also removes an entirely obsolete and bitrotted testcase for PR7662. Reapplies 283390 with a forgotten testcase. llvm-svn: 283400	2016-10-05 22:15:37 +00:00
Adrian Prantl	497f085475	Revert "Verifier: Reject any unknown named MD nodes in the llvm.dbg namespace." Forgot to add a testcase in r283390. llvm-svn: 283399	2016-10-05 22:15:34 +00:00
Hal Finkel	52031b7e65	Add an llvm-opt-report tool to generate basic source-annotated optimization summaries LLVM now has the ability to record information from optimization remarks in a machine-consumable YAML file for later analysis. This can be enabled in opt (see r282539), and D25225 adds a Clang flag to do the same. This patch adds llvm-opt-report, a tool to generate basic optimization "listing" files (annotated sources with information about what optimizations were performed) from one of these YAML inputs. D19678 proposed to add this capability directly to Clang, but this more-general YAML-based infrastructure was the direction we decided upon in that review thread. For this optimization report, I focused on making the output as succinct as possible while providing information on inlining and loop transformations. The goal here is that the source code should still be easily readable in the report. My primary inspiration here is the reports generated by Cray's tools (http://docs.cray.com/books/S-2496-4101/html-S-2496-4101/z1112823641oswald.html). These reports are highly regarded within the HPC community. Intel's compiler, for example, also has an optimization-report capability (https://software.intel.com/sites/default/files/managed/55/b1/new-compiler-optimization-reports.pdf). $ cat /tmp/v.c void bar(); void foo() { bar(); } void Test(int res, int c, int d, int p, int n) { int i; #pragma clang loop vectorize(assume_safety) for (i = 0; i < 1600; i++) { res[i] = (p[i] == 0) ? res[i] : res[i] + d[i]; } for (i = 0; i < 16; i++) { res[i] = (p[i] == 0) ? res[i] : res[i] + d[i]; } foo(); foo(); bar(); foo(); } D25225 adds -fsave-optimization-record (and -fsave-optimization-record=filename), and this would be used as follows: $ clang -O3 -o /tmp/v.o -c /tmp/v.c -fsave-optimization-record $ llvm-opt-report /tmp/v.yaml > /tmp/v.lst $ cat /tmp/v.lst < /tmp/v.c 2 \| void bar(); 3 \| void foo() { bar(); } 4 \| 5 \| void Test(int res, int c, int d, int p, int n) { 6 \| int i; 7 \| 8 \| #pragma clang loop vectorize(assume_safety) 9 V4,2 \| for (i = 0; i < 1600; i++) { 10 \| res[i] = (p[i] == 0) ? res[i] : res[i] + d[i]; 11 \| } 12 \| 13 U16 \| for (i = 0; i < 16; i++) { 14 \| res[i] = (p[i] == 0) ? res[i] : res[i] + d[i]; 15 \| } 16 \| 17 I \| foo(); 18 \| 19 \| foo(); bar(); foo(); I \| ^ I \| ^ 20 \| } Each source line gets a prefix giving the line number, and a few columns for important optimizations: inlining, loop unrolling and loop vectorization. An 'I' is printed next to a line where a function was inlined, a 'U' next to an unrolled loop, and 'V' next to a vectorized loop. These are printed on the relevant code line when that seems unambiguous, or on subsequent lines when multiple potential options exist (messages, both positive and negative, from the same optimization with different column numbers are taken to indicate potential ambiguity). When on subsequent lines, a '^' is output in the relevant column. Annotated source for all relevant input files are put into the listing file (each starting with '<' and then the file name). You can disable having the unrolling/vectorization factors appear by using the -s flag. Differential Revision: https://reviews.llvm.org/D25262 llvm-svn: 283398	2016-10-05 22:10:35 +00:00
Reid Kleckner	6f83e8b1d7	Remove extra semicolon llvm-svn: 283395	2016-10-05 21:46:56 +00:00
Reid Kleckner	b0311b290e	Fix the build with MSVC 2013, still cannot default move ctors yet Ten days. llvm-svn: 283394	2016-10-05 21:44:46 +00:00
Sanjay Patel	5839858584	[DAG] change test to use 'unsafe' function attribute instead of global setting But we have node-level FMF, so the next step is to fix this at the instruction/node-level. llvm-svn: 283393	2016-10-05 21:43:50 +00:00
David Callahan	c1051ab26e	Modify df_iterator to support post-order actions Summary: This makes a change to the state used to maintain visited information for depth first iterator. We know assume a method "completed(...)" which is called after all children of a node have been visited. In all existing cases, this method does nothing so this patch has no functional changes. It will however allow a client to distinguish back from cross edges in a DFS tree. Reviewers: nadav, mehdi_amini, dberlin Subscribers: MatzeB, mzolotukhin, twoh, freik, llvm-commits Differential Revision: https://reviews.llvm.org/D25191 llvm-svn: 283391	2016-10-05 21:36:16 +00:00
Adrian Prantl	71bba7253e	Verifier: Reject any unknown named MD nodes in the llvm.dbg namespace. This came out of a discussion in https://reviews.llvm.org/D25285. There used to be various other llvm.dbg.* nodes, but we don't support upgrading them and we want to reserve the namespace for future uses. This also removes an entirely obsolete and bitrotted testcase for PR7662. llvm-svn: 283390	2016-10-05 21:31:19 +00:00
Dan Gohman	5a68ec7f09	[WebAssembly] Add binary-encoding opcode values to instruction descriptions. llvm-svn: 283389	2016-10-05 21:24:08 +00:00
Reid Kleckner	2b3e6428e5	[codeview] Translate bitpiece metadata to DEFRANGE_SUBFIELD* records This allows LLVM to describe locations of aggregate variables that have been split by SROA. Fixes PR29141 Reviewers: amccarth, majnemer Differential Revision: https://reviews.llvm.org/D25253 llvm-svn: 283388	2016-10-05 21:21:33 +00:00
Lang Hames	a5e873e2a1	[Object] Fix a crash in Archive::child_iterator's default constructor. To be default constructible, Archive::child_iterator needs to be able to construct an Archive::Child with a null parent, however Archive::Child's constructor always dereferenced its Parent argument to compute the remaining archive size. This commit fixes Archive::Child's constructor to only do the size calculation when the parent is non-null. llvm-svn: 283387	2016-10-05 21:20:00 +00:00

1 2 3 4 5 ...

139137 Commits