llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-12-02 18:58:15 +00:00

Author	SHA1	Message	Date
Jez Ng	b63919e180	[lld-macho] Require -arch and -platform_version to always be specified We previously defaulted to x86_64 and an unknown platform, which was fine when we only supported one arch and did no platform checks, but that will no longer be true going ahead. Therefore, we should require those flags to be specified whenever the linker is invoked. Note that LLD-ELF and ld64 both infer the arch from their input object files, but the usefulness of that is questionable since clang will always specify these flags, and most of the time `lld` will be invoked via clang. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D97799	2021-03-03 15:52:10 -05:00
Greg McGary	4af1522a85	[lld-macho] Rework length check when opening input files This reverts diff D97610 (commit `0223ab035c`) and adds a one-line fix to verify that a `MemoryBufferRef` has sufficient length before reading a 4-byte magic number. Differential Revision: https://reviews.llvm.org/D97757	2021-03-02 13:00:57 -08:00
Vy Nguyen	9a2e2de15f	[lld-macho] Change loadReexport to handle the case where a TAPI re-exports to reference documents nested within other TBD. Previously, it was deliberately implemented to not handle this case, but as it has turned out, we need this feature. The concrete use case is `System/Library/Frameworks/Cocoa.framework/Versions/A/Cocoa` reexports /System/Library/Frameworks/AppKit.framework/Versions/C/AppKit , which then reexports /System/Library/PrivateFrameworks/UIFoundation.framework/Versions/A/UIFoundation . The current implementation uses a global currentTopLevelTapi, which is not reset until it finishes loading the whole tree. This is a problem because if the top-level is set to Cocoa, then when we get to UIFoundation, it will try to find UIFoundation in the current top level, which is Cocoa and will not find it. The right thing should be: - When loading a library from a TBD file, re-exports need to be looked up in the auxiliary documents within the same TBD. - When loading from an actual dylib, no additional TBD documents need to be examined. - In no case does a re-export mentioned in one TBD file need to be looked up in an auxiliary document from a different TBD file. Differential Revision: https://reviews.llvm.org/D97438	2021-03-02 12:14:31 -05:00
Peter Smith	e35929e026	[LLD][ELF][ARM] Refactor inBranchRange to use addend for PC Bias In AArch32 ARM, the PC reads two instructions ahead of the currently executing instruction. This evaluates to 8 in ARM state and 4 in Thumb state. Branch instructions on AArch32 compensate for this by subtracting the PC bias from the addend. For a branch to symbol this will result in an addend of -8 in ARM state and -4 in Thumb state. The existing ARM Target::inBranchRange function accounted for this implicit addend within the function meaning that if the addend were to be taken into account by the caller then it would be double counted. This complicates the interface for all Targets as callers wanting to account for addends had to account for the ARM PC-bias. In certain situations such as: https://github.com/ClangBuiltLinux/linux/issues/1305 the PC-bias compensation code didn't match up. In particular normalizeExistingThunk() didn't put the PC-bias back in as Arm thunks did not store the addend. The simplest fix for the problem is to add the PC bias in normalizeExistingThunk when restoring the addend. However I think it is worth refactoring the Arm inBranchRange implementation so that fewer calls to getPCBias are needed for other Targets. I wasn't able to remove getPCBias completely but hopefully the Relocations.cpp code is simpler now. In principle a test could be written to replicate the linux kernel build failure but I wasn't able to reproduce with a small example that I could build up from scratch. Fixes https://github.com/ClangBuiltLinux/linux/issues/1305 Differential Revision: https://reviews.llvm.org/D97550	2021-03-02 11:02:33 +00:00
Sam Clegg	74ca5c2c1e	[lld][WebAssembly] Convert lld/test/wasm/data-layout.ll to asm. NFC. Part of ongoing conversion work started in https://reviews.llvm.org/D80361. Differential Revision: https://reviews.llvm.org/D97651	2021-03-01 16:14:36 -08:00
Nico Weber	0658fc654c	[lld/mac] Implement the missing bits of -undefined This adds support for `-undefined dynamic_lookup`, and for `-undefined warning` and `-undefined suppress` with `-flat_namespace`. We just replace undefined symbols with a DynamicLookup when we hit them. With this, `check-llvm` passes when using ld64.lld.darwinnew as host linker. Differential Revision: https://reviews.llvm.org/D97642	2021-03-01 15:30:53 -05:00
Nico Weber	8174f33dc9	[lld/mac] Add support for -flat_namespace -flat_namespace makes lld emit binaries that use name lookup that's more in line with other POSIX systems: Instead of looking up symbols as (dylib,name) pairs by dyld, they're instead looked up just by name. -flat_namespace has three effects: 1. MH_TWOLEVEL and MH_NOUNDEFS are no longer set in the Mach-O header 2. All symbols use BIND_SPECIAL_DYLIB_FLAT_LOOKUP as ordinal 3. When a dylib is added to the link, its dependent dylibs are also added, so that lld can verify that no undefined symbols remain at the end of a link with -flat_namespace. These transitive dylibs are added for symbol resolution, but they are not emitted in LC_LOAD_COMMANDs. -undefined with -flat_namespace still isn't implemented. Before this change, it was impossible to hit that combination because -flat_namespace caused a diagnostic. Now that it no longer does, emit a dedicated temporary diagnostic when both flags are used. Differential Revision: https://reviews.llvm.org/D97641	2021-03-01 15:25:10 -05:00
Nico Weber	3e6b6cee00	[lld/mac] Use libSystem.dylib instead of libSystem.B.dylib in tests For -flat_namespace, lld needs to load dylibs in LC_LOAD_DYLIB. The current setup meant that libSystem.dylib would cause a LC_LOAD_DYLIB with libSystem.B.dylib, but that didn't exist in our libsysroot for tests. So just drop the .B. See https://reviews.llvm.org/D97641#2595237 and https://reviews.llvm.org/D97641#2595270	2021-03-01 15:23:28 -05:00
Nico Weber	2e803ec746	[lld/mac on non-mac] fix test/MachO/search-paths.test after `ab45289d2e`	2021-03-01 15:21:24 -05:00
Nico Weber	ab45289d2e	[lld/mac] Make -v print version and search paths in addition to linking, not instead of linking This matches ld64's behavior. Differential Revision: https://reviews.llvm.org/D97718	2021-03-01 15:09:46 -05:00
Jez Ng	415c0cd698	[lld-macho] Switch default to new Darwin backend The new Darwin backend for LLD is now able to link reasonably large real-world programs on x86_64. For instance, we have achieved self-hosting for the X86_64 target, where all LLD tests pass when building lld with itself on macOS. As such, we would like to make it the default back-end. The new port is now named `ld64.lld`, and the old port remains accessible as `ld64.lld.darwinold` This [announcement email][1] has some context. (But note that, unlike what the email says, we are no longer doing this as part of the LLVM 12 branch cut -- instead we will go into LLVM 13.) Numerous mechanical test changes were required to make this change; in the interest of creating something that's reviewable on Phabricator, I've split out the boring changes into a separate diff (D95905). I plan to merge its contents with those in this diff before landing. (@gkm made the original draft of this diff, and he has agreed to let me take over.) [1]: https://lists.llvm.org/pipermail/llvm-dev/2021-January/147665.html Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D95204	2021-03-01 12:30:10 -05:00
Andy Wingo	2632ba6a35	[WebAssembly] call_indirect issues table number relocs If the reference-types feature is enabled, call_indirect will explicitly reference its corresponding function table via TABLE_NUMBER relocations against a table symbol. Also, as before, address-taken functions can also cause the function table to be created, only with reference-types they additionally cause a symbol table entry to be emitted. Differential Revision: https://reviews.llvm.org/D90948	2021-03-01 16:49:00 +01:00
Sam Clegg	d49270b087	[lld][ELF] Removing redundant cast. NFC. Also a couple of minor cleanups in merge-string.s: - fix inconsistent use of tabs - use `.p2align` rather than `.align` since `.p2align` works the same on all platforms (the meaning of align seems to differ between platforms according to `AlignmentIsInBytes`). I noticed these potential cleanups while porting SHF_STRINGS support to wasm-ld. Differential Revision: https://reviews.llvm.org/D97647	2021-02-28 16:53:41 -08:00
Greg McGary	0223ab035c	[lld-macho] check minimum header length when opening linkable input files Bifurcate the `readFile()` API into ... * `readRawFile()` which performs no checks, and * `readLinkableFile()` which enforces minimum length of 20 bytes, same as ld64 There are no new tests because tweaks to existing tests are sufficient. Differential Revision: https://reviews.llvm.org/D97610	2021-02-27 14:41:40 -08:00
Greg McGary	6f9dd843db	[lld-macho] Implement options -rename_section -rename_segment Implement command-line options to rename output sections & segments. Differential Revision: https://reviews.llvm.org/D97600	2021-02-27 11:44:12 -08:00
Jez Ng	82b3da6f6f	[lld-macho] Extract embedded addends for arm64 UNSIGNED relocations On arm64, UNSIGNED relocs are the only ones that use embedded addends instead of the ADDEND relocation. Also ensure that the addend works when UNSIGNED is part of a SUBTRACTOR pair. Reviewed By: #lld-macho, alexshap Differential Revision: https://reviews.llvm.org/D97105	2021-02-27 12:31:34 -05:00
Jez Ng	8da88d4b60	[lld-macho] Add test for a variety of arm64 relocations Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D97104	2021-02-27 12:31:34 -05:00
Jez Ng	541390131e	[lld-macho] Don't emit rebase opcodes for subtractor minuend relocs Also add a few asserts to verify that we are indeed handling an UNSIGNED relocation as the minuend. I haven't made it an actual user-facing error since I don't think llvm-mc is capable of generating SUBTRACTOR relocations without an associated UNSIGNED. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D97103	2021-02-27 12:31:34 -05:00
Jez Ng	cc5c03e109	[lld-macho] Properly test subtractor relocations & fix their attributes `llvm-mc` doesn't generate any relocations for subtractions between local symbols -- they must be global -- so the previous test wasn't actually testing any relocation logic. I've fixed that and extended the test to cover r_length=3 relocations as well as both x86_64 and arm64. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D97057	2021-02-27 12:31:34 -05:00
Nico Weber	cafb6cd10c	[lld/mac] Add some support for dynamic lookup symbols, and implement -U Dynamic lookup symbols are symbols that work like dynamic symbols in ELF: They're not bound to a dylib like normal Mach-O twolevel lookup symbols, but they live in a global pool and dyld resolves them against exported symbols from all loaded dylibs. This adds support for dynamic lookup symbols to lld/mac. They are represented as DylibSymbols with file set to nullptr. This also uses this support to implement the -U flag, which makes a specific symbol that's undefined at the end of the link a dynamic lookup symbol. For -U, it'd be sufficient to just do a pass over remaining undefined symbols at the end of the link and to replace them with dynamic lookup symbols then. But I'd like to use this code to implement flat_namespace too, and that will require real support for resolving dynamic lookup symbols in SymbolTable. So this patch adds this now already. While writing tests for this, I noticed that we didn't set N_WEAK_DEF in the symbol table for DylibSymbols, so this fixes that too. Differential Revision: https://reviews.llvm.org/D97521	2021-02-26 16:50:53 -05:00
Fangrui Song	4bbcd63eea	[ELF] Add -z start-stop-gc to let __start_/__stop_ not retain C identifier name sections For one metadata section usage, each text section references a metadata section. The metadata sections have a C identifier name to allow the runtime to collect them via `__start_/__stop_` symbols. Since `__start_`/`__stop_` references are always present from live sections, the C identifier name sections appear like GC roots, which means they cannot be discarded by `ld --gc-sections`. To make such sections GCable, either SHF_LINK_ORDER or a section group is needed. SHF_LINK_ORDER is not suitable because the references can be inlined into other functions (See D97430: Function A (in the section .text.A) references its `__sancov_guard` section. Function B inlines A (so now .text.B references `__sancov_guard` - this is invalid with the semantics of SHF_LINK_ORDER). In the linking stage, if `.text.A` gets discarded, and `__sancov_guard` is retained via the reference from `.text.B`, the output will be invalid because `__sancov_guard` references the discarded `.text.A`. LLD errors "sh_link points to discarded section". ) A section group has size overhead, and is cumbersome when there is just one metadata section. Add `-z start-stop-gc` to drop the "__start_/__stop_ references retain non-SHF_LINK_ORDER non-SHF_GROUP C identifier name sections" rule. We reserve the right to switch the default in the future. Reviewed By: phosek, jrtc27 Differential Revision: https://reviews.llvm.org/D96914	2021-02-25 15:46:37 -08:00
Jez Ng	84579fc24f	[lld-macho] Basic support for linkage and visibility attributes in LTO When parsing bitcode, convert LTO Symbols to LLD Symbols in order to perform resolution. The "winning" symbol will then be marked as Prevailing at LTO compilation time. This is similar to what the other LLD ports do. This change allows us to handle `linkonce` symbols correctly, and to deal with duplicate bitcode symbols gracefully. Previously, both scenarios would result in an assertion failure inside the LTO code, complaining that multiple Prevailing definitions are not allowed. While at it, I also added basic logic around visibility. We don't do anything useful with it yet, but we do check that its value is valid. LLD-ELF appears to use it only to set FinalDefinitionInLinkageUnit for LTO, which I think is just a performance optimization. From my local experimentation, the linker itself doesn't seem to do anything differently when encountering linkonce / linkonce_odr / weak / weak_odr. So I've only written a test for one of them. LLD-ELF has more, but they seem to mostly be testing the intermediate bitcode output of their LTO backend...? I'm far from an expert here though, so I might very well be missing things. Reviewed By: #lld-macho, MaskRay, smeenai Differential Revision: https://reviews.llvm.org/D94342	2021-02-25 13:27:40 -05:00
Jez Ng	9ced8b3b61	[lld-macho] Add REQUIRES to incompatible-arch-tapi test	2021-02-24 09:42:52 -05:00
Douglas Yung	691c3d5948	Update REQUIRES line in 4 tests that attempt to use arm64 which should be aarch64 not arm.	2021-02-24 02:26:19 -08:00
Martin Storsjö	075539ddf6	[LLD] [COFF] Allow invoking lib.exe mode via -lib in addition to /lib Remove a stray -lib argument in guardcf-lto.ll; llvm-lib doesn't support generating import libs from a def file unlike lib.exe. Previously this worked because the -lib argument was ignored (printing only a warning). Differential Revision: https://reviews.llvm.org/D96699	2021-02-24 11:16:12 +02:00
Douglas Yung	91e31ebbbe	Add REQUIRES aarch64 to test incompatible-arch.s since it fails if that target is not built. This should fix the PS4 linux buildbot failure: http://lab.llvm.org:8011/#/builders/139/builds/63	2021-02-24 00:02:21 -08:00
Jez Ng	4a5e111aea	[lld-macho] Better deduplication of personality pointers {D95809} introduced a mechanism for synthetic symbol creation of personality pointers. When multiple section relocations referred to the same personality pointer, it would deduplicate them. However, it neglected to consider that we could have symbol relocations that also refer to the same personality pointer. This diff fixes it. In practice, this mix of relocations arises when there is a statically-linked personality routine that is referenced from multiple object files. Within the same object file, it will be referred to via section relocations, but (obviously) other object files will refer to it via symbol relocations. Failing to deduplicate these references resulted in us going over the 3-personality-pointer limit when linking some larger applications. Fixes llvm.org/PR48389. Reviewed By: #lld-macho, thakis, alexshap Differential Revision: https://reviews.llvm.org/D97245	2021-02-23 22:02:38 -05:00
Jez Ng	4752cdc9a2	[lld-macho] Check for arch compatibility when loading ObjFiles and TBDs The silent failures had confused me a few times. I haven't added a similar check for platform yet as we don't yet have logic to infer the platform automatically, and so adding that check would require updating dozens of test files. Reviewed By: #lld-macho, thakis, alexshap Differential Revision: https://reviews.llvm.org/D97209	2021-02-23 22:02:38 -05:00
Jez Ng	5e851733c5	[lld-macho] Fix semantics & add tests for ARM64 GOT/TLV relocs I've adjusted the RelocAttrBits to better fit the semantics of the relocations. In particular: 1. _UNSIGNED relocations are no longer marked with the `TLV` bit, even though they can occur within TLV sections. Instead the `TLV` bit is reserved for relocations that can reference thread-local symbols, and _UNSIGNED relocations have their own `UNSIGNED` bit. The previous implementation caused TLV and regular UNSIGNED semantics to be conflated, resulting in rebase opcodes being incorrectly emitted for TLV relocations. 2. I've added a new `POINTER` bit to denote non-relaxable GOT relocations. This distinction isn't important on x86 -- the GOT relocations there are either relaxable or non-relaxable loads -- but arm64 has `GOT_LOAD_PAGE21` which loads the page that the referent symbol is in (regardless of whether the symbol ends up in the GOT). This relocation must reference a GOT symbol (so must have the `GOT` bit set) but isn't itself relaxable (so must not have the `LOAD` bit). The `POINTER` bit is used for relocations that must reference a GOT slot. 3. A similar situation occurs for TLV relocations. 4. ld64 supports both a pcrel and an absolute version of ARM64_RELOC_POINTER_TO_GOT. But the semantics of the absolute version are pretty weird -- it results in the value of the GOT slot being written, rather than the address. (That means a reference to a dynamically-bound slot will result in zeroes being written.) The programs I've tried linking don't use this form of the relocation, so I've dropped our partial support for it by removing the relevant RelocAttrBits. Reviewed By: alexshap Differential Revision: https://reviews.llvm.org/D97031	2021-02-23 22:02:38 -05:00
Jez Ng	e5d780e049	[lld-macho] Use full input file name in invalid relocation error message Just something I noticed while debugging arm relocations... Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D97078	2021-02-23 22:02:38 -05:00
Andy Wingo	7dc98adbb0	Revert "[WebAssembly] call_indirect issues table number relocs" This reverts commit `861dbe1a02`. It broke emscripten -- see https://reviews.llvm.org/D90948#2578843.	2021-02-23 11:48:08 +01:00
Nico Weber	e6d1f261a5	[lld-link] Add /reproduce: support for several flags /reproduce: now works correctly with: - /call-graph-ordering-file: - /def: - /natvis: - /order: - /pdbstream: I went through all instances of MemoryBuffer::getFile() and made sure everything that didn't already do so called takeBuffer(). For natvis, that wasn't possible since DebugInfo/PDB wants to take ownership of the natvis buffer. For that case, I'm manually adding the tar file entry. /natvis: and /pdbstream: are slightly awkward, since createResponseFile() always adds these flags to the response file but createPDB() (which ultimately adds the files referenced by the flags) is only called if /debug is also passed. So when using /natvis: without /debug with /reproduce:, lld won't warn, but when linking using the response file from the archive, it won't find the natvis file since it's not in the tar. This isn't a new issue though, and after this patch things at least work with using /natvis: _with_ debug with /reproduce:. (Same for /pdbstream:) Differential Revision: https://reviews.llvm.org/D97212	2021-02-22 16:52:49 -05:00
Jez Ng	c32aee5a5e	[lld-macho] Try to fix cross-platform test from D96565	2021-02-22 14:47:45 -05:00
Vy Nguyen	5a856f5b44	Reland [lld-macho]Implement bundle_loader Reland `1a0afcf518` https://reviews.llvm.org/D95913 New change: fix UB bug caused by copying empty path/name. (since the executable does not have a name)	2021-02-22 14:05:12 -05:00
Jez Ng	5bfdbdeb40	[lld-macho] Fix cpuSubtype for non-x86_64 archs dyld on iOS will complain if the LIB64 bit is set. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D96565	2021-02-22 12:13:17 -05:00
Andy Wingo	861dbe1a02	[WebAssembly] call_indirect issues table number relocs If the reference-types feature is enabled, call_indirect will explicitly reference its corresponding function table via `TABLE_NUMBER` relocations against a table symbol. Also, as before, address-taken functions can also cause the function table to be created, only with reference-types they additionally cause a symbol table entry to be emitted. We abuse the used-in-reloc flag on symbols to indicate which tables should end up in the symbol table. We do this because unfortunately older wasm-ld will carp if it see a table symbol. Differential Revision: https://reviews.llvm.org/D90948	2021-02-22 10:13:36 +01:00
Petr Hosek	1a3f3a3fa1	[lld][ELF] __start_/__stop_ refs don't retain C-ident named group sections The special root semantics for identifier-named sections is meant specifically for the metadata sections. In the context of group semantics, where group members are always retained or discarded as a unit, it's natural not to have this semantics apply to a section in a group, otherwise we would never discard the group defeating the purpose of using the group in the first place. This change modifies the GC behavior so that __start_/__stop_ references don't retain C identifier named sections in section groups which allows for these groups to be collected. This matches the behavior of BFD ld. The only kind of existing case that might break is interdependent metadata sections that are all in a group together, but that group doesn't contain any other sections referenced by anything except implicit inclusion in a `__start_` and/or `__stop_`-referenced identifier-named section, but such cases should be unlikely. Differential Revision: https://reviews.llvm.org/D96753	2021-02-20 22:22:05 -08:00
Nico Weber	28d9953af9	[lld/mac] reject -undefined warning and -undefined suppress with -twolevel_namespace See discussion on https://reviews.llvm.org/D93263 -flat_namespace isn't implemented yet, and neither is -undefined dynamic, so this makes -undefined pretty pointless in lld/MachO for now. But once we implement -flat_namespace (which we need to do anyways to get check-llvm to pass with lld as host linker), the code's already there. Follow-up to https://reviews.llvm.org/D93263#2491865 Differential Revision: https://reviews.llvm.org/D96963	2021-02-20 13:35:22 -05:00
Vitaly Buka	c17547df44	Revert "Implement -bundle_loader" D95913 passes null pointer into memcpy This reverts commit `1a0afcf518`.	2021-02-19 17:40:07 -08:00
Nikita Popov	ac065b7a37	[LLD] Fix tests after D96993 We now need mustprogress to eliminate these calls. The code doesn't really make sense, but that's not the point of the test...	2021-02-19 13:08:17 +01:00
Vy Nguyen	1a0afcf518	Implement -bundle_loader Differential Revision: https://reviews.llvm.org/D95913 Usage: -bundle_loader <executable> This option specifies the executable that will load the build output file being linked. When building a bundle, users can use -bundle_loader to specify an executable that contains symbols referenced, but not implemented, in the bundle.	2021-02-18 16:11:37 -05:00
Andy Wingo	48219d06b1	[lld][WebAssembly] Fix resolveIndirectFunctionTable for relocatable output For relocatable output that needs the indirect function table, identify the well-known function table. This allows us to properly fix the limits on the imported table, and in a followup will allow the element section to reference the indirect function table even if it's not assigned to table number 0. Adapt tests for import reordering. Differential Revision: https://reviews.llvm.org/D96770	2021-02-18 09:33:54 +01:00
Andy Wingo	f48923e884	[WebAssembly][lld] --importTable flag only imports table if needed Before, --importTable forced the creation of an indirect function table, whether it was needed or not. Now it only imports a table if needed. Differential Revision: https://reviews.llvm.org/D96872	2021-02-18 09:16:29 +01:00
Fangrui Song	018a484cd2	[llvm-objdump] Map STT_TLS to ST_Other (previously ST_Data) ST_Data is used to model BFD `BFD_OBJECT`. A STT_TLS symbol does not have the `BFD_OBJECT` flag in BFD. This makes sense because a STT_TLS symbol is like in a different address space, normal data/object properties do not apply on them. With this change, a STT_TLS symbol will not be displayed as 'O'. This new behavior matches objdump. Differential Revision: https://reviews.llvm.org/D96735	2021-02-17 23:17:20 -08:00
Teresa Johnson	3c4c205060	[WPD][lld] Test handling of vtable definition from shared libraries Adds a lld test for a case that the handling added for dynamically exported symbols in `1487747e99` already fixes. Because isExportDynamic returns true when the symbol is SharedKind with default visibility, it will be treated as dynamically exported and block devirtualization when the definition of a vtable comes from a shared library. This is desirable as it is dangerous to devirtualize in that case, since there could be hidden overrides in the shared library. Typically that happens when the shared library header contains available externally definitions, which applications can override. An example is std::error_category, which is overridden in LLVM and causing failures after a self build with WPD enabled, because libstdc++ contains hidden overrides of the virtual base class methods. The regular LTO case in the new test already worked, but there are 2 fixes in this patch needed for the index-only case and the hybrid LTO case. For the index-only case, WPD should not simply ignore available externally vtables. A follow on fix will be made to clang to emit type metadata for those vtables, which the new test is modeling. For the hybrid case, we need to ensure when the module is split that any llvm.*used globals are cloned to the regular LTO split module so available externally vtable definitions are not prematurely deleted. Another follow on fix will add the equivalent gold test, which requires a small fix to the plugin to treat symbols in dynamic libraries the same way lld already is. Differential Revision: https://reviews.llvm.org/D96721	2021-02-17 12:49:24 -08:00
Petr Hosek	bfa4235e6e	[lld][ELF] Support for zero flag section groups This change introduces support for zero flag ELF section groups to lld. lld already supports COMDAT sections, which in ELF are a special type of ELF section groups. These are generally useful to enable linker GC where you want a group of sections to always travel together, that is to be either retained or discarded as a whole, but without the COMDAT semantics. Other ELF linkers already support zero flag ELF section groups and this change helps us reach feature parity. Differential Revision: https://reviews.llvm.org/D96636	2021-02-16 14:33:09 -08:00
Teresa Johnson	b3bafbcb18	[lld] Reorder cases in test to match comments (NFC) The test added in `1487747e99` had a few cases that were out of order compared to the comments. Reordered to match.	2021-02-13 16:32:39 -08:00
Fangrui Song	962b29d716	ELFObjectWriter: Don't sort non-local symbols As we don't sort local symbols, don't sort non-local symbols. This makes non-local symbols appear in their register order, which matches GNU as. The register order is nice in that you can write tests with interleaved CHECK prefixes, e.g. ``` // CHECK: something about foo .globl foo foo: // CHECK: something about bar .globl bar bar: ``` With the lexicographical order, the user needs to place the lexicographically smallest symbol first or keep CHECK prefixes in one place.	2021-02-13 10:32:27 -08:00
Fangrui Song	1f69355802	[test] Make ELF tests amenable to the order of non-local symbols	2021-02-12 21:00:42 -08:00
Andy Wingo	4fc2557308	[WebAssembly][lld] Preassign table number 0 to indirect function table for MVP inputs MVP object files may import at most one table, and if they do, it must be assigned table number zero in the output, as the references to that table are not relocatable. Ensure that this is the case, even if some inputs define other tables. Differential Revision: https://reviews.llvm.org/D96001	2021-02-12 20:20:19 +01:00
Sam Clegg	e1617d23ff	Revert "[lld][WebAssembly] Fix for weak undefined functions in -pie mode" This reverts commit `ac2be2b6a3`. This causes a whole bunch of emscripten tests to fail due to newly undefined symbols appearing. Will investigate and look into re-landing later.	2021-02-12 09:04:18 -08:00
Sam Clegg	ac2be2b6a3	[lld][WebAssembly] Fix for weak undefined functions in -pie mode This fixes two somewhat related issues. Firstly we were never generating imports for weak functions (even with the `import-functions` policy for undefined symbols). Adding a direct call to foo in the `weak-undefined-pic.s` exposed a crash in the linker which this change fixes. Secondly we were failing to call `handleWeakUndefines` for the `-pie` case which is PIC but doesn't set the undefined symbol policy to `import-functions`. With this change `-pie` binaries will by default call `handleWeakUndefines` which generates the undefined stub handlers for any weakly undefined symbols. Fixes: https://github.com/emscripten-core/emscripten/issues/13337 Differential Revision: https://reviews.llvm.org/D95914	2021-02-11 17:16:03 -08:00
Hongtao Yu	0f848a24e1	Undo test changes introduced by D96193. Summary: The test doesn't work on Windows but there seems to be no good way to disable the test for Windows only so I'm undoing the test changes.	2021-02-11 13:29:41 -08:00
Sam Clegg	70f3c6e9e6	[lld][WebAssembly] Delay the merging of data section when dynamic linking With dynamic linking we have the current limitation that there can be only a single active data segment (since we use __memory_base as the load address and we can't do arithmetic in constant expressions). This change delays the merging of active segments until a little later in the linking process which means that the grouping of data by section, and the magic __start/__end symbols work as expected under dynamic linking. Differential Revision: https://reviews.llvm.org/D96453	2021-02-11 10:54:15 -08:00
Fangrui Song	0557b1bdec	[ELF] Resolve defined symbols before undefined symbols When parsing an object file, LLD interleaves undefined symbol resolution (which may recursively fetch other lazy objects) with defined symbol resolution. This may lead to surprising results, e.g. if an object file defines currently undefined symbols and references another lazy symbol, we may interleave defined symbols with the lazy fetch, potentially leading to the defined symbols resolving to different files. As an example, if both `a.a(a.o)` and `a.a(b.o)` define `foo` (not in COMDAT group, or in different COMDAT groups) and `__profd_foo` (in COMDAT group `__profd_foo`), LLD may resolve `foo` to `a.a(a.o)` and `__profd_foo` to `a.a(b.o)`, i.e. different files. ``` parse ArchiveFile a.a entry fetches a.a(a.o) parse ObjectFile a.o define entry define foo reference b b fetches a.a(b.o) parse ObjectFile b.o define prevailing __profd_foo define (ignored) non-prevailing __profd_foo ``` Assume a set of interconnected symbols are defined all or none in several lazy objects. Arguably, making them resolve to the same file is preferable to making them resolve to different files (some are lazy objects). The main argument favoring the new behavior is stability. The relative order between a defined symbol and an undefined symbol does not change the symbol resolution behavior. Only the relative order between two undefined symbols can affect fetching behaviors. --- The real world case is reduced from a Fuchsia PGO usage: `a.a(a.o)` has a constructor within COMDAT group C5 while `a.a(b.o)` has a constructor within COMDAT group C2. Because they use different group signatures, they are not de-duplicated. It is not entirely clear whether Clang's behavior is conforming. LLD selects the PGO counter section (`__profd_`) from `a.a(b.o)` and the constructor section from `a.a(a.o)`. The `__profd_` is a SHF_LINK_ORDER section linking to its own non-prevailing constructor section, so LLD errors `sh_link points to discarded section`. This patch fixes the error. Differential Revision: https://reviews.llvm.org/D95985	2021-02-11 09:41:46 -08:00
Stella Stamenova	ed98676fa4	Support multi-configuration generators correctly in several config files Multi-configuration generators (such as Visual Studio and Xcode) allow the specification of a build flavor at build time instead of config time, so the lit configuration files need to support that - and they do for the most part. There are several places that had one of two issues (or both!): 1) Paths had %(build_mode)s set up, but then not configured, resulting in values that would not work correctly e.g. D:/llvm-build/%(build_mode)s/bin/dsymutil.exe 2) Paths did not have %(build_mode)s set up, but instead contained $(Configuration) (which is the value for Visual Studio at configuration time, for Xcode they would have had the equivalent) e.g. "D:/llvm-build/$(Configuration)/lib". This seems to indicate that we still have a lot of fragility in the configurations, but also that a number of these paths are never used (at least on Windows) since the errors appear to have been there a while. This patch fixes the configurations and it has been tested with Ninja and Visual Studio to generate the correct paths. We should consider removing some of these settings altogether. Reviewed By: JDevlieghere, mehdi_amini Differential Revision: https://reviews.llvm.org/D96427	2021-02-11 09:32:20 -08:00
Thomas Lively	067f005500	[lld][WebAssembly] Fix segfault in map file support The code previously assumed that `getChunk` would return a non-null pointer for every symbol, but in fact it only returns non-null pointers for DefinedFunction and DefinedData symbols. This patch fixes the segfault by checking whether `getChunk` returns a null for each symbol and skipping the mapping output for any symbols for which it does. Differential Revision: https://reviews.llvm.org/D88369	2021-02-09 14:42:43 -08:00
Sam Clegg	34d033ca12	[lld][WebAssembly] Allow --export of optional start/stop symbols This moves the error checking until after all optional symbols (including the section start/end symbols) have been created. Differential Revision: https://reviews.llvm.org/D96318	2021-02-09 13:14:52 -08:00
Nico Weber	e0b8604e5d	[lld/mac] Implement -u flag Since we emit diagnostics for undefineds in Writer::scanRelocations() and symbols referenced by -u flags aren't referenced by any relocations, this needs some manual code (similar to the entry point). Differential Revision: https://reviews.llvm.org/D94371	2021-02-09 08:23:06 -05:00
Fangrui Song	d82679d805	[ELF] Drop Android specific workaround -m aarch64_elf64_le_vec `extern const bfd_target aarch64_elf64_le_vec;` is a variable in BFD. It was somehow misused as an emulation by Android. ``` % aarch64-linux-gnu-ld -m aarch64_elf64_le_vec a.o aarch64-linux-gnu-ld: unrecognised emulation mode: aarch64_elf64_le_vec Supported emulations: aarch64linux aarch64elf aarch64elf32 aarch64elf32b aarch64elfb armelf armelfb aarch64linuxb aarch64linux32 aarch64linux32b armelfb_linux_eabi armelf_linux_eabi ``` Acked by Stephen Hines, who removed the flag from Android a while back.	2021-02-09 00:43:10 -08:00
Hongtao Yu	5b8db127a3	[ELF] Rewriting the path of sample profile file for --reproduce response.txt Rewriting the path of the sample profile file in response.txt to be relative to the repro tar. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D96193	2021-02-09 00:00:16 -08:00
Greg McGary	87104faac4	[lld-macho] Add ARM64 target arch This is an initial base commit for ARM64 target arch support. I don't represent that it is complete or bug-free, but wish to put it out for review now that some basic things like branch target & load/store address relocs are working. I can add more tests to this base commit, or add them in follow-up commits. It is not entirely clear whether I should use the "ARM64" (Apple) or "AArch64" (non-Apple) naming convention. Guidance is appreciated. Differential Revision: https://reviews.llvm.org/D88629	2021-02-08 18:14:07 -07:00
Fangrui Song	830ead58fe	[test] Fix unused check prefixes	2021-02-08 13:44:30 -08:00
Jez Ng	5112035751	[lld-macho] Emit LSDA info in compact unwind The LSDA pointers are encoded as offsets from the image base, and arranged in one big contiguous array. Each second-level page records the offset within that LSDA array which corresponds to the LSDA for its first CU entry. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D95810	2021-02-08 13:48:00 -05:00
Jez Ng	525bfa10ec	[lld-macho] Emit personalities in compact unwind Note that there is a triple indirection involved with personalities and compact unwind: 1. Two bits of each CU encoding are used as an offset into the personality array. 2. Each entry of the personality array is an offset from the image base. The resulting address (after adding the image base) should point within the GOT. 3. The corresponding GOT entry contains the actual pointer to the personality function. To further complicate things, when the personality function is in the object file (as opposed to a dylib), its references in `__compact_unwind` may refer to it via a section + offset relocation instead of a symbol relocation. Since our GOT implementation can only create entries for symbols, we have to create a synthetic symbol at the given section offset. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D95809	2021-02-08 13:47:59 -05:00
Fangrui Song	eea34aae2e	[ELF] Inspect -EL & -EB for OUTPUT_FORMAT(default, big, little) Choose big if -EB is specified, little if -EL is specified, or default if neither is specified. The new behavior matches GNU ld. Fixes: https://github.com/ClangBuiltLinux/linux/issues/1025 Differential Revision: https://reviews.llvm.org/D96214	2021-02-08 10:34:57 -08:00
Fangrui Song	7605a9a009	[ELF] Support aarch64_be This patch adds * Big-endian values for `R_AARCH64_{ABS,PREL}{16,32,64}` and `R_AARCH64_PLT32` * aarch64elfb & aarch64linuxb BFD emulations * elf64-bigaarch64 output format (bfdname) Link: https://github.com/ClangBuiltLinux/linux/issues/1288 Differential Revision: https://reviews.llvm.org/D96188	2021-02-08 08:55:29 -08:00
Fangrui Song	980d28d955	ELFObjectWriter: Don't sort local symbols GNU as does not sort local symbols. This has several advantages: * The .symtab order is roughly the symbol occurrence order. * The closest preceding STT_SECTION symbol is the definition of a local symbol. * The closest preceding STT_FILE symbol is the defining file of a local symbol, if there are multiple default-version .file directives. (Not implemented in MC.)	2021-02-07 15:47:10 -08:00
Fangrui Song	4b5dbc7a3b	[ELF][test] Improve aarch64 tests	2021-02-06 11:50:57 -08:00
Fangrui Song	6a1235211d	[ELF] --gc-sections: collect unused SHF_LINK_ORDER .gcc_except_table A SHF_LINK_ORDER .gcc_except_table is similar to a .gcc_except_table in a section group. The associated text section is responsible for retaining it. LLD still does not support GC of non-group non-SHF_LINK_ORDER .gcc_except_table - but that is not necessary because we can teach the compiler to set SHF_LINK_ORDER.	2021-02-05 21:35:27 -08:00
Greg McGary	c3e4f3b231	[lld-macho] Fix alignment & layout to match ld64 and satisfy kernel & codesign The Mach kernel & codesign on arm64 macOS have strict requirements for alignment and sequence of segments and sections. Dyld is probably just as picky, though kernel & codesign reject malformed Mach-O files before dyld ever has a chance. I developed this diff by incrementally changing alignments & sequences to match the output of ld64. I stopped when my hello-world test program started working: `codesign --verify` succeeded, and `execve(2)` didn't immediately fail with `errno == EBADMACHO` = `"Malformed Mach-O file"`. Differential Revision: https://reviews.llvm.org/D94935	2021-02-05 17:22:03 -07:00
Fangrui Song	5f4d7b2f0a	[ELF] Improve --icf=safe diagnostic The current diagnostic has confused users. The new wording is adapted from one suggested by Ian Lance Taylor. Differential Revision: https://reviews.llvm.org/D95917	2021-02-05 09:37:37 -08:00
Fangrui Song	ed399d508f	[ELF] Make SHF_GNU_RETAIN sections GC roots binutils 2.36 introduced the new section flag SHF_GNU_RETAIN (for ELFOSABI_GNU & ELFOSABI_FREEBSD) to mark a section as a GC root. Several LLVM side toolchain folks (including me) were involved in the design process of SHF_GNU_RETAIN and were happy with this proposal. Currently GNU ld only respects SHF_GNU_RETAIN semantics for ELFOSABI_GNU & ELFOSABI_FREEBSD object files (https://sourceware.org/bugzilla/show_bug.cgi?id=27282). GNU ld sets EI_OSABI to ELFOSABI_GNU for relocatable output (https://sourceware.org/bugzilla/show_bug.cgi?id=27091). In practice the single value EI_OSABI is neither a good indicator for object file compatibility, nor a useful mechanism marking used ELF extensions. For input, we respect SHF_GNU_RETAIN semantics even for ELFOSABI_NONE object files. This is compatible with how LLD and GNU ld handle (mildly useful) STT_GNU_IFUNC / (emitted by GCC, considered misfeature by some folks) STB_GNU_UNIQUE input. (As of LLVM 12.0.0, the integrated assembler does not set ELFOSABI_GNU for STT_GNU_IFUNC/STB_GNU_UNIQUE). Arguably STT_GNU_IFUNC/STB_GNU_UNIQUE probably need indicators in object files but SHF_GNU_RETAIN is more likely accepted by more OSABI platforms. For output, we take a step further than GNU ld: we don't promote ELFOSABI_NONE to ELFOSABI_GNU for all output. Differential Revision: https://reviews.llvm.org/D95749	2021-02-04 09:23:01 -08:00
Fangrui Song	b3165a70ae	[ELF] Allow R_386_GOTOFF from .debug_info In GCC emitted .debug_info sections, R_386_GOTOFF may be used to relocate DW_AT_GNU_call_site_value values (https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98946). R_386_GOTOFF (`S + A - GOT`) is one of the `isStaticLinkTimeConstant` relocation types which is not PC-relative, so it can be used from non-SHF_ALLOC sections. We currently allow new relocation types as needs come. The diagnostic has caught some bugs in the past. Differential Revision: https://reviews.llvm.org/D95994	2021-02-04 09:17:47 -08:00
Jez Ng	f843bb82c0	[lld-macho] Force-loading should share code path with regular archive loads This extends {D92539} to work even when we are loading archive members via `-force_load`. I uncovered this issue while trying to force-load archives containing bitcode -- we were segfaulting. In addition to fixing the `-force_load` case, this diff also addresses the behavior of `-ObjC` when LTO bitcode is involved -- we need to force-load those archive members if they contain ObjC categories. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D95265	2021-02-03 13:43:47 -05:00
Jez Ng	163dcd8513	[lld-macho] Associate each Symbol with an InputFile This makes our error messages more informative. But the bigger motivation is for LTO symbol resolution, which will be in an upcoming diff. The changes in this one are largely mechanical. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D94316	2021-02-03 13:43:47 -05:00
Abhina Sreeskantharajan	e59d336e75	[test] Use host platform specific error message substitution in lit tests - continued On z/OS, other error messages are not matched correctly in lit tests. ``` EDC5121I Invalid argument. EDC5111I Permission denied. ``` This patch adds a lit substitution to fix it. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D95808	2021-02-03 09:53:22 -05:00
Greg McGary	3a9d2f1488	[lld-macho][NFC] refactor relocation handling Add per-reloc-type attribute bits and migrate code from per-target file into target independent code, driven by reloc attributes. Many cleanups Differential Revision: https://reviews.llvm.org/D95121	2021-02-02 10:54:53 -07:00
Fangrui Song	57bfa2ddb6	[ELF] Delete unused --warn-ifunc-textrel The option catches incompatibility between `R_*_IRELATIVE` and DT_TEXTREL/DF_TEXTREL before glibc 2.29. Newer glibc versions are more common nowadays and I don't think this option has ever been used. Diagnosing this problem is also straightforward by reading the stack trace.	2021-02-02 09:47:06 -08:00
Fangrui Song	3f8dda50cb	[test] Fix unused FileCheck prefixes in lld	2021-02-01 20:52:33 -08:00
Wouter van Oortmerssen	0d9b17d0ef	[WebAssembly] fixed wasm64 data segment init exp not 64-bit As defined in the spec: https://github.com/WebAssembly/memory64/blob/master/proposals/memory64/Overview.md Differential Revision: https://reviews.llvm.org/D95651	2021-02-01 11:32:50 -08:00
Fangrui Song	3949a3ed49	[ELF][test] Add --emit-relocs --gc-sections test for relocation section for a non-SHF_ALLOC section This has been fixed by D89841.	2021-01-30 00:51:36 -08:00
Abhina Sreeskantharajan	42a21778f6	[test] Use host platform specific error message substitution in lit tests On z/OS, the following error message is not matched correctly in lit tests. ``` EDC5129I No such file or directory. ``` This patch uses a lit config substitution to check for platform specific error messages. Reviewed By: muiez, jhenderson Differential Revision: https://reviews.llvm.org/D95246	2021-01-29 07:16:30 -05:00
Sam Clegg	5c0164890c	[lld][WebAssembly] Update comments mentioning legacy function names. NFC The __wasm_apply_relocs function was split into two separate functions as part of https://reviews.llvm.org/D93066 but these comments were not correctly updated at the time. Differential Revision: https://reviews.llvm.org/D95623	2021-01-28 08:52:14 -08:00
Teresa Johnson	1487747e99	[LTO] Prevent devirtualization for symbols dynamically exported Identify dynamically exported symbols (--export-dynamic[-symbol=], --dynamic-list=, or definitions needed to preempt shared objects) and prevent their LTO visibility from being upgraded. This helps avoid use of whole program devirtualization when there may be overrides in dynamic libraries. Differential Revision: https://reviews.llvm.org/D91583	2021-01-27 15:54:13 -08:00
Adhemerval Zanella	988cc0a083	[LLD][ELF][AArch64] Add support for R_AARCH64_LD64_GOTPAGE_LO15 relocation It is not used by LLVM, but GCC might generate it when compiling with -fpie, as indicated by PR#40357 [1]. [1] https://bugs.llvm.org/show_bug.cgi?id=40357	2021-01-26 12:01:38 +00:00
Georgii Rymar	e98d5c3192	[libObject,llvm-readelf/obj] - Don't use @@ when printing versions of undefined symbols. A default version (@@) is only available for defined symbols. Currently we use "@@" for undefined symbols too. This patch fixes the issue and improves our test case. Differential revision: https://reviews.llvm.org/D95219	2021-01-26 12:05:59 +03:00
Sam Clegg	84c6f32584	[Object][WebAssembly] Update format of error messages Error message should start with lowercase in accordance with https://llvm.org/docs/CodingStandards.html#error-and-warning-messages Differential Revision: https://reviews.llvm.org/D95239	2021-01-25 21:12:53 -08:00
Abhina Sreeskantharajan	978444d531	Revert "[SystemZ][z/OS] Fix No such file or directory expression error" This reverts commit `06f8a49693`.	2021-01-25 08:29:38 -05:00
James Henderson	b37a349ff2	[lld][ELF][test] Add testing for IE/LD TLS weak undef references Whilst migrating/retiring some downstream testing, I came across a test for weak undef IE and LD TLS references, but was unable to find any equivalent in LLD's upstream testing. There do seem to be some subtle differences that could be worth testing compared to LE TLS references, in particular that IE can be relaxed to LE in this case, hence this change. Differential Revision: https://reviews.llvm.org/D95124 Reviewed by: grimar, MaskRay	2021-01-25 09:58:56 +00:00
Fangrui Song	6fe193bf27	[test] Add -mtriple	2021-01-22 19:07:02 -08:00
Fangrui Song	eda973bbc7	[ELF][test] Add a test about --exclude-libs applying to version symbols D94280 also fixed PR48702.	2021-01-22 18:46:56 -08:00
Jez Ng	041f3ee664	[lld-macho] Ignore -lto_library Just getting rid of some logspew as I test LLD under existing build systems. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D95213	2021-01-22 16:48:50 -05:00
Hongtao Yu	8aa3ee241d	[CSSPGO] LTO option for pseudo probe Adding a lld option to support emitting pseudo probe metadata in LTO mode. Reviewed By: MaskRay, wmi, wenlei Differential Revision: https://reviews.llvm.org/D95056	2021-01-22 11:07:10 -08:00
Fangrui Song	d24b94f070	[ELF] --wrap: retain __wrap_foo if foo is defined in an object/bitcode file If foo is referenced in any object file, bitcode file or shared object, `__wrap_foo` should be retained as the redirection target of sym (`f96ff3c0f8`). If the object file defining foo has foo references, we cannot easily distinguish the case from cases where foo is not referenced (we haven't scanned relocations). Retain `__wrap_foo` because we choose to wrap sym references regardless of whether sym is defined to keep non-LTO/LTO/relocatable links' behaviors similar https://sourceware.org/bugzilla/show_bug.cgi?id=26358 . If foo is defined in a shared object, `__wrap_foo` can still be omitted (`wrap-dynamic-undef.s`). Reviewed By: andrewng Differential Revision: https://reviews.llvm.org/D95152	2021-01-22 09:20:29 -08:00
Bob Haarman	8e0b179315	[ELF] report section sizes when output file too large Fixes PR48523. When the linker errors with "output file too large", one question that comes to mind is how the section sizes differ from what they were previously. Unfortunately, this information is lost when the linker exits without writing the output file. This change makes it so that the error message includes the sizes of the largest sections. Reviewed By: MaskRay, grimar, jhenderson Differential Revision: https://reviews.llvm.org/D94560	2021-01-21 19:47:03 +00:00
Georgii Rymar	51f4958057	[yaml2obj/obj2yaml] - Improve dumping/creating of ELF versioning sections. This makes the following improvements. For `SHT_GNU_versym`: * yaml2obj: set `sh_link` to index of `.dynsym` section automatically. For `SHT_GNU_verdef`: * yaml2obj: set `sh_link` to index of `.dynstr` section automatically. * yaml2obj: set `sh_info` field automatically. * obj2yaml: don't dump the `Info` field when its value matches the number of version definitions. For `SHT_GNU_verneed`: * yaml2obj: set `sh_link` to index of `.dynstr` section automatically. * yaml2obj: set `sh_info` field automatically. * obj2yaml: don't dump the `Info` field when its value matches the number of version dependencies. Also, simplifies a few test cases. Differential revision: https://reviews.llvm.org/D94956	2021-01-21 10:36:48 +03:00
Reid Kleckner	9e708ac6b9	[COFF] Fix relocation offsets in pdb-file-statics test input The relocation offsets were incorrect. I fixed them with llvm-readobj -codeview -codeview-subsection-bytes, which has a helpful printout of the relocations that apply to a given symbol record with their offsets. With this, I was able to update the relocation offsets in the yaml to fix the line table and the S_DEFRANGE_REGISTER records. There is still some remaining inconsistency in yaml2obj and obj2yaml when round tripping MSVC objects, but that isn't a blocker for relanding D94267.	2021-01-20 11:45:30 -08:00
Jez Ng	697f4e429b	[lld-macho] Run ObjCContractPass during LTO Run the ObjCARCContractPass during LTO. The legacy LTO backend (under LTO/ThinLTOCodeGenerator.cpp) already does this; this diff just adds that behavior to the new LTO backend. Without that pass, the objc.clang.arc.use intrinsic will get passed to the instruction selector, which doesn't know how to handle it. In order to test both the new and old pass managers, I've also added support for the `--[no-]lto-legacy-pass-manager` flags. P.S. Not sure if the ordering of the pass within the pipeline matters... Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D94547	2021-01-20 14:21:32 -05:00
Jeremy Morse	eff6e75c3f	[LLD][ELF] Correct test temporary file paths In `8031785f4a` the temporary object being built was moved to %t/main.o, but not all run lines were updated to reflect this. Observe the failure on this buildbot: http://lab.llvm.org:8011/#/builders/5/builds/3646/steps/9/logs/stdio It might pass locally for some people due to a stale %t.o hanging around the build directory.	2021-01-20 11:46:11 +00:00
Fangrui Song	f96ff3c0f8	[ELF] --wrap: Produce a dynamic symbol for undefined __wrap_ ``` // a.s jmp fcntl // b.s .globl fcntl fcntl: ret ``` `ld.lld -shared --wrap=fcntl a.o b.o` has an `R_X86_64_JUMP_SLOT` referencing the index 0 undefined symbol, which will cause a glibc `symbol lookup error` at runtime. This is because `__wrap_fcntl` is not in .dynsym We use an approximation `!wrap->isUndefined()`, which doesn't set `isUsedInRegularObj` of `__wrap_fcntl` when `fcntl` is referenced and `__wrap_fcntl` is undefined. Fix this by using `sym->referenced`.	2021-01-19 21:23:57 -08:00
Fangrui Song	8031785f4a	[ELF][test] Improve --wrap tests	2021-01-19 21:21:19 -08:00
Sam Clegg	96ef4f307d	Revert "[WebAssembly] call_indirect issues table number relocs" This reverts commit `418df4a6ab`. This change broke emscripten tests, I believe because it started generating a 5-byte wide table index in the call_indirect instruction. Neither v8 nor wabt seem to be able to handle that. The spec currently says that this is a single 0x0 byte and: "In future versions of WebAssembly, the zero byte occurring in the encoding of the call_indirect instruction may be used to index additional tables." So we need to revisit this change. For backwards compat I guess we need to guarantee that __indirect_function_table is always at address zero. We could also consider making this a single-byte relocation with an assert if we have more than 127 tables (for now). Differential Revision: https://reviews.llvm.org/D95005	2021-01-19 15:06:07 -08:00
Sam Clegg	b99147b4fa	[lld][WebAssembly] Don't define indirect function table in relocatable output Object files (and the output of --relocatable) should never define __indirect_function_table. It should always be linker synthesized with the final output executable. Differential Revision: https://reviews.llvm.org/D94993	2021-01-19 12:59:20 -08:00
Fangrui Song	5fcb412ed0	[ELF] Support R_PPC64_ADDR16_HIGH R_PPC64_ADDR16_HI represents bits 16-31 of a 32-bit value; R_PPC64_ADDR16_HIGH represents bits 16-31 of a 64-bit value. In the Linux kernel, `LOAD_REG_IMMEDIATE_SYM` defined in `arch/powerpc/include/asm/ppc_asm.h` uses @l, @high, @higher, @highest to load the 64-bit value of a symbol. Fixes https://github.com/ClangBuiltLinux/linux/issues/1260	2021-01-19 11:42:53 -08:00
Fangrui Song	e12e0d66c0	[ELF] Error for out-of-range R_PPC64_ADDR16_HA, R_PPC64_ADDR16_HI and their friends There are no tests for REL16_* and TPREL16_*.	2021-01-19 11:42:52 -08:00
Fangrui Song	d39adeaf44	[ELF] Improve R_PPC64_ADDR* relocation tests	2021-01-19 11:42:51 -08:00
Arthur Eubanks	a231786909	[wasm][LLD] Rename --lto-new-pass-manager to --no-lto-legacy-pass-manager This follows a similar ELF change. Reviewed By: MaskRay, sbc100 Differential Revision: https://reviews.llvm.org/D93253	2021-01-19 11:22:40 -08:00
Andy Wingo	1a9b6e4a32	[WebAssembly][lld] Fix call-indirect.s test to validate Add missing address operand, so that we can validate the output files. Depends on D92315. Differential Revision: https://reviews.llvm.org/D92320	2021-01-19 16:12:38 +01:00
Hans Wennborg	ec877106a3	[ThinLTO] Also prune Thin-* files from the ThinLTO cache Such files (Thin-%%%%%%.tmp.o) are supposed to be deleted immediately after they're used (either by renaming or deletion). However, we've seen instances on Windows where this doesn't happen, probably due to the filesystem being flaky. This is effectively a resource leak which has prevented us from using the ThinLTO cache on Windows. Since those temporary files are in the thinlto cache directory which we prune periodically anyway, allowing them to be pruned too seems like a tidy way to solve the problem. Differential revision: https://reviews.llvm.org/D94962	2021-01-19 14:43:49 +01:00
Andy Wingo	418df4a6ab	[WebAssembly] call_indirect issues table number relocs This patch changes to make call_indirect explicitly refer to the corresponding function table, residualizing TABLE_NUMBER relocs against it. With this change, wasm-ld now sees all references to tables, and can link multiple tables. Differential Revision: https://reviews.llvm.org/D90948	2021-01-19 09:32:45 +01:00
Adhemerval Zanella	2f92386e72	[LLD][ELF][AArch64] Set _GLOBAL_OFFSET_TABLE_ at the start of .got The commit `18aa0be36e` changed the default GotBaseSymInGotPlt to true for AArch64. This is different from binutils, where _GLOBAL_OFFSET_TABLE_ points at the start of .got. It seems not to interfere with current relocations used by LLVM. However, as indicated by PR#40357 [1], gcc generates R_AARCH64_LD64_GOTPAGE_LO15 for -pie (in fact it also generates the relocation for -fpic). This change is required for lld to correctly handle R_AARCH64_LD64_GOTPAGE_LO15 from objects generated by gcc. [1] https://bugs.llvm.org/show_bug.cgi?id=40357	2021-01-18 14:51:14 -03:00
Andy Wingo	6339382807	[WebAssembly] Add support for table linking to wasm-ld This patch adds support to wasm-ld for linking multiple table references together, in a manner similar to wasm globals. The indirect function table is synthesized as needed. To manage the transitional period in which the compiler doesn't yet produce TABLE_NUMBER relocations and doesn't residualize table symbols, the linker will detect object files which have table imports or definitions, but no table symbols. In that case it will synthesize symbols for the defined and imported tables. As a change, relocatable objects are now written with table symbols, which can cause symbol renumbering in some of the tests. If no object file requires an indirect function table, none will be written to the file. Note that for legacy ObjFile inputs, this test is conservative: as we don't have relocs for each use of the indirect function table, we just assume that any incoming indirect function table should be propagated to the output. Differential Revision: https://reviews.llvm.org/D91870	2021-01-18 16:57:18 +01:00
Abhina Sreeskantharajan	689aaba7ac	[SystemZ][z/OS] Fix No such file or directory expression error matching in lit tests On z/OS, the following error message is not matched correctly in lit tests. This patch updates the CHECK expression to match successfully. ``` EDC5129I No such file or directory. ``` Reviewed By: muiez Differential Revision: https://reviews.llvm.org/D94239	2021-01-18 07:14:37 -05:00
Fangrui Song	3809f4ebab	[ELF] Support R_PPC_ADDR24 (ba foo; bla foo)	2021-01-17 00:02:13 -08:00
Derek Schuff	187d771d27	Revert "[WebAssembly] Add support for table linking to wasm-ld" This reverts commit `38dfce706f`. CI discovered a bug where the table is exported twice: see D91870	2021-01-15 15:50:41 -08:00
Andy Wingo	38dfce706f	[WebAssembly] Add support for table linking to wasm-ld This patch adds support to wasm-ld for linking multiple table references together, in a manner similar to wasm globals. The indirect function table is synthesized as needed. To manage the transitional period in which the compiler doesn't yet produce TABLE_NUMBER relocations and doesn't residualize table symbols, the linker will detect object files which have table imports or definitions, but no table symbols. In that case it will synthesize symbols for the defined and imported tables. As a change, relocatable objects are now written with table symbols, which can cause symbol renumbering in some of the tests. If no object file requires an indirect function table, none will be written to the file. Note that for legacy ObjFile inputs, this test is conservative: as we don't have relocs for each use of the indirect function table, we just assume that any incoming indirect function table should be propagated to the output. Differential Revision: https://reviews.llvm.org/D91870	2021-01-15 09:21:52 +01:00
Teresa Johnson	5b42fd8dd4	[LTO] Test format fix (NFC) As requested in D91583, use ';;' instead of ';' to precede comments in lld test. I did this in the equivalent gold test as well.	2021-01-14 14:09:50 -08:00
Sam Clegg	07b6aeb568	[lld][WebAssembly] Fix for TLS + --relocatable When running in `-r/--relocatable` we output relocations but the new TLS relocations type was missing from `ObjFile::calcNewAddend` causing this combination of inputs/flags to crash the linker. Also avoid creating tls variables in relocatable mode. These variables are only needed when linking final executables. Fixes: https://github.com/emscripten-core/emscripten/issues/12934 Fixes: PR48506 Differential Revision: https://reviews.llvm.org/D93554	2021-01-13 07:49:39 -08:00
Fangrui Song	93ad0edf67	[ELF] Drop .rel[a].debug_gnu_pub{names,types} for --gdb-index --emit-relocs Fixes PR48693: --emit-relocs keeps relocation sections. --gdb-index drops .debug_gnu_pubnames and .debug_gnu_pubtypes but not their relocation sections. This can cause a null pointer dereference in `getOutputSectionName`. Also delete debug-gnu-pubnames.s which is covered by gdb-index.s Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D94354	2021-01-12 00:07:28 -08:00
Fangrui Song	ac2224c022	[ELF] --exclude-libs: localize defined libcall symbols referenced by lto.tmp Fixes PR48681: after LTO, lto.tmp may reference a libcall symbol not in an IR symbol table of any bitcode file. If such a symbol is defined in an archive matched by a --exclude-libs, we don't correctly localize the symbol. Add another `excludeLibs` after `compileBitcodeFiles` to localize such libcall symbols. Unfortunately we have to keep the existing one for D43126. Using VER_NDX_LOCAL is an implementation detail of `--exclude-libs`; it is not necessarily tied to the "localize" behavior. `local:` patterns in a version script can be omitted. The `symbol ... has undefined version ...` error should not be exempted. Ideally we should error as GNU ld does. https://issuetracker.google.com/issues/73020933 Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D94280	2021-01-11 09:33:22 -08:00
Paul Robinson	c161775dec	[FastISel] Flush local value map on every instruction Local values are constants or addresses that can't be folded into the instruction that uses them. FastISel materializes these in a "local value" area that always dominates the current insertion point, to try to avoid materializing these values more than once (per block). https://reviews.llvm.org/D43093 added code to sink these local value instructions to their first use, which has two beneficial effects. One, it is likely to avoid some unnecessary spills and reloads; two, it allows us to attach the debug location of the user to the local value instruction. The latter effect can improve the debugging experience for debuggers with a "set next statement" feature, such as the Visual Studio debugger and PS4 debugger, because instructions to set up constants for a given statement will be associated with the appropriate source line. There are also some constants (primarily addresses) that could be produced by no-op casts or GEP instructions; the main difference from "local value" instructions is that these are values from separate IR instructions, and therefore could have multiple users across multiple basic blocks. D43093 avoided sinking these, even though they were emitted to the same "local value" area as the other instructions. The patch comment for D43093 states: Local values may also be used by no-op casts, which adds the register to the RegFixups table. Without reversing the RegFixups map direction, we don't have enough information to sink these instructions. This patch undoes most of D43093, and instead flushes the local value map after(*) every IR instruction, using that instruction's debug location. This avoids sometimes incorrect locations used previously, and emits instructions in a more natural order. 
In addition, constants materialized due to PHI instructions are not assigned a debug location immediately; instead, when the local value map is flushed, if the first local value instruction has no debug location, it is given the same location as the first non-local-value-map instruction. This prevents PHIs from introducing unattributed instructions, which would either be implicitly attributed to the location for the preceding IR instruction, or given line 0 if they are at the beginning of a machine basic block. Neither of those consequences is good for debugging. This does mean materialized values are not re-used across IR instruction boundaries; however, only about 5% of those values were reused in an experimental self-build of clang. (*) Actually, just prior to the next instruction. It seems like it would be cleaner the other way, but I was having trouble getting that to work. This reapplies commits `cf1c774d` and `dc35368c`, and adds the modification to PHI handling, which should avoid problems with debugging under gdb. Differential Revision: https://reviews.llvm.org/D91734	2021-01-11 08:32:36 -08:00
Jez Ng	daaaed6bb8	[lld-macho] Fix TLV data initialization We were mishandling the case where both `__tbss` and `__thread_data` sections were present. TLVP relocations should be encoded as offsets from the start of `__thread_data`, even if the symbol is actually located in `__thread_bss`. Previously, we were writing the offset from the start of the containing section, which doesn't really make sense since there's no way `tlv_get_addr()` can know which section a given `tlv$init` symbol is in at runtime. In addition, this patch ensures that we place `__thread_data` immediately before `__thread_bss`. This is what ld64 does, likely for performance reasons. Zerofill sections must also be at the end of their segments; we were already doing this, but now we ensure that `__thread_bss` occurs before `__bss`, so that it's always possible to have it contiguous with `__thread_data`. Fixes llvm.org/PR48657. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D94329	2021-01-08 18:48:12 -05:00
Alexandre Ganea	6acfc3a782	Fix build after `eaadb41db6` when the MSVC libs are not in PATH.	2021-01-07 18:52:00 -05:00
Alexandre Ganea	eaadb41db6	[LLD][COFF] When using PCH.OBJ, ensure func_id records indices are remapped under /DEBUG:GHASH Before this patch, when using LLD with /DEBUG:GHASH and MSVC precomp.OBJ files, we had a bunch of: lld-link: warning: S_[GL]PROC32ID record in blabla.obj refers to PDB item index 0x206ED1 which is not a LF[M]FUNC_ID record This was caused by LF_FUNC_ID and LF_MFUNC_ID which didn't have correct mapping to the corresponding TPI records. The root issue was that the indexMapStorage was improperly re-assembled in UsePrecompSource::remapTpiWithGHashes. After this patch, /DEBUG and /DEBUG:GHASH produce exactly the same debug infos in the PDB. Differential Revision: https://reviews.llvm.org/D93732	2021-01-07 17:27:13 -05:00
Wouter van Oortmerssen	5c38ae36c5	[WebAssembly] Fixed byval args missing DWARF DW_AT_LOCATION A struct in C passed by value did not get debug information. Such values are currently lowered to a Wasm local even in -O0 (not to an alloca like on other archs), which becomes a Target Index operand (TI_LOCAL). The DWARF writing code was not emitting locations for TIs, specifically when the location is a single range (not a list). In addition, the ExplicitLocals pass which removes the ARGUMENT pseudo instructions did not update the associated DBG_VALUEs, and couldn't even find these values since the code assumed such instructions are adjacent, which is not the case here. Also fixed asm printing of TIs needed by a test. Differential Revision: https://reviews.llvm.org/D94140	2021-01-07 10:31:38 -08:00
Martin Storsjö	248e3450fb	[LLD] [MinGW] Pass the --demangle and --no-demangle options to the COFF linker Differential Revision: https://reviews.llvm.org/D93950	2021-01-07 10:02:19 +02:00
Peter Collingbourne	aed84542d5	ELF: Teach the linker about the 'B' augmentation string character. This character indicates that when return pointer authentication is being used, the function signs the return address using the B key. Differential Revision: https://reviews.llvm.org/D93954	2021-01-05 19:51:11 -08:00
Brandon Bergren	275eb8289c	[PowerPC] Support powerpcle target in LLD [4/5] Add support for linking powerpcle code in LLD. Rewrite lld/test/ELF/emulation-ppc.s to use a shared check block and add powerpcle tests. Update tests. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D93917	2021-01-02 12:18:05 -06:00
Fangrui Song	b0d6bebe90	[ELF] Drop '>>> defined in ' for locations of linker synthesized symbols Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D93925	2020-12-30 09:16:26 -08:00
Jordan Rupprecht	be89d7460b	[lld][test] Relax test assertion of cmake defaults appearing in --version output. D93491 implemented `--version` for the MachO LLD port, but asserts that the string contains something like "LLD N.N". However, this is just the cmake default for `LLD_VERSION_STRING`, and downstream users may choose a different value, e.g. a rolling distro may print "LLD trunk".	2020-12-23 14:33:04 -08:00
Sriraman Tallam	34e70d722d	Append ".__part." to every basic block section symbol. Every basic block section symbol created by -fbasic-block-sections will contain ".__part." to know that this symbol corresponds to a basic block fragment of the function. This patch solves two problems: a) Like D89617, we want function symbols with suffixes to be properly qualified so that external tools like profile aggregators know exactly what this symbol corresponds to. b) The current basic block naming just adds a ".N" to the symbol name where N is some integer. This collides with how clang creates __cxx_global_var_init.N. clang creates these symbol names to call constructor functions and basic block symbol naming should not use the same style. Fixed all the test cases and added an extra test for __cxx_global_var_init breakage. Differential Revision: https://reviews.llvm.org/D93082	2020-12-23 11:35:44 -08:00
Jez Ng	9d1140e18e	[lld-macho] Simulator & DriverKit executables should always be PIE We didn't have support for parsing DriverKit in our `-platform` flag, so add that too. Also remove a bunch of unnecessary namespace prefixes. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D93741	2020-12-23 11:24:12 -05:00
Nico Weber	77fb45e59e	[lld/mac] Add --version flag It's an extension to ld64, but all the other ports have it, and someone asked for it in PR43721. While here, change the COFF help text to match the other ports. Differential Revision: https://reviews.llvm.org/D93491	2020-12-22 22:06:39 -05:00
Nico Weber	57ffbe020a	[lld/mac] Don't add names of unreferenced symbols to string table Before this, a hello world program would contain many many unnecessary entries in its string table. No behavior change, just makes the string table in the output smaller and more like ld64's. Differential Revision: https://reviews.llvm.org/D93711	2020-12-22 15:52:33 -05:00
Nico Weber	13f439a187	[lld/mac] Implement support for private extern symbols Private extern symbols are used for things scoped to the linkage unit. They cause duplicate symbol errors (so they're in the symbol table, unlike TU-scoped truly local symbols), but they don't make it into the export trie. They are created e.g. by compiling with -fvisibility=hidden. If two weak symbols have differing privateness, the combined symbol is non-private external. (Example: inline functions and some TUs that include the header defining it were built with -fvisibility-inlines-hidden and some weren't). A weak private external symbol implicitly has its "weak" dropped and behaves like a regular strong private external symbol: Weak is an export trie concept, and private symbols are not in the export trie. If a weak and a strong symbol have different privateness, the strong symbol wins. If two common symbols have differing privateness, the larger symbol wins. If they have the same size, the privateness of the symbol seen later during the link wins (!) -- this is a bit lame, but it matches ld64 and this behavior takes 2 lines less to implement than the less surprising "result is non-private external), so match ld64. (Example: `int a` in two .c files, both built with -fcommon, one built with -fvisibility=hidden and one without.) This also makes `__dyld_private` a true TU-local symbol, matching ld64. To make this work, make the `const char*` StringRefZ ctor to correctly set `size` (without this, writing the string table crashed when calling getName() on the __dyld_private symbol). Mention in CommonSymbol's comment that common symbols are now disabled by default in clang. Mention in -keep_private_externs's HelpText that the flag only has an effect with `-r` (which we don't implement yet -- so this patch here doesn't regress any behavior around -r + -keep_private_externs)). 
ld64 doesn't explicitly document it, but the commit text of http://reviews.llvm.org/rL216146 does, and ld64's OutputFile::buildSymbolTable() checks `_options.outputKind() == Options::kObjectFile` before calling `_options.keepPrivateExterns()` (the only reference to that function). Fixes PR48536. Differential Revision: https://reviews.llvm.org/D93609	2020-12-21 21:23:33 -05:00
Jez Ng	0f8224c210	[lld-macho][nfc] Remove %T from headerpad.s The llvm-lit docs indicate that it is deprecated. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D93605	2020-12-21 14:44:08 -05:00
Jez Ng	a817594de9	[lld-macho][nfc] Clean up tests * Migrate most of our tests to use `split-file` instead of `echo` * Remove individual `rm -f %t/libfoo.a` commands in favor of a top-level `rm -rf %t` * Remove unused `Inputs/libfunction.s` Reviewed By: #lld-macho, compnerd Differential Revision: https://reviews.llvm.org/D93604	2020-12-21 14:44:08 -05:00
Fangrui Song	fb3c1b3de5	[ELF] Reject local-exec TLS relocations for -shared For x86-64, D33100 added a diagnostic for local-exec TLS relocations referencing a preemptible symbol. This patch generalizes it to non-preemptible symbols (see `-Bsymbolic` in `tls.s`) on all targets. Local-exec TLS relocations resolve to offsets relative to a fixed point within the static TLS block, which are only meaningful for the executable. With this change, `clang -fpic -shared -fuse-ld=bfd a.c` on the following example will be flagged for AArch64/ARM/i386/x86-64/RISC-V ``` static __attribute__((tls_model("local-exec"))) __thread long TlsVar = 42; long bump() { return ++TlsVar; } ``` Note, in GNU ld, at least arm, riscv and x86's ports have the similar diagnostics, but aarch64 and ppc64 do not error. Differential Revision: https://reviews.llvm.org/D93331	2020-12-21 08:47:04 -08:00
Fangrui Song	e25afcfa51	[ELF][PPC64] Detect missing R_PPC64_TLSGD/R_PPC64_TLSLD and disable TLS relaxation Alternative to D91611. The TLS General Dynamic/Local Dynamic code sequences need to mark `__tls_get_addr` with R_PPC64_TLSGD or R_PPC64_TLSLD, e.g. ``` addis r3, r2, x@got@tlsgd@ha # R_PPC64_GOT_TLSGD16_HA addi r3, r3, x@got@tlsgd@l # R_PPC64_GOT_TLSGD16_LO bl __tls_get_addr(x@tlsgd) # R_PPC64_TLSGD followed by R_PPC64_REL24 nop ``` However, there are two deviations from the above: 1. direct call to `__tls_get_addr`. This is essential to implement ld.so in glibc/musl/FreeBSD. ``` bl __tls_get_addr nop ``` This is only used in a -shared link, and thus not subject to the GD/LD to IE/LE relaxation issue below. 2. Missing R_PPC64_TLSGD/R_PPC64_TLSLD for compiler generated TLS references According to Stefan Pintille, "In the early days of the transition from the ELFv1 ABI that is used for big endian PowerPC Linux distributions to the ELFv2 ABI that is used for little endian PowerPC Linux distributions, there was some ambiguity in the specification of the relocations for TLS. The GNU linker has implemented support for correct handling of calls to __tls_get_addr with a missing relocation. Unfortunately, we didn't notice that the IBM XL compiler did not handle TLS according to the updated ABI until we tried linking XL compiled libraries with LLD." In short, LLD needs to work around the old IBM XL compiler issue. Otherwise, if the object file is linked in -no-pie or -pie mode, the result will be incorrect because the 4 instructions are partially rewritten (the latter 2 are not changed). Work around the compiler bug by disabling General Dynamic/Local Dynamic to Initial Exec/Local Exec relaxation. Note, we also disable Initial Exec to Local Exec relaxation for implementation simplicity, though technically it can be kept. ppc64-tls-missing-gdld.s demonstrates the updated behavior. 
Reviewed By: #powerpc, stefanp, grimar Differential Revision: https://reviews.llvm.org/D92959	2020-12-21 08:45:41 -08:00
Jez Ng	64e4757200	[lld-macho] Have order files support filtering by archive member paths Also remove iteration over ArchiveFile symbols in buildInputSectionPriorities -- that was rendered unnecessary after D92539, which included ObjFiles from ArchiveFiles inside the `inputFiles` vector. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D93569	2020-12-20 13:49:18 -05:00
Jez Ng	5f9896d3b2	[lld-macho] Support Obj-C symbols in order files Obj-C symbols may have spaces and colons, which our previous order file parser would be confused by. The order file format has made the very unfortunate choice of using colons for its delimiters, which means that we have to use heuristics to determine if a given colon is part of a symbol or not... Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D93567	2020-12-20 13:49:18 -05:00
Jez Ng	e2863357de	[lld-macho][nfc] Use split-file in order file test Makes it much easier to edit / maintain. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D93566	2020-12-20 13:49:18 -05:00
Greg McGary	99930719c6	Handle overflow beyond the 127 common encodings limit The common encodings table holds only 127 entries. The encodings index for compact entries is 8 bits wide, and indexes 127..255 are stored locally to each second-level page. Prior to this diff, lld would `fatal()` if encodings overflowed the 127 limit. This diff populates a per-second-level-page encodings table as needed. When the per-page encodings table hits its limit, we must terminate the page. If such early termination would consume fewer entries than a regular (non-compact) encoding page, then we prefer the regular format. Caveat: one reason the common-encoding table might overflow is because of DWARF debug-info references, which are not yet implemented and will come with a later diff. Differential Revision: https://reviews.llvm.org/D93267	2020-12-19 14:54:37 -08:00
Harald van Dijk	adc55b5a5a	[X86] Avoid generating invalid R_X86_64_GOTPCRELX relocations We need to make sure not to emit R_X86_64_GOTPCRELX relocations for instructions that use a REX prefix. If a REX prefix is present, we need to instead use a R_X86_64_REX_GOTPCRELX relocation. The existing logic for CALL64m, JMP64m, etc. already handles this by checking the HasREX parameter and using it to determine which relocation type to use. Do this for all instructions that can use relaxed relocations. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D93561	2020-12-18 23:38:38 +00:00
Georgii Rymar	8590b5ccd5	[libObject, llvm-readobj] - Reimplement `ELFFile<ELFT>::getEntry`. Currently, `ELFFile<ELFT>::getEntry` does not check an index of an entry. Because of that the code might read past the end of the symbol table silently. I've added a test to `llvm-readobj\ELF\relocations.test` to demonstrate the possible issue. Also, I've added a unit test for this method. After this change, `getEntry` stops reporting the section index and reuses the `getSectionContentsAsArray` method, which already has all the validation needed. Our related warnings now provide more and better context sometimes. Differential revision: https://reviews.llvm.org/D93209	2020-12-18 16:52:27 +03:00
Greg McGary	cc1cf6332a	[lld-macho] Implement option: -undefined TREATMENT TREATMENT can be `error`, `warning`, `suppress`, or `dynamic_lookup`. The `dynamic_lookup` treatment remains unimplemented for now. Differential Revision: https://reviews.llvm.org/D93263	2020-12-17 17:40:50 -08:00
Amy Huang	7e13694ac7	[llvm-symbolizer][Windows] Add start line when searching in line table sections. Fixes issue where if a line section doesn't start with a line number then the addresses at the beginning of the section don't have line numbers. For example, for a line section like this ``` 0001:00000010-00000014, line/column/addr entries = 1 7 00000013 ! ``` a line number wouldn't be found for addresses from 10 to 12. This matches behavior when using the DIA SDK. Differential Revision: https://reviews.llvm.org/D93306	2020-12-17 07:57:36 -08:00
Adhemerval Zanella	978eb3b87b	[lld] [ELF] AArch64: Handle DT_AARCH64_VARIANT_PCS As indicated by the AArch64 ELF specification, a symbol whose st_other is marked with STO_AARCH64_VARIANT_PCS may follow a variant procedure call standard with a different register usage convention (for instance SVE calls). Static linkers must preserve the marking and propagate it to the dynamic symbol table if any reference or definition of the symbol is marked with STO_AARCH64_VARIANT_PCS, and add a DT_AARCH64_VARIANT_PCS dynamic tag if there are R_<CLS>_JUMP_SLOT relocations that reference such symbols. It implements https://bugs.llvm.org/show_bug.cgi?id=48368. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D93045	2020-12-17 11:09:55 -03:00
Jez Ng	4c8276cdc1	[lld-macho] Use LC_LOAD_WEAK_DYLIB for dylibs with only weakrefs Note that dylibs without any refs will still be loaded in the usual (strong) fashion. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D93435	2020-12-17 08:49:17 -05:00
Jez Ng	811444d7a1	[lld-macho] Add support for weak references Weak references need not necessarily be satisfied at runtime (but they must still be satisfied at link time). So symbol resolution still works as per usual, but we now pass around a flag -- ultimately emitting it in the bind table -- to indicate if a given dylib symbol is a weak reference. ld64's behavior for symbols that have both weak and strong references is a bit bizarre. For non-function symbols, it will emit a weak import. For function symbols (those referenced by BRANCH relocs), it will emit a regular import. I'm not sure what value there is in that behavior, and since emulating it will make our implementation more complex, I've decided to treat regular weakrefs like function symbol ones for now. Fixes PR48511. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D93369	2020-12-17 08:49:16 -05:00
Fangrui Song	66bcbdbc9c	[AArch64InstPrinter] Change printADRPLabel to print the target address in hexadecimal form Similar to D77853. Change ADRP to print the target address in hex, instead of the raw immediate. The behavior is similar to GNU objdump but we also include `0x`. Note: GNU objdump is not consistent whether or not to emit `0x` for different architectures. We try emitting 0x consistently for all targets. ``` GNU objdump: adrp x16, 10000000 Old llvm-objdump: adrp x16, #0 New llvm-objdump: adrp x16, 0x10000000 ``` `adrp Xd, 0x...` assembles to a relocation referencing `ABS+0x10000` which is not intended. We need to use a linker or use yaml2obj. The main test is `test/tools/llvm-objdump/ELF/AArch64/pcrel-address.yaml` Differential Revision: https://reviews.llvm.org/D93241	2020-12-16 09:20:55 -08:00
Fangrui Song	16cb7910f5	[ELF] --emit-relocs: fix a crash if .rela.dyn is an empty output section Fix PR48357: If .rela.dyn appears as an output section description, its type may be SHT_RELA (due to the empty synthetic .rela.plt) while there is no input section. The empty .rela.dyn may be retained due to a reference in a linker script. Don't crash. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D93367	2020-12-16 08:59:38 -08:00
Nico Weber	abc08d5ec7	[mac/lld] fix amend mishap from `ec88746a05`	2020-12-15 19:41:00 -05:00
Nico Weber	ec88746a05	[lld/mac] fill in current and compatibility version for LC_LOAD_(WEAK_)DYLIB Not sure if anything actually depends on this, but it makes `otool -L` output look nicer. Differential Revision: https://reviews.llvm.org/D93332	2020-12-15 19:34:59 -05:00
Nico Weber	09edd9df6e	[mac/lld] simplify code using PackedVersion instead of VersionTuple PackedVersion already does the correct range checks. No behavior change. Differential Revision: https://reviews.llvm.org/D93338	2020-12-15 19:23:07 -05:00
Jez Ng	3aa8e071dd	[lld-macho] Add implicit dylib support for frameworks {D93000} applied to frameworks. Partial fix for PR48511. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D93277	2020-12-15 15:58:26 -05:00
Jez Ng	8a5e068823	[lld-macho] Support -sub_umbrella From what I can tell, it's essentially identical to `-sub_library`, but it doesn't match files ending in ".dylib". Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D93276	2020-12-15 15:58:26 -05:00
Jez Ng	3184519909	[lld-macho] Don't emit rebase opcodes for relocs in TLV sections Their addresses are already encoded as section-relative offsets, so there's no need to rebase them at runtime. {D85080} has some context on the weirdness of TLV sections. Fixes llvm.org/PR48491. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D93257	2020-12-15 15:58:26 -05:00
Jez Ng	544148ae70	[lld-macho] -weak_{library,framework} should always take priority We were not setting forceWeakImport for file paths given by `-weak_library` if we had already loaded the file. This diff fixes that by having `loadDylib` return a cached DylibFile instance even if we have already loaded that file. We still avoid emitting multiple LC_LOAD_DYLIBs, but we achieve this by making inputFiles a SetVector instead of relying on the `loadedDylibs` cache. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D93255	2020-12-15 15:58:26 -05:00
Nico Weber	601f0fb846	[lld/mac] Set ordinal on dynamic undefined symbols in symbol table This lets `nm -m` print "(from libfoo)" in its output, which is more accessible than dumping the bind table. See https://reviews.llvm.org/D57190#2455761 for the somewhat surprising `AltEntry` that appears in symtab.s. Differential Revision: https://reviews.llvm.org/D93318	2020-12-15 14:39:36 -05:00
Fangrui Song	c8da71b53f	[ELF] Error for out-of-range R_X86_64_[REX_]GOTPCRELX Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D93259	2020-12-15 09:20:07 -08:00
Nico Weber	d058b69b1c	[lld/mac] implement -compatibility_version, -current_version Differential Revision: https://reviews.llvm.org/D93237	2020-12-14 18:41:36 -05:00
LemonBoy	92c6141ce6	lld/ELF: Parse MSP430 BFD/emulation names Follow the naming set by TI's own GCC-based toolchain. Also, force the `osabi` field to `ELFOSABI_STANDALONE`, this matches GNU LD's output (the patching is done in `elf32_msp430_post_process_headers`). Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D92931	2020-12-14 09:38:12 -08:00
Georgii Rymar	98a4289810	[llvm-readobj] - For SHT_REL relocations, don't display an addend. This is https://bugs.llvm.org/show_bug.cgi?id=44257. In LLVM style we always print `0` as addend when dumping SHT_REL relocations. It is confusing, this patch stops printing it as the first comment on the bug page suggests. Differential revision: https://reviews.llvm.org/D93033	2020-12-14 12:03:00 +03:00
Fangrui Song	5d1c723b73	[ELF][test] Rewrite st_value=0 copy relocation tests The original tests have unneeded symbols and copy-relocation-zero-abs-addr.s does not actually test anything. Rewrite them and add copy-relocation-zero-addr.s instead. Add --soname=b so that the address 0x203400 will be stable. (When linking an executable with %t.so, the path %t.so will be recorded in the DT_NEEDED entry if %t.so doesn't have DT_SONAME. .dynstr will have varying lengths on different systems.)	2020-12-12 16:50:25 -08:00
Jinsong Ji	9b05f1889a	[lld][PowerPC][test] Avoid flaky failures This test may fail if there are new changes to it. The archives are not deleted, so the contents from the previous test run may affect the contents for the current run; this requires cleaning up the Output dir or forcing a clean build on the buildbot. The fix is to put all the objects in the temporary dir that we clean up every run, to avoid run-to-run flaky failures. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D93128	2020-12-11 19:46:51 +00:00
Jez Ng	349d5c9cf7	[lld-macho] Disable some tests that are failing on Windows lto-object-path.ll, like stabs.s, is disabled on Windows as the path separators make it difficult to write a test that works across platforms. This diff also disables implicit-dylibs.s on Windows as we seem to emit LC_LOAD_DYLIBs in a different order on that platform. This seems like a bug in LLD that needs to be addressed (in a future diff).	2020-12-10 17:54:23 -08:00
Jez Ng	553284be2d	[lld-macho] Don't include absolute address value in expected test output Should fix the mattrs.ll failure introduced by rG29d3b0e47113.	2020-12-10 17:54:23 -08:00
Derek Schuff	dd6412c05c	[WebAssembly][lld] Exclude COMDAT sections Allow exclusion/discarding of custom sections with COMDAT groups. It piggybacks on the existing COMDAT-handling code, but applies to custom sections as well. Differential Revision: https://reviews.llvm.org/D92950	2020-12-10 17:47:41 -08:00
Sam Clegg	e52881a287	[lld][WebAssembly] Split __wasm_apply_relocs function in two We have two types of relocations that we apply on startup: 1. Relocations that apply to wasm globals 2. Relocations that apply to wasm memory The first set of relocations use only the `__memory_base` import to update a set of internal globals. Because wasm globals are thread local these need to run on each thread. Memory relocations, like static constructors, must only be run once. To ensure global relocations run on all threads and because they only depend on the immutable `__memory_base` import we can run them during the WebAssembly start functions, instead of waiting until the post-instantiation __wasm_call_ctors. Differential Revision: https://reviews.llvm.org/D93066	2020-12-10 17:07:39 -08:00
Nico Weber	5dad062d7e	fix typo to cycle bots	2020-12-10 19:20:09 -05:00
Jez Ng	76c36c11a9	[lld-macho] Don't load dylibs more than once Also remove `DylibFile::reexported` since it's unused. Fixes llvm.org/PR48393. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D93001	2020-12-10 15:57:52 -08:00
Jez Ng	6a348f6158	[lld-macho] Implement `-no_implicit_dylibs` Dylibs that are "public" -- i.e. top-level system libraries -- are considered implicitly linked when another library re-exports them. That is, we should load them & bind directly to their symbols instead of via their re-exporting umbrella library. This diff implements that behavior by default, as well as an opt-out flag. In theory, this is just a performance optimization, but in practice it seems that it's needed for correctness. Fixes llvm.org/PR48395. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D93000	2020-12-10 15:57:52 -08:00
Jez Ng	74d799926e	[lld-macho] Initialize AsmParsers earlier We need to initialize AsmParsers before any calls to `addFile`, as bitcode files may require them. Otherwise we trigger `Assertion T && T->hasMCAsmParser()' failed`. Reviewed By: #lld-macho, compnerd Differential Revision: https://reviews.llvm.org/D92913	2020-12-10 15:57:52 -08:00
Jez Ng	29d3b0e471	[lld-macho] Add support for -mcpu, -mattr, -code-model in LTO `-mcpu` and `-code-model` tests were copied from similar ones in LLD-ELF. There doesn't seem to be an equivalent test for `-mattr` in LLD-ELF, so I've verified our behavior by cribbing a test from CodeGen/X86/recip-fastmath.ll. Reviewed By: #lld-macho, compnerd, MaskRay Differential Revision: https://reviews.llvm.org/D92912	2020-12-10 15:57:51 -08:00
Jez Ng	863f7a745e	[lld-macho] Don't attempt to emit rebase opcodes for debug sections This was causing a crash as we were attempting to look up the nonexistent parent OutputSection of the debug sections. We didn't detect it earlier because there was no test for PIEs with debug info (PIEs require us to emit rebases for X86_64_RELOC_UNSIGNED). This diff filters out the debug sections while loading the ObjFiles. In addition to fixing the above problem, it also lets us avoid doing redundant work -- we no longer parse / apply relocations / attempt to emit dyld opcodes for these sections that we don't emit. Fixes llvm.org/PR48392. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D92904	2020-12-10 15:57:51 -08:00
Jez Ng	95831a56d0	[lld-macho] Implement -object_path_lto This makes it possible for STABS entries to reference the debug info contained in the LTO-compiled output. I'm not sure how to test the file mtime within llvm-lit -- GNU and BSD `stat` take different command-line arguments. I've omitted the check for now. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D92537	2020-12-10 15:57:51 -08:00
Sam Clegg	199497086e	[lld][WebAssembly] Delay creation of internal __wasm_memory_init function This also allows for its creation to be conditional so it is completely elided when not needed. Differential Revision: https://reviews.llvm.org/D93035	2020-12-10 10:47:18 -08:00
Sam Clegg	d8ed639a6a	[lld][WebAssembly] Don't emit names for data segments that we omit Followup to https://reviews.llvm.org/D92909 Differential Revision: https://reviews.llvm.org/D92997	2020-12-09 20:59:26 -08:00
Fangrui Song	7d38861ce3	[ELF] Rename --[no-]lto-new-pass-manager to --[no-]lto-legacy-pass-manager Normally we should not delete options. However, the Clang driver passes `-plugin-opt={new,legacy}-pass-manager` instead of `--[no-]lto-legacy-pass-manager` (`-plugin-opt=new-pass-manager` has been used since 7.0), and it is unlikely anyone will use the `--lto-*` style options directly. So let's rename them to be consistent with the Clang driver option names. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D92988	2020-12-09 17:53:37 -08:00
Fangrui Song	7adcacda06	Rename -plugin-opt=no-new-pass-manager to -plugin-opt=legacy-pass-manager	2020-12-09 16:43:30 -08:00
Fangrui Song	68ff3b3376	[LLD][gold] Add -plugin-opt=no-new-pass-manager -DENABLE_EXPERIMENTAL_NEW_PASS_MANAGER=on configured LLD and LLVMgold.so will use the new pass manager by default. Add an option to use the legacy pass manager. This will also be used by the Clang driver when -fno-new-pass-manager (D92915) / -fno-experimental-new-pass-manager is set. Reviewed By: aeubanks, tejohnson Differential Revision: https://reviews.llvm.org/D92916	2020-12-09 13:31:03 -08:00
Sam Clegg	9a72d3e3e4	[WebAssembly] Add support for named data sections in wasm binaries Followup to https://reviews.llvm.org/D91769 which added support for named globals. Differential Revision: https://reviews.llvm.org/D92909	2020-12-09 12:57:07 -08:00
Fangrui Song	baef18dffb	[ELF] Reorganize "is only supported on" tests and fix some diagnostics	2020-12-09 12:14:00 -08:00
Yvan Roux	03a77d04b4	[LLD][ELF] Fix typo in relocation-model-pic.ll Should fix non-x86 bot failures.	2020-12-09 15:38:50 +01:00
Jez Ng	78976bf3da	[lld-macho] Support parsing of bitcode within archives Also error out if we find anything other than an object or bitcode file in the archive. Note that we were previously inserting the symbols and sections of the unpacked ObjFile into the containing ArchiveFile. This was actually unnecessary -- we can just insert the ObjectFile (or BitcodeFile) into the `inputFiles` vector. This is the approach taken by LLD-ELF. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D92539	2020-12-08 10:34:32 -08:00
Sean Fertile	8f91f38148	[LLD] Search archives for symbol defs to override COMMON symbols. This patch changes the archive handling to enable the semantics needed for legacy FORTRAN common blocks and block data. When we have a COMMON definition of a symbol and are including an archive, LLD will now search the members for global/weak definitions to override the COMMON symbol. The previous LLD behavior (where a member would only be included if it satisfied some other needed symbol definition) can be re-enabled with the option '-no-fortran-common'. Differential Revision: https://reviews.llvm.org/D86142	2020-12-07 10:09:19 -05:00
Fangrui Song	f2bbbad141	[ELF] Fix relocation-model.ll	2020-12-04 19:33:19 -08:00
Sam Clegg	ab58e4cb51	[lld][WebAssembly] Add support for PIC + passive data initialization This change improves our support for shared memory to include PIC executables (and shared libraries). To handle this case the linker-generated `__wasm_init_memory` function (that only exists in shared memory builds) must be capable of loading memory segments at non-const offsets based on the runtime value of `__memory_base`. Differential Revision: https://reviews.llvm.org/D92620	2020-12-04 17:28:23 -08:00
Craig Topper	ad923edfc1	[RISCV] Add support for printing pcrel immediates as absolute addresses in llvm-objdump This makes the llvm-objdump output much more readable and closer to binutils objdump. This builds on D76591 It requires changing the OperandType for certain immediates to "OPERAND_PCREL" so tablegen will generate code to pass the instruction's address. This means we can't do the generic check on these instructions in verifyInstruction any more. Should I add it back with explicit opcode checks? Or should we add a new operand flag to control the passing of address instead of matching the name? Differential Revision: https://reviews.llvm.org/D92147	2020-12-04 10:34:12 -08:00
Nico Weber	16b1f6e385	[mac/lld] Add support for the LC_LINKER_OPTION load command in o files clang puts `-framework CoreFoundation` in this load command for files that use @available / __builtin_available. Without support for this, binaries that don't explicitly link to CoreFoundation fail to link. Differential Revision: https://reviews.llvm.org/D92624	2020-12-04 08:46:53 -05:00
Nico Weber	305852686b	[mac/lld] Run tests with -fatal_warnings by default This helps us catch cases where we add support for a flag but forget to remove HelpHidden from Options.td. More explicit alternative to D92455 Differential Revision: https://reviews.llvm.org/D92575	2020-12-03 21:23:47 -05:00
Sam Clegg	1bb79875e4	[lld][WebAssembly] Set memory limits correctly for PIC + shared memory Don't early return from layoutMemory in PIC mode before we have set the memory limits. This matters in particular with shared-memory + PIC because shared memories require maximum size. Secondly, when we need a maximum, but the user does not supply one, default to MAX_INT rather than 0 (defaulting to zero is completely useless and means that building with -shared didn't previously work at all without --maximum-memory, because zero is never big enough). This is part of an ongoing effort to enable dynamic linking with threads in emscripten. See https://github.com/emscripten-core/emscripten/issues/3494 Differential Revision: https://reviews.llvm.org/D92528	2020-12-03 18:14:28 -08:00
Wouter van Oortmerssen	fd65e4815c	[WebAssembly] Fixed Writer::createInitMemoryFunction to work for wasm64 Differential Revision: https://reviews.llvm.org/D92348	2020-12-03 16:20:55 -08:00
Nico Weber	32b7d0f5e1	try more to fix t.s on Windows after `7cb0a373d1`	2020-12-03 18:06:34 -05:00
Nico Weber	caa99e3f0a	try to fix t.s on Windows after `7cb0a373d1`	2020-12-03 16:42:08 -05:00
Nico Weber	7cb0a373d1	[mac/lld] Implement -t Goes well with `-why_load` to get an idea of load order. Differential Revision: https://reviews.llvm.org/D92583	2020-12-03 16:02:38 -05:00
Sam Clegg	701fa0b5ab	[lld][WebAssembly] Fix malformed output with -pie + --shared-memory The conditional guarding createInitMemoryFunction was incorrect and didn't match that guarding the creation of the associated symbol. Rather than reproduce the same conditions in multiple places we can simply use the presence of the associated symbol. Also, add an assertion that would have caught this bug. Also, add a new test for this flag combination. This is part of an ongoing effort to enable dynamic linking with threads in emscripten. See https://github.com/emscripten-core/emscripten/issues/3494 Differential Revision: https://reviews.llvm.org/D92520	2020-12-03 11:06:07 -08:00
Nico Weber	3422f3cc6e	Reland "[mac/lld] Implement -why_load". The problem was that `sym` became replaced in the call to make<ObjFile> and referring to it after that read memory that now stored a different kind of symbol (a Defined instead of a LazySymbol). Since this happens only once per archive, just copy the symbol to the stack before make<ObjFile> and read the copy instead. Originally reviewed at https://reviews.llvm.org/D92496	2020-12-03 08:35:12 -05:00
Nico Weber	ea0029f55d	Revert "[mac/lld] Implement -why_load" This reverts commit `542d3b609d`. Seems to break check-lld. Reverting while I take a look.	2020-12-02 18:57:46 -05:00
Nico Weber	542d3b609d	[mac/lld] Implement -why_load This is useful for debugging why lld loads .o files it shouldn't load. It's also useful for users of lld -- I've used ld64's version of this a few times. Differential Revision: https://reviews.llvm.org/D92496	2020-12-02 18:33:12 -05:00
Arthur Eubanks	92475f698e	[test] Make verify-invalid.ll work with legacy and new PMs	2020-12-02 09:56:18 -08:00
Nico Weber	ca634393fc	[mac/lld] Make --reproduce work with thin archives See http://reviews.llvm.org/rL268229 and http://reviews.llvm.org/rL313832 which did the same for the ELF port. Differential Revision: https://reviews.llvm.org/D92456	2020-12-02 09:48:31 -05:00
Georgii Rymar	3f5dc57fd1	[LLD][ELF] - Don't keep empty output sections which have explicit program headers. This reverts a side effect introduced in the code cleanup patch D43571: LLD started to emit empty output sections that are explicitly assigned to a segment. This patch fixes the issue by removing the !sec.phdrs.empty() special case from isDiscardable. As compensation, we add an early phdrs propagation step (see the inline comment). This is similar to one that we do in adjustSectionsAfterSorting. Differential revision: https://reviews.llvm.org/D92301	2020-12-02 11:19:21 +03:00
Nico Weber	b2f00f24a3	[mac/lld] Include archive name in diagnostics Also, for .o files, include full path as given on link command line. Before: lld: error: undefined symbol [...], referenced from sandbox_logging.o After: lld: error: undefined symbol [...], referenced from libseatbelt.a(sandbox_logging.o) Move archiveName up to InputFile so we can consistently use toString() to print InputFiles in diags, and pass it to the ObjFile ctor. This matches the ELF and COFF ports. Differential Revision: https://reviews.llvm.org/D92437	2020-12-01 23:00:25 -05:00
Nico Weber	facdededca	[mac/lld] fix typo in `07ab597bb0` that broke test on Windows	2020-12-01 20:36:49 -05:00
Eric Leese	8b8088ac6c	[lld] Use -1 as tombstone value for discarded code ranges Under existing behavior discarded functions are relocated to have the start pc 0. This causes problems when debugging as they typically overlap the first function and lldb symbol resolution frequently chooses a discarded function instead of the correct one. Using the value -1 or -2 (depending on which DWARF section we are writing) is sufficient to prevent lldb from resolving to these symbols. Reviewed By: MaskRay, yurydelendik, sbc100 Differential Revision: https://reviews.llvm.org/D91803	2020-12-01 17:06:32 -08:00
Fangrui Song	31e03a9bd9	[WebAssembly] Rename --lto-no-new-pass-manager to --no-lto-new-pass-manager In addition, disallow `-lto-new-pass-manager` (see D79371). Note: the ELF port has also adopted --no-lto-new-pass-manager Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D92422	2020-12-01 16:52:37 -08:00
Nico Weber	07ab597bb0	[lld/mac] Fix issues around thin archives - most importantly, fix a use-after-free when using thin archives, by putting the archive unique_ptr to the arena allocator. This ports D65565 to MachO - correctly demangle symbol names from archives in diagnostics - add a test for thin archives -- it finds this UaF, but only when running it under asan (it also finds the demangling fix) - make forceLoadArchive() use addFile() with a bool to have the archive loading code in fewer places. no behavior change; matches COFF port a bit better Differential Revision: https://reviews.llvm.org/D92360	2020-12-01 18:48:29 -05:00
Jez Ng	c7dbaec396	[lld-macho] Add isCodeSection() This is the same logic that ld64 uses to determine which sections contain functions. This was added so that we could determine which STABS entries should be N_FUN. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D92430	2020-12-01 15:05:21 -08:00
Jez Ng	78f6498cdc	[lld-macho] Flesh out STABS implementation This addresses a lot of the comments in {D89257}. Ideally it'd have been done in the same diff, but the commits in between make that difficult. This diff implements: * N_GSYM and N_STSYM, the STABS for global and static symbols * Has the STABS reflect the section IDs of their referent symbols * Ensures we don't fail when encountering absolute symbols or files with no debug info * Sorts STABS symbols by file to minimize the number of N_OSO entries Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D92366	2020-12-01 15:05:21 -08:00
Jez Ng	b768d57b36	[lld-macho] Add archive name and file modtime to STABS output We should also set the modtime when running LTO. That will be done in a future diff, together with support for the `-object_path_lto` flag. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D91318	2020-12-01 15:05:21 -08:00
Jez Ng	d0c4be42e3	[lld-macho] Emit empty string as first entry of string table ld64 emits string tables which start with a space and a zero byte. We match its behavior here since some tools depend on it. Similar rationale as {D89561}. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D89639	2020-12-01 15:05:20 -08:00
Jez Ng	51629abce0	[lld-macho] Emit local symbols in symtab; record metadata in LC_DYSYMTAB Symbols of the same type must be laid out contiguously: following ld64's lead, we choose to emit all local symbols first, then external symbols, and finally undefined symbols. For each symbol type, the LC_DYSYMTAB load command will record the range (start index and total number) of those symbols in the symbol table. This work was motivated by the fact that LLDB won't search for debug info if LC_DYSYMTAB says there are no local symbols (since STABS symbols are all local symbols). With this change, LLDB is now able to display the source lines at a given breakpoint when debugging our binaries. Some tests had to be updated due to local symbol names now appearing in `llvm-objdump`'s output. Reviewed By: #lld-macho, smeenai, clayborg Differential Revision: https://reviews.llvm.org/D89285	2020-12-01 15:05:20 -08:00
Jez Ng	3fcb0eeb15	[lld-macho] Emit STABS symbols for debugging, and drop debug sections Debug sections contain a large amount of data. In order not to bloat the size of the final binary, we remove them and instead emit STABS symbols for `dsymutil` and the debugger to locate their contents in the object files. With this diff, `dsymutil` is able to locate the debug info. However, we need a few more features before `lldb` is able to work well with our binaries -- e.g. having `LC_DYSYMTAB` accurately reflect the number of local symbols, emitting `LC_UUID`, and more. Those will be handled in follow-up diffs. Note also that the STABS we emit differ slightly from what ld64 does. First, we emit the path to the source file as one `N_SO` symbol instead of two. (`ld64` emits one `N_SO` for the dirname and one for the basename.) Second, we do not emit `N_BNSYM` and `N_ENSYM` STABS to mark the start and end of functions, because the `N_FUN` STABS already serve that purpose. @clayborg recommended these changes based on his knowledge of what the debugging tools look for. Additionally, this current implementation doesn't accurately reflect the size of function symbols. It uses the size of their containing sections as a proxy, but that is only accurate if `.subsections_with_symbols` is set, and if there isn't an `N_ALT_ENTRY` in that particular subsection. I think we have two options to solve this: 1. We can split up subsections by symbol even if `.subsections_with_symbols` is not set, but include constraints to ensure those subsections retain their order in the final output. This is `ld64`'s approach. 2. We could just add a `size` field to our `Symbol` class. This seems simpler, and I'm more inclined toward it, but I'm not sure if there are use cases that it doesn't handle well. As such I'm punting on the decision for now. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D89257	2020-12-01 15:05:20 -08:00
Sam Clegg	a38ed62ea8	[lld][WebAssembly] Feedback from D92038. NFC Differential Revision: https://reviews.llvm.org/D92429	2020-12-01 14:53:59 -08:00
Jez Ng	6b3eecd22a	[lld-macho] Extend PIE option handling * Enable PIE by default if targeting 10.6 or above on x86-64. (The manpage says 10.7, but that actually applies only to i386, and in general varies based on the target platform. I didn't update the manpage because listing all the different behaviors would make for a pretty long description.) * Add support for `-no_pie` * Remove `HelpHidden` from `-pie` Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D92362	2020-12-01 14:35:51 -08:00
David Blaikie	615f63e149	Revert "[FastISel] Flush local value map on ever instruction" and dependent patches This reverts commit `cf1c774d6a`. This change caused several regressions in the gdb test suite - at least a sample of which was due to line zero instructions making breakpoints un-lined. I think they're worth investigating/understanding more (& possibly addressing) before moving forward with this change. Revert "[FastISel] NFC: Clean up unnecessary bookkeeping" This reverts commit `3fd39d3694`. Revert "[FastISel] NFC: Remove obsolete -fast-isel-sink-local-values option" This reverts commit `a474657e30`. Revert "Remove static function unused after cf1c774." This reverts commit `dc35368ccf`. Revert "[lldb] Fix TestThreadStepOut.py after "Flush local value map on every instruction"" This reverts commit `53a14a47ee`.	2020-12-01 14:26:23 -08:00
Arthur Eubanks	99d82412f8	[LLD][ELF][NewPM] Add option to force legacy PM In preparation for the NPM switch. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D92417	2020-12-01 13:41:17 -08:00
Arthur Eubanks	1314a4938f	[LTO][wasm][NewPM] Allow using new pass manager for wasm LTO Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D92150	2020-12-01 12:22:40 -08:00
Fangrui Song	bb993b1d9d	[ELF][test] Fix lto/version-script2.ll	2020-12-01 10:22:33 -08:00
Arthur Eubanks	26d3aaeb3a	[LTO][NewPM] Run verifier when doing LTO This matches the legacy PM. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D92138	2020-12-01 10:14:53 -08:00
Fangrui Song	843c2b2303	[ELF] Error for undefined foo@v1 If an object file has an undefined foo@v1, we emit a dynamic symbol foo. This is incorrect if at runtime a shared object provides the non-default version foo@v1 (the undefined foo may bind to foo@@v2, for example). GNU ld issues an error for this case, even if foo@v1 is undefined weak (https://sourceware.org/bugzilla/show_bug.cgi?id=3351). This behavior makes sense because to represent an undefined foo@v1, we have to construct a Verneed entry. However, without knowing the defining filename, we cannot construct a Verneed entry (Verneed::vn_file is unavailable). This patch implements the error. Depends on D92258 Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D92260	2020-12-01 08:59:54 -08:00
Fangrui Song	941e9336d0	[ELF] Make foo@@v1 resolve undefined foo@v1 The symbol resolution rules for versioned symbols are: * foo@@v1 (default version) resolves both undefined foo and foo@v1 * foo@v1 (non-default version) resolves undefined foo@v1 Note, foo@@v1 must be defined (the assembler errors if attempting to create an undefined foo@@v1). For defined foo@@v1 in a shared object, we call `SymbolTable::addSymbol` twice, one for foo and the other for foo@v1. We don't do the same for object files, so foo@@v1 defined in one object file incorrectly does not resolve a foo@v1 reference in another object file. This patch fixes the issue by reusing the --wrap code to redirect symbols in object files. This has to be done after processing input files because foo and foo@v1 are two separate symbols if we haven't seen foo@@v1. Add a helper `Symbol::getVersionSuffix` to retrieve the optional trailing `@...` or `@@...` from the possibly truncated symbol name. Depends on D92258 Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D92259	2020-12-01 08:54:01 -08:00
Fangrui Song	a5f95887d0	[ELF][test] Add some tests for versioned symbols in object files Test the symbol resolution related to * defined foo@@v1 and foo@v1 in object files/shared objects * undefined foo@v1 * weak foo@@v1 and foo@v1 * visibility * interaction with --wrap. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D92258	2020-12-01 08:49:14 -08:00
Wei Wang	3acda91742	[Remarks][1/2] Expand remarks hotness threshold option support in more tools This is the #1 of 2 changes that make remarks hotness threshold option available in more tools. The changes also allow the threshold to sync with hotness threshold from profile summary with special value 'auto'. This change modifies the interface of lto::setupLLVMOptimizationRemarks() to accept remarks hotness threshold. Update all the tools that use it with remarks hotness threshold options: * lld: '--opt-remarks-hotness-threshold=' * llvm-lto2: '--pass-remarks-hotness-threshold=' * llvm-lto: '--lto-pass-remarks-hotness-threshold=' * gold plugin: '-plugin-opt=opt-remarks-hotness-threshold=' Differential Revision: https://reviews.llvm.org/D85809	2020-11-30 21:55:49 -08:00
Amy Huang	efd1ec0dec	Recommit "[llvm-symbolizer] Switch to using native symbolizer by default on Windows" This reverts commit `1b63177a56`.	2020-11-30 17:36:12 -08:00
Amy Huang	8cdf4920c4	[llvm-symbolizer] Fix typo in llvm-symbolizer test from a previous commit. (Commit was `00bbef2bb2`)	2020-11-30 15:08:11 -08:00
Amy Huang	00bbef2bb2	[llvm-symbolizer] Fix native symbolization on windows for inline sites. The existing code handles this correctly and I checked that the code in NativeInlineSiteSymbol also handles this correctly, but it was wrong in the NativeFunctionSymbol code. Differential Revision: https://reviews.llvm.org/D92134	2020-11-30 14:27:35 -08:00
Nico Weber	78c04fe99e	[lld/mac] Don't warn on -bundle and -execute flags They've been implemented since D87856 but since they still were HelpHidden, the driver still warned claiming they were not implemented. Remove HelpHidden. Use -fatal_warnings to test that the flags now don't warn. The test depends on D91894 and D91891 to pass. Differential Revision: https://reviews.llvm.org/D91971	2020-11-30 16:07:58 -05:00
Nico Weber	ebac710009	[lld-macho] Don't warn on non-existent system libraries Now, new mach-o lld no longer warns if the isysroot has just usr/lib and System/Library/Frameworks but is missing usr/local/lib and System/Frameworks. This matches ld64 and old mach-o lld and fixes a regression from D85992. It also fixes the only test failure in `check-lld` when running it on an M1 Mac. Differential Revision: https://reviews.llvm.org/D91891	2020-11-30 16:07:20 -05:00
Fangrui Song	589e10f858	[ELF] Don't relax R_X86_64_GOTPCRELX if addend != -4 clang may produce `movl x@GOTPCREL+4(%rip), %eax` when loading the high 32 bits of the address of a global variable in -fpic/-fpie mode. If assembled by GNU as, the fixup emits an R_X86_64_GOTPCRELX with an addend != -4. The instruction loads from the GOT entry with an offset and thus it is incorrect to relax the instruction. If assembled by the integrated assembler, we emit R_X86_64_GOTPCREL for relocations that definitely cannot be relaxed (D92114), so this patch is not needed. This patch disables the relaxation, which is compatible with the implementation in GNU ld ("Add R_X86_64_[REX_]GOTPCRELX support to gas and ld"). Reviewed By: grimar, jhenderson Differential Revision: https://reviews.llvm.org/D91993	2020-11-30 08:30:19 -08:00
Nico Weber	c0e4020c92	[lld-macho] Implement -fatal_warnings Differential Revision: https://reviews.llvm.org/D91894	2020-11-30 09:29:21 -05:00
Nico Weber	83e60f5a55	[lld/mac] Add --reproduce option This adds support for ld.lld's --reproduce / lld-link's /reproduce: flag to the MachO port. This flag can be added to a link command to make the link write a tar file containing all inputs to the link and a response file containing the link command. This can be used to reproduce the link on another machine, which is useful for sharing bug report inputs or performance test loads. Since the linker is usually called through the clang driver and adding linker flags can be a bit cumbersome, setting the env var `LLD_REPRODUCE=foo.tar` triggers the feature as well. The file response.txt in the archive can be used with `ld64.lld.darwinnew $(cat response.txt)` as long as the contents are smaller than the command-line limit, or with `ld64.lld.darwinnew @response.txt` once D92149 is in. The support in this patch is sufficient to create a tar file for Chromium's base_unittests that can link after unpacking on a different machine. Differential Revision: https://reviews.llvm.org/D92274	2020-11-30 08:40:21 -05:00
Nico Weber	d20abb1ec3	[mac/lld] Add support for response files ld64 learned about them in Xcode 12, so we should too. Differential Revision: https://reviews.llvm.org/D92149	2020-11-30 08:23:58 -05:00
Nico Weber	a0994cbe27	lld-link: Let LLD_REPRODUCE control /reproduce:, like in ld.lld Also sync help texts for the option between elf and coff ports. Decisions: - Do this even if /lldignoreenv is passed. /reproduce: does not affect the main output, and this makes the env var more convenient to use. (On the other hand, it's now possible to set this env var and forget about it, and all future builds in the same shell will be much slower. That's true for ld.lld, but posix shells have an easy way to set an env var for a single command; in cmd.exe this is not possible without contortions. Then again, lld-link runs in posix shells too.) Original patch rebased across D68378 and D68381. Differential Revision: https://reviews.llvm.org/D67707	2020-11-27 13:33:55 -05:00
Sam Clegg	48ddf5e182	[lld][WebAssembly] Ensure stub symbols always get address 0 Without this extra flag we can't distinguish between stub functions and functions that happen to have address 0 (relative to __table_base). Adding this flag bit to the base symbol class actually avoids growing the SymbolUnion struct which would not be true if we added it to the FunctionSymbol subclass (due to bitpacking). The previous approach of setting its table index to zero worked for normal static relocations but not for `-fPIC` code. See https://github.com/emscripten-core/emscripten/issues/12819 Differential Revision: https://reviews.llvm.org/D92038	2020-11-25 18:26:34 -08:00
Amy Huang	1363dfaf31	[CodeView] Avoid emitting empty debug globals subsection. In https://reviews.llvm.org/D89072 I added static const data members to the debug subsection for globals. It skipped emitting an S_CONSTANT if it didn't have a value, which meant the subsection could be empty. This patch fixes the empty subsection issue. Differential Revision: https://reviews.llvm.org/D92049	2020-11-25 16:13:32 -08:00
Paul Robinson	cf1c774d6a	[FastISel] Flush local value map on ever instruction Local values are constants or addresses that can't be folded into the instruction that uses them. FastISel materializes these in a "local value" area that always dominates the current insertion point, to try to avoid materializing these values more than once (per block). https://reviews.llvm.org/D43093 added code to sink these local value instructions to their first use, which has two beneficial effects. One, it is likely to avoid some unnecessary spills and reloads; two, it allows us to attach the debug location of the user to the local value instruction. The latter effect can improve the debugging experience for debuggers with a "set next statement" feature, such as the Visual Studio debugger and PS4 debugger, because instructions to set up constants for a given statement will be associated with the appropriate source line. There are also some constants (primarily addresses) that could be produced by no-op casts or GEP instructions; the main difference from "local value" instructions is that these are values from separate IR instructions, and therefore could have multiple users across multiple basic blocks. D43093 avoided sinking these, even though they were emitted to the same "local value" area as the other instructions. The patch comment for D43093 states: Local values may also be used by no-op casts, which adds the register to the RegFixups table. Without reversing the RegFixups map direction, we don't have enough information to sink these instructions. This patch undoes most of D43093, and instead flushes the local value map after(*) every IR instruction, using that instruction's debug location. This avoids sometimes incorrect locations used previously, and emits instructions in a more natural order. This does mean materialized values are not re-used across IR instruction boundaries; however, only about 5% of those values were reused in an experimental self-build of clang.
(*) Actually, just prior to the next instruction. It seems like it would be cleaner the other way, but I was having trouble getting that to work. Differential Revision: https://reviews.llvm.org/D91734	2020-11-25 13:05:00 -05:00
Teresa Johnson	07f234be1c	[lld] Add --no-lto-whole-program-visibility Enables overriding earlier --lto-whole-program-visibility. Variant of D91583 while discussing alternate ways to identify and handle the --export-dynamic case. Differential Revision: https://reviews.llvm.org/D92060	2020-11-24 16:46:08 -08:00
Nico Weber	11b7625833	[lld/mac] Implement basic typo correction for flags Also use "unknown flag 'flag'" instead of "unknown flag: flag" for consistency with the other ports. Differential Revision: https://reviews.llvm.org/D91970	2020-11-24 11:33:39 -05:00
Nico Weber	e16c0a9a68	clang+lld: Improve clang+ld.darwinnew.lld interaction, pass -demangle This patch: - adds an ld64.lld.darwinnew symlink for lld, to go with `f2710d4b57`, so that `clang -fuse-ld=lld.darwinnew` can be used to test new Mach-O lld while it's in bring-up. (The expectation is that we'll remove this again once new Mach-O lld is the default and only Mach-O lld.) - lets the clang driver know if the linker is lld (currently only triggered if `-fuse-ld=lld` or `-fuse-ld=lld.darwinnew` is passed). Currently only used for the next point, but could be used to implement other features that need close coordination between compiler and linker, e.g. having a diag for calling `clang++` instead of `clang` when link errors are caused by a missing C++ stdlib. - lets the clang driver pass `-demangle` to Mach-O lld (both old and new), in addition to ld64 - implements -demangle for new Mach-O lld - changes demangleItanium() to accept _Z, __Z, ___Z, ____Z prefixes (and updates one test added in D68014). Mach-O has an extra underscore for symbols, and the three (or, on Mach-O, four) underscores are used for block names. Differential Revision: https://reviews.llvm.org/D91884	2020-11-24 08:51:58 -05:00
Martin Storsjö	0b2d84fba8	[LLD] [COFF] Allow wrapping dllimported functions GNU ld doesn't seem to do this though, but it looks like a reasonable use case, is easy to implement, and was requested in https://bugs.llvm.org/show_bug.cgi?id=47384. Differential Revision: https://reviews.llvm.org/D91689	2020-11-24 10:15:20 +02:00
Amy Huang	1b63177a56	Revert "[llvm-symbolizer] Switch to using native symbolizer by default on Windows" Breaks some asan tests on the buildbot. This reverts commit `c74b427cb2`.	2020-11-23 16:29:45 -08:00
Amy Huang	c74b427cb2	[llvm-symbolizer] Switch to using native symbolizer by default on Windows llvm-symbolizer used to use the DIA SDK for symbolization on Windows; this patch switches to using native symbolization, which was implemented recently. Users can still make the symbolizer use DIA by adding the `-dia` flag in the LLVM_SYMBOLIZER_OPTS environment variable. Differential Revision: https://reviews.llvm.org/D91814	2020-11-23 15:57:08 -08:00
Sam Clegg	f7f0fe6184	[lld][WebAssembly] Convert more tests to asm format. NFC. Differential Revision: https://reviews.llvm.org/D91681	2020-11-19 16:57:00 -08:00
Sam Clegg	1827005cfc	[WebAssembly] Add support for named globals in the object format. Differential Revision: https://reviews.llvm.org/D91769	2020-11-19 00:17:22 -08:00
Nico Weber	27e73816d6	lld: Make tests depend on llvm-symbolizer after `bc98034040` Fixes test failures when building just `check-lld` in a clean build dir.	2020-11-18 11:43:44 -05:00
Georgii Rymar	9aa7898200	Reland "[lib/Support/YAMLTraits] - Don't print leading zeroes when dumping Hex8/Hex16/Hex32 types." (https://reviews.llvm.org/D90930 ). This reverts reverting commit `fc40a03323` and fixes LLD (MachO/wasm) tests that failed previously.	2020-11-18 13:08:46 +03:00

6789 Commits