llvm-project

Author	SHA1	Message	Date
Manman Ren	82751a105c	DAGCombine: clamp hi bit in APInt::getBitsSet to avoid assertion rdar://12838504 llvm-svn: 169951	2012-12-12 01:13:50 +00:00
Evan Cheng	04e5518783	Avoid using lossy load / stores for memcpy / memset expansion. e.g. f64 load / store on non-SSE2 x86 targets. llvm-svn: 169944	2012-12-12 00:42:09 +00:00
Evan Cheng	eb54240dc2	Replace TargetLowering::isIntImmLegal() with ScalarTargetTransformInfo::getIntImmCost() instead. "Legal" is a poorly defined term for something like integer immediate materialization. It is always possible to materialize an integer immediate. Whether to use it for memcpy expansion is more a "cost" conceern. llvm-svn: 169929	2012-12-11 23:26:14 +00:00
Patrik Hagglund	e98b7a0389	Revert EVT->MVT changes, r169836-169851, due to buildbot failures. llvm-svn: 169854	2012-12-11 11:14:33 +00:00
Patrik Hagglund	b31465b09b	Change RegVT in BitTestBlock and RegsForValue, to contain MVTs, instead of EVTs. llvm-svn: 169851	2012-12-11 10:24:48 +00:00
Patrik Hagglund	ad432a8e70	Change TargetLowering::getTypeForExtArgOrReturn to take and return MVTs, instead of EVTs. Accordingly, add bitsLT (and similar) to MVT. llvm-svn: 169850	2012-12-11 10:20:51 +00:00
Patrik Hagglund	d34337495e	Change a parameter of TargetLowering::getVectorTypeBreakdown to MVT, from EVT. llvm-svn: 169849	2012-12-11 10:16:19 +00:00
Patrik Hagglund	03e9628cfa	Change TargetLowering::RegisterTypeForVT to contain MVTs, instead of EVTs. llvm-svn: 169848	2012-12-11 10:09:23 +00:00
Patrik Hagglund	c50489e203	Change TargetLowering::TransformToType to contain MVTs, instead of EVTs. llvm-svn: 169847	2012-12-11 10:05:04 +00:00
Patrik Hagglund	8d2e7cf561	Change TargetLowering::findRepresentativeClass to take an MVT, instead of EVT. llvm-svn: 169845	2012-12-11 09:57:18 +00:00
Patrik Hagglund	ffb60f7c08	Change TargetLowering::getTypeToPromoteTo to take and return MVTs, instead of EVTs. llvm-svn: 169844	2012-12-11 09:54:23 +00:00
Patrik Hagglund	a970281106	Change TargetLowering::isCondCodeLegal to take an MVT, instead of EVT. llvm-svn: 169843	2012-12-11 09:51:27 +00:00
Patrik Hagglund	e3bec6365a	Change TargetLowering::getCondCodeAction to take an MVT, instead of EVT. llvm-svn: 169842	2012-12-11 09:48:14 +00:00
Patrik Hagglund	7ffcd226dd	Change TargetLowering::getTruncStoreAction to take MVTs, instead of EVTs. llvm-svn: 169841	2012-12-11 09:42:24 +00:00
Patrik Hagglund	cbc9d4d0f9	Change TargetLowering::getLoadExtAction to take an MVT, instead of EVT. llvm-svn: 169840	2012-12-11 09:39:09 +00:00
Patrik Hagglund	40e1afe970	Change TargetLowering::setTypeAction to take an MVT, instead fo EVT. llvm-svn: 169839	2012-12-11 09:32:56 +00:00
Patrik Hagglund	57b1694df1	Change TargetLowering::getRepRegClassFor to take an MVT, instead of EVT. Accordingly, change RegDefIter to contain MVTs instead of EVTs. llvm-svn: 169838	2012-12-11 09:31:43 +00:00
Patrik Hagglund	3708e548f8	Change TargetLowering::getRegClassFor to take an MVT, instead of EVT. Accordingly, add helper funtions getSimpleValueType (in parallel to getValueType) in SDValue, SDNode, and TargetLowering. This is the first, in a series of patches. llvm-svn: 169837	2012-12-11 09:10:33 +00:00
Chandler Carruth	b27041c50b	Fix a miscompile in the DAG combiner. Previously, we would incorrectly try to reduce the width of this load, and would end up transforming: (truncate (lshr (sextload i48 <ptr> as i64), 32) to i32) to (truncate (zextload i32 <ptr+4> as i64) to i32) We lost the sext attached to the load while building the narrower i32 load, and replaced it with a zext because lshr always zext's the results. Instead, bail out of this combine when there is a conflict between a sextload and a zext narrowing. The rest of the DAG combiner still optimize the code down to the proper single instruction: movswl 6(...),%eax Which is exactly what we wanted. Previously we read past the end and missed the sign extension: movl 6(...), %eax llvm-svn: 169802	2012-12-11 00:36:57 +00:00
Chad Rosier	df42cf39ab	Fall back to the selection dag isel to select tail calls. This shouldn't affect codegen for -O0 compiles as tail call markers are not emitted in unoptimized compiles. Testing with the external/internal nightly test suite reveals no change in compile time performance. Testing with -O1, -O2 and -O3 with fast-isel enabled did not cause any compile-time or execution-time failures. All tests were performed on my x86 machine. I'll monitor our arm testers to ensure no regressions occur there. In an upcoming clang patch I will be marking the objc_autoreleaseReturnValue and objc_retainAutoreleaseReturnValue as tail calls unconditionally. While it's theoretically true that this is just an optimization, it's an optimization that we very much want to happen even at -O0, or else ARC applications become substantially harder to debug. Part of rdar://12553082 llvm-svn: 169796	2012-12-11 00:18:02 +00:00
Evan Cheng	79e2ca90bc	Some enhancements for memcpy / memset inline expansion. 1. Teach it to use overlapping unaligned load / store to copy / set the trailing bytes. e.g. On 86, use two pairs of movups / movaps for 17 - 31 byte copies. 2. Use f64 for memcpy / memset on targets where i64 is not legal but f64 is. e.g. x86 and ARM. 3. When memcpy from a constant string, do not replace the load with a constant if it's not possible to materialize an integer immediate with a single instruction (required a new target hook: TLI.isIntImmLegal()). 4. Use unaligned load / stores more aggressively if target hooks indicates they are "fast". 5. Update ARM target hooks to use unaligned load / stores. e.g. vld1.8 / vst1.8. Also increase the threshold to something reasonable (8 for memset, 4 pairs for memcpy). This significantly improves Dhrystone, up to 50% on ARM iOS devices. rdar://12760078 llvm-svn: 169791	2012-12-10 23:21:26 +00:00
Eric Christopher	200dd760fa	Fix a coding style nit. llvm-svn: 169776	2012-12-10 22:00:20 +00:00
Tom Stellard	30e2aa5015	LegalizeDAG: Allow type promotion of scalar loads llvm-svn: 169773	2012-12-10 21:41:58 +00:00
Tom Stellard	b785bd776c	LegalizeDAG: Allow type promotion for scalar stores llvm-svn: 169772	2012-12-10 21:41:54 +00:00
Craig Topper	d8005db486	Teach DAG combine to handle vector add/sub with vectors of all 0s. llvm-svn: 169727	2012-12-10 08:12:29 +00:00
Craig Topper	5ea3bdd75b	Remove extra blank line. llvm-svn: 169692	2012-12-09 08:20:52 +00:00
Craig Topper	a183ddb0fe	Teach DAG combine to handle vector logical operations with vectors of all 1s or all 0s. These cases can show up when vectors are split for legalizing. Fix some tests that were dependent on these cases not being combined. llvm-svn: 169684	2012-12-08 22:49:19 +00:00
Evan Cheng	9ec512d768	Replace r169459 with something safer. Rather than having computeMaskedBits to understand target implementation of any_extend / extload, just generate zero_extend in place of any_extend for liveouts when the target knows the zero_extend will be implicit (e.g. ARM ldrb / ldrh) or folded (e.g. x86 movz). rdar://12771555 llvm-svn: 169536	2012-12-06 19:13:27 +00:00
Nadav Rotem	ac450eb59e	Fix a bug in the code that merges consecutive stores. Previously we did not check if loads that happen in between stores alias with the first store in the chain, only with the second store onwards. llvm-svn: 169516	2012-12-06 17:34:13 +00:00
Evan Cheng	5213139f48	Let targets provide hooks that compute known zero and ones for any_extend and extload's. If they are implemented as zero-extend, or implicitly zero-extend, then this can enable more demanded bits optimizations. e.g. define void @foo(i16* %ptr, i32 %a) nounwind { entry: %tmp1 = icmp ult i32 %a, 100 br i1 %tmp1, label %bb1, label %bb2 bb1: %tmp2 = load i16* %ptr, align 2 br label %bb2 bb2: %tmp3 = phi i16 [ 0, %entry ], [ %tmp2, %bb1 ] %cmp = icmp ult i16 %tmp3, 24 br i1 %cmp, label %bb3, label %exit bb3: call void @bar() nounwind br label %exit exit: ret void } This compiles to the followings before: push {lr} mov r2, #0 cmp r1, #99 bhi LBB0_2 @ BB#1: @ %bb1 ldrh r2, [r0] LBB0_2: @ %bb2 uxth r0, r2 cmp r0, #23 bhi LBB0_4 @ BB#3: @ %bb3 bl _bar LBB0_4: @ %exit pop {lr} bx lr The uxth is not needed since ldrh implicitly zero-extend the high bits. With this change it's eliminated. rdar://12771555 llvm-svn: 169459	2012-12-06 01:28:01 +00:00
Chandler Carruth	802d755533	Sort includes for all of the .h files under the 'lib' tree. These were missed in the first pass because the script didn't yet handle include guards. Note that the script is now able to handle all of these headers without manual edits. =] llvm-svn: 169224	2012-12-04 07:12:27 +00:00
Jakub Staszak	ae551a853d	Simplify code. No functionality change. llvm-svn: 169198	2012-12-04 01:00:52 +00:00
Jakub Staszak	bac8ae6506	Use dyn_cast instead of isa and cast. No functionality change. llvm-svn: 169196	2012-12-04 00:50:06 +00:00
Chandler Carruth	ed0881b2a6	Use the new script to sort the includes of every file under lib. Sooooo many of these had incorrect or strange main module includes. I have manually inspected all of these, and fixed the main module include to be the nearest plausible thing I could find. If you own or care about any of these source files, I encourage you to take some time and check that these edits were sensible. I can't have broken anything (I strictly added headers, and reordered them, never removed), but they may not be the headers you'd really like to identify as containing the API being implemented. Many forward declarations and missing includes were added to a header files to allow them to parse cleanly when included first. The main module rule does in fact have its merits. =] llvm-svn: 169131	2012-12-03 16:50:05 +00:00
Nadav Rotem	1157e1410c	Allow merging multiple store sequences on the same chain. llvm-svn: 169111	2012-12-02 17:14:09 +00:00
Justin Holewinski	edec332437	Cleanup recent addition of DAGTypeLegalizer::SplitVecOp_VSELECT llvm-svn: 168932	2012-11-29 19:42:09 +00:00
Justin Holewinski	0ac49bf846	Teach the legalizer how to handle operands for VSELECT nodes If we need to split the operand of a VSELECT, it must be the mask operand. We split the entire VSELECT operand with EXTRACT_SUBVECTOR. llvm-svn: 168883	2012-11-29 14:26:28 +00:00
Justin Holewinski	bc45119b44	Allow targets to prefer TypeSplitVector over TypePromoteInteger when computing the legalization method for vectors For some targets, it is desirable to prefer scalarizing <N x i1> instead of promoting to a larger legal type, such as <N x i32>. llvm-svn: 168882	2012-11-29 14:26:24 +00:00
Nadav Rotem	307d767177	When combining consecutive stores allow loads in between the stores, if the loads do not alias. llvm-svn: 168832	2012-11-29 00:00:08 +00:00
Craig Topper	79bd205d8c	Refactor to make helper method static. llvm-svn: 168557	2012-11-25 08:08:58 +00:00
Craig Topper	268b62288e	Remove duplicate check of LimitFloatPrecision. It was already checked earlier before IsExp10 could be set to true. llvm-svn: 168553	2012-11-25 00:48:58 +00:00
Craig Topper	8571944cf1	Factor common code out of individual if blocks into common tail. llvm-svn: 168551	2012-11-25 00:15:07 +00:00
Craig Topper	d374694b07	Remove redundant calls to getCurDebugLoc in visitIntrinsicCall. It's already called at the start of the function and captured in a local variable. llvm-svn: 168548	2012-11-24 23:05:23 +00:00
Craig Topper	d2638c1894	Refactor a bit to make some helper methods static. llvm-svn: 168546	2012-11-24 18:52:06 +00:00
Craig Topper	4a98175800	Factor some common code out of individual if blocks. llvm-svn: 168538	2012-11-24 08:22:37 +00:00
Craig Topper	bef254ab16	Refactor a bit to make some helper functions static. llvm-svn: 168524	2012-11-23 18:38:31 +00:00
Patrik Hägglund	f77cc055cd	Cleanup: Simplify loop end logic in computeRegisterProperties(). llvm-svn: 168507	2012-11-23 08:35:04 +00:00
Lang Hames	e9541c820a	llvm.fmuladd.* lowering should be checking isOperationLegalOrCustom, rather than isOperationLegal. Thanks to Craig Topper for pointing this out. llvm-svn: 168485	2012-11-22 03:31:45 +00:00
Eli Friedman	30834940ec	Mark FP_EXTEND form v2f32 to v2f64 as "expand" for ARM NEON. Patch by Pete Couperus. llvm-svn: 168240	2012-11-17 01:52:46 +00:00
Craig Topper	ed756c5fc8	Remove conditions from 'else if' that were guaranteed by preceding 'if'. llvm-svn: 168191	2012-11-16 20:01:39 +00:00

... 3 4 5 6 7 ...

5932 Commits