llvm - http://llvm.org

diff options

author	Chandler Carruth <chandlerc@gmail.com>	2012-01-11 08:41:08 +0000
committer	Chandler Carruth <chandlerc@gmail.com>	2012-01-11 08:41:08 +0000
commit	f103b3d1b99db0ae314c36d8b4ae71427816379e (patch)
tree	8ca75ace5c84dc52eae422ad67c51e240e331ef8 /lib/CodeGen/Analysis.cpp
parent	88c5c42c5c832f599b34a5f5f4d361b9c1eacf6c (diff)

Teach the X86 instruction selection to do some heroic transforms to

detect a pattern which can be implemented with a small 'shl' embedded in the addressing mode scale. This happens in real code as follows: unsigned x = my_accelerator_table[input >> 11]; Here we have some lookup table that we look into using the high bits of 'input'. Each entity in the table is 4-bytes, which means this implicitly gets turned into (once lowered out of a GEP): *(unsigned*)((char*)my_accelerator_table + ((input >> 11) << 2)); The shift right followed by a shift left is canonicalized to a smaller shift right and masking off the low bits. That hides the shift right which x86 has an addressing mode designed to support. We now detect masks of this form, and produce the longer shift right followed by the proper addressing mode. In addition to saving a (rather large) instruction, this also reduces stalls in Intel chips on benchmarks I've measured. In order for all of this to work, one part of the DAG needs to be canonicalized *still further* than it currently is. This involves removing pointless 'trunc' nodes between a zextload and a zext. Without that, we end up generating spurious masks and hiding the pattern. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@147936 91177308-0d34-0410-b5e6-96231b3b80d8

Diffstat (limited to 'lib/CodeGen/Analysis.cpp')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: