<feed xmlns='http://www.w3.org/2005/Atom'>
<title>llvm/test/CodeGen/NVPTX, branch testing</title>
<subtitle>http://llvm.org</subtitle>
<id>https://git.amat.us/llvm/atom/test/CodeGen/NVPTX?h=testing</id>
<link rel='self' href='https://git.amat.us/llvm/atom/test/CodeGen/NVPTX?h=testing'/>
<link rel='alternate' type='text/html' href='https://git.amat.us/llvm/'/>
<updated>2013-03-20T00:10:32Z</updated>
<entry>
<title>Propagate DAG node ordering during type legalization and instruction selection</title>
<updated>2013-03-20T00:10:32Z</updated>
<author>
<name>Justin Holewinski</name>
<email>jholewinski@nvidia.com</email>
</author>
<published>2013-03-20T00:10:32Z</published>
<link rel='alternate' type='text/html' href='https://git.amat.us/llvm/commit/?id=d73dc544f5e37e6d58befc71e57da3c132804543'/>
<id>urn:sha1:d73dc544f5e37e6d58befc71e57da3c132804543</id>
<content type='text'>
A node's ordering is only propagated during legalization if (a) the new node does
not have an ordering (is not a CSE'd node), or (b) the new node has an ordering
that is higher than the node being legalized.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@177465 91177308-0d34-0410-b5e6-96231b3b80d8
</content>
</entry>
<entry>
<title>[NVPTX] Disable vector registers</title>
<updated>2013-02-12T14:18:49Z</updated>
<author>
<name>Justin Holewinski</name>
<email>jholewinski@nvidia.com</email>
</author>
<published>2013-02-12T14:18:49Z</published>
<link rel='alternate' type='text/html' href='https://git.amat.us/llvm/commit/?id=7eacad03efda36e09ebd96e95d7891cadaaa9087'/>
<id>urn:sha1:7eacad03efda36e09ebd96e95d7891cadaaa9087</id>
<content type='text'>
Vectors were being manually scalarized by the backend.  Instead,
let the target-independent code do all of the work.  The manual
scalarization was from a time before good target-independent support
for scalarization in LLVM. However, this forces us to specially-handle
vector loads and stores, which we can turn into PTX instructions that
produce/consume multiple operands.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@174968 91177308-0d34-0410-b5e6-96231b3b80d8
</content>
</entry>
<entry>
<title>[NVPTX] Remove NoCapture from address space conversion intrinsics. NoCapture is not valid in this case, and was causing incorrect optimizations.</title>
<updated>2013-02-11T18:56:35Z</updated>
<author>
<name>Justin Holewinski</name>
<email>jholewinski@nvidia.com</email>
</author>
<published>2013-02-11T18:56:35Z</published>
<link rel='alternate' type='text/html' href='https://git.amat.us/llvm/commit/?id=ff5adad9f39f60dc9f1caadef03b6aa7922ed168'/>
<id>urn:sha1:ff5adad9f39f60dc9f1caadef03b6aa7922ed168</id>
<content type='text'>
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@174896 91177308-0d34-0410-b5e6-96231b3b80d8
</content>
</entry>
<entry>
<title>[NVPTX] Fix crash with unnamed struct arguments</title>
<updated>2012-12-05T20:50:28Z</updated>
<author>
<name>Justin Holewinski</name>
<email>jholewinski@nvidia.com</email>
</author>
<published>2012-12-05T20:50:28Z</published>
<link rel='alternate' type='text/html' href='https://git.amat.us/llvm/commit/?id=2f1086137dbabad072c98bd4d8dab15525ca4991'/>
<id>urn:sha1:2f1086137dbabad072c98bd4d8dab15525ca4991</id>
<content type='text'>
Patch by Eric Holk

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@169418 91177308-0d34-0410-b5e6-96231b3b80d8
</content>
</entry>
<entry>
<title>Teach the legalizer how to handle operands for VSELECT nodes</title>
<updated>2012-11-29T14:26:28Z</updated>
<author>
<name>Justin Holewinski</name>
<email>jholewinski@nvidia.com</email>
</author>
<published>2012-11-29T14:26:28Z</published>
<link rel='alternate' type='text/html' href='https://git.amat.us/llvm/commit/?id=7f128ea00c5358729906a9b98f844e887a1c3d73'/>
<id>urn:sha1:7f128ea00c5358729906a9b98f844e887a1c3d73</id>
<content type='text'>
If we need to split the operand of a VSELECT, it must be the mask operand. We
split the entire VSELECT operand with EXTRACT_SUBVECTOR.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@168883 91177308-0d34-0410-b5e6-96231b3b80d8
</content>
</entry>
<entry>
<title>Allow targets to prefer TypeSplitVector over TypePromoteInteger when computing the legalization method for vectors</title>
<updated>2012-11-29T14:26:24Z</updated>
<author>
<name>Justin Holewinski</name>
<email>jholewinski@nvidia.com</email>
</author>
<published>2012-11-29T14:26:24Z</published>
<link rel='alternate' type='text/html' href='https://git.amat.us/llvm/commit/?id=3d200255d5b93344c1ab0a5ba4b47a52cfa5893e'/>
<id>urn:sha1:3d200255d5b93344c1ab0a5ba4b47a52cfa5893e</id>
<content type='text'>
For some targets, it is desirable to prefer scalarizing &lt;N x i1&gt; instead of promoting to a larger legal type, such as &lt;N x i32&gt;.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@168882 91177308-0d34-0410-b5e6-96231b3b80d8
</content>
</entry>
<entry>
<title>[NVPTX] Order global variables in def-use order before emiting them in the final assembly</title>
<updated>2012-11-16T21:03:51Z</updated>
<author>
<name>Justin Holewinski</name>
<email>jholewinski@nvidia.com</email>
</author>
<published>2012-11-16T21:03:51Z</published>
<link rel='alternate' type='text/html' href='https://git.amat.us/llvm/commit/?id=2085d00d09f4f3678a6c67da46df3a04e31b499c'/>
<id>urn:sha1:2085d00d09f4f3678a6c67da46df3a04e31b499c</id>
<content type='text'>
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@168198 91177308-0d34-0410-b5e6-96231b3b80d8
</content>
</entry>
<entry>
<title>[NVPTX] Implement custom lowering of loads/stores for i1</title>
<updated>2012-11-14T19:19:16Z</updated>
<author>
<name>Justin Holewinski</name>
<email>jholewinski@nvidia.com</email>
</author>
<published>2012-11-14T19:19:16Z</published>
<link rel='alternate' type='text/html' href='https://git.amat.us/llvm/commit/?id=a20067b5d4ea718a3f0e47adae8b231e4a2db9f6'/>
<id>urn:sha1:a20067b5d4ea718a3f0e47adae8b231e4a2db9f6</id>
<content type='text'>
Loads from i1 become loads from i8 followed by trunc
Stores to i1 become zext to i8 followed by store to i8

Fixes PR13291

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@167948 91177308-0d34-0410-b5e6-96231b3b80d8
</content>
</entry>
<entry>
<title>[NVPTX] Add more precise PTX/SM target attributes</title>
<updated>2012-11-12T03:16:43Z</updated>
<author>
<name>Justin Holewinski</name>
<email>jholewinski@nvidia.com</email>
</author>
<published>2012-11-12T03:16:43Z</published>
<link rel='alternate' type='text/html' href='https://git.amat.us/llvm/commit/?id=08e9cb46feb0c8e08e3d309a0f9fd75a04ca54fb'/>
<id>urn:sha1:08e9cb46feb0c8e08e3d309a0f9fd75a04ca54fb</id>
<content type='text'>
Each SM and PTX version is modeled as a subtarget feature/CPU. Additionally,
PTX 3.1 is added as the default PTX version to be out-of-the-box compatible
with CUDA 5.0.

Available CPUs for this target:

  sm_10 - Select the sm_10 processor.
  sm_11 - Select the sm_11 processor.
  sm_12 - Select the sm_12 processor.
  sm_13 - Select the sm_13 processor.
  sm_20 - Select the sm_20 processor.
  sm_21 - Select the sm_21 processor.
  sm_30 - Select the sm_30 processor.
  sm_35 - Select the sm_35 processor.

Available features for this target:

  ptx30 - Use PTX version 3.0.
  ptx31 - Use PTX version 3.1.
  sm_10 - Target SM 1.0.
  sm_11 - Target SM 1.1.
  sm_12 - Target SM 1.2.
  sm_13 - Target SM 1.3.
  sm_20 - Target SM 2.0.
  sm_21 - Target SM 2.1.
  sm_30 - Target SM 3.0.
  sm_35 - Target SM 3.5.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@167699 91177308-0d34-0410-b5e6-96231b3b80d8
</content>
</entry>
<entry>
<title>[NVPTX] Use ABI alignment for parameters when alignment is not specified.</title>
<updated>2012-11-09T23:50:24Z</updated>
<author>
<name>Justin Holewinski</name>
<email>justin.holewinski@gmail.com</email>
</author>
<published>2012-11-09T23:50:24Z</published>
<link rel='alternate' type='text/html' href='https://git.amat.us/llvm/commit/?id=89443ff7ae81e6a439b5b67e5dd587a450090599'/>
<id>urn:sha1:89443ff7ae81e6a439b5b67e5dd587a450090599</id>
<content type='text'>
Affects SM 2.0+.  Fixes bug 13324.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@167646 91177308-0d34-0410-b5e6-96231b3b80d8
</content>
</entry>
</feed>
