author Chris Lattner <sabre@nondot.org> 2011-04-06 00:45:11 +0000
committer Chris Lattner <sabre@nondot.org> 2011-04-06 00:45:11 +0000
commit 1efe27eb797144ba4488e7b771fdd9611ee5cd0e
tree deed498596e7fb030800db03d4ba6a6a40c58640 /docs
parent b48c791515659c7572dea7e1d53059f58fb3121f
distribute content out to the appropriate sections
git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@128968 91177308-0d34-0410-b5e6-96231b3b80d8
Diffstat (limited to 'docs')
-rw-r--r-- docs/ReleaseNotes.html 418
1 files changed, 226 insertions, 192 deletions
diff --git a/docs/ReleaseNotes.html b/docs/ReleaseNotes.html
index b25f1ae142..719346992b 100644
--- a/docs/ReleaseNotes.html
+++ b/docs/ReleaseNotes.html
@@ -102,7 +102,7 @@ standards, fast compilation, and low memory use. Like LLVM, Clang provides a
modular, library-based architecture that makes it suitable for creating or
integrating with other development tools. Clang is considered a
production-quality compiler for C, Objective-C, C++ and Objective-C++ on x86
-(32- and 64-bit), and for darwin-arm targets.</p>
+(32- and 64-bit), and for darwin/arm targets.</p>
<p>In the LLVM 2.9 time-frame, the Clang team has made many improvements in C,
C++ and Objective-C support. C++ support is now generally rock solid, has
@@ -293,137 +293,80 @@ in this section.
<p>LLVM 2.9 includes several major new capabilities:</p>
<ul>
-<li><pre>
- last release for llvm-gcc
-TBAA: On by default in clang. Disable it with -fno-strict-aliasing.
+
+<li>
+ TBAA: On by default in clang. Disable it with -fno-strict-aliasing.
Could be more aggressive for structs.
+</li>
+
+<li>New Nvidia PTX backend, not generally useful in 2.9 though.</li>
-Triple::normalize is new, llvm triples are always stored in normalized form internally.
+<li>
+Much better debug info generated, particularly in optimized code situations.
+</li>
-Triple x86_64--mingw64 is obsoleted. Use x86_64--mingw32 instead.
+<li>
+inline asm multiple alternative constraint support.
+</li>
+
+<li>
+ New naming rules in coding standards: CodingStandards.html#ll_naming
+</li>
-MC Assembler: X86 now generates much better diagnostics for common errors,
- is much faster at matching instructions, is much more bug-compatible with
- the GAS assembler, and is now generally useful for a broad range of X86
- assembly.
+</ul>
-New Nvidia PTX backend, not generally useful in 2.9 though.
+</div>
-Much better debug info generated, particularly in optimized code situations.
+<!--=========================================================================-->
+<h2>
+<a name="coreimprovements">LLVM IR and Core Improvements</a>
+</h2>
-ARM Fast ISel
+<div class="doc_text">
+<p>LLVM IR has several new features for better support of new targets and that
+expose new optimization opportunities:</p>
-ELF MC support: on by default in clang. There are still known missing features
- for human written assembly.
+<ul>
+<li>udiv, ashr, lshr, shl now have exact and nuw/nsw bits:
+ PR8862 / LangRef.html</li>
+
+ unnamed_addr + PR8927
-X86: Reimplemented all of MMX to introduce a new LLVM IR x86_mmx type. Now
- random types like &lt;2 x i32&gt; are not iseld to mmx without emms. The
- -disable-mmx flag is gone now.
+ new 'hotpatch' attribute: LangRef.html#fnattrs
-Some basic <a href="CodeGenerator.html#mc">internals documentation</a> for MC.
-
-MC Assembler support for .file and .loc.
-
-
-inline asm multiple alternative constraint support.
-
-LoopIdiom: memset/memcpy formation and memset_pattern on darwin. Build with
- -ffreestanding or -fno-builtin if your memcpy is being compiled into infinite
- recursion.
-
+</ul>
-TargetLibraryInfo
-
-X86 support for FS/GS relative loads and stores using address space 256/257 are
- reliable now.
-
-ARM: New code placement pass.
-
-unnamed_addr + PR8927
-
-PointerTracking has been removed from mainline, moved to ClamAV.
-
-EarlyCSE pass.
-LoopInstSimplify pass.
+</div>
-- DIBuilder provides simpler interface for front ends like Clang to encode debug info in LLVM IR.
- - This interface hides implementation details (e.g. DIDerivedType, existence of compile unit etc..) that any front end should not know about.
- For example, DIFactory DebugFactory;
- Ty = DebugFactory.CreateDerivedType(DW_TAG_volatile_type,
- findRegion(TYPE_CONTEXT(type)),
- StringRef(),
- getOrCreateFile(main_input_filename),
- 0 /*line no*/,
- NodeSizeInBits(type),
- NodeAlignInBits(type),
- 0 /*offset */,
- 0 /* flags */,
- MainTy);
- can be replaced by
- DbgTy = DBuilder.createQualifiedType(DW_TAG_volatile_type, MainTy);
-DIFactory is gone now.
-
-PPC: Switched to MCInstPrinter, and MCCodeEmitter. Ready to implement support
- for directly writing out mach-o object files, but noone seems interested.
+<!--=========================================================================-->
+<h2>
+<a name="optimizer">Optimizer Improvements</a>
+</h2>
-ARM: Improved code generation for Cortex-A8 and Cortex-A9 CPUs.
-
-Scheduler now models operand latency and pipeline forwarding.
-
-Can optimize printf to iprintf when no floating point is used, for embedded
- targets with smaller iprintf implementation.
-
-error_code + libsystem + PathV2 changes
- The system_error header from C++0x was added.
- * Use if (error_code ec = function()) to check for error conditions
- from functions which return it.
- * error_code::message returns a human readable description of the error.
-
- PathV1 has been deprecated in favor of PathV2 (sorry I didn't finish
- this before the release).
- * No Path class, use a r-value convertible to a twine instead.
- * Assumes all paths are UTF-8.
-
-new macho-dump tool
+<div class="doc_text">
-Major regalloc rewrite, not on by default for 2.9 and not advised to use it.
- * New basic register allocator that can be used as a safe fallback when
- debugging. Enable with -regalloc=basic.
- * New infrastructure for live range splitting. SplitKit can break a live
- interval into smaller pieces while preserving SSA form, and SpillPlacement
- can help find the best split points. This is a work in progress so the API
- is changing quickly.
- * The inline spiller has learned to clean up after live range splitting. It
- can hoist spills out of loops, and it can eliminate redundant spills.
- Rematerialization works with live range splitting.
- * New greedy register allocator using live range splitting. This will be the
- default register allocator in the next LLVM release, but it is not turned on
- by default in 2.9.
+<p>In addition to a large array of minor performance tweaks and bug fixes, this
+release includes a few major enhancements and additions to the optimizers:</p>
-ARM: __builtin_prefetch turns into prefetch instructions.
-
-MC assembler support for 3dNow! and 3DNowA instructions.
+<ul>
+ <li>LTO has been improved to use MC for parsing inline asm and now
+ can build large programs like Firefox 4 on both OS X and Linux.</li>
-tblgen support for assembler aliases: <a
- href="CodeGenerator.html#na_instparsing">MnemonicAlias and InstAlias</a>
-LoopIndexSplit pass was removed, unmaintained.
-LiveValues, SimplifyHalfPowrLibCalls, and GEPSplitter were removed.
+ LoopIdiom: memset/memcpy formation and memset_pattern on darwin. Build with
+ -ffreestanding or -fno-builtin if your memcpy is being compiled into infinite
+ recursion.
-include/llvm/System merged into include/llvm/Support.
-
-Win32 PE-COFF support in the MC assembler has made a lot of progress in the 2.9
- timeframe, but is still not generally useful. Please see
- "http://llvm.org/bugs/showdependencytree.cgi?id=9100&amp;hide_resolved=1" for open bugs?
-
+ TargetLibraryInfo
+
+ EarlyCSE pass.
+ LoopInstSimplify pass.
+
New <a href="WritingAnLLVMPass.html#RegionPass">RegionPass</a> infrastructure
for region-based optimizations.
-MicroBlaze: major updates for aggressive delay slot filler, MC-based assembly
- printing, assembly instruction parsing, ELF .o file emission, and MC
- instruction disassembler.
-
-Countless ARM microoptimizations.
+ Can optimize printf to iprintf when no floating point is used, for embedded
+ targets with smaller iprintf implementation.
Speedups to various mid-level passes:
GVN is much faster on functions with deep dominator trees / lots of BBs.
@@ -431,18 +374,9 @@ Speedups to various mid-level passes:
more passes (so they are computed less often)
SRoA is also much faster and doesn't use DominanceFrontier.
-
-new 'hotpatch' attribute: LangRef.html#fnattrs
-
-APInt API changes, see PR5207.
-
DSE is more aggressive with stores of different types: e.g. a large store
following a small one to the same address.
-New naming rules in coding standards: CodingStandards.html#ll_naming
-
-LiveDebugVariables is a new pass that keeps track of debugging information for
- user variables that are kept in registers in optimized builds.
We now optimize various idioms for overflow detection into check of the flag
register on various CPUs, e.g.:
@@ -452,69 +386,7 @@ We now optimize various idioms for overflow detection into check of the flag
addq %rdi, %rbx
jno LBB0_2
-X86: Much better codegen for several cases using adc/sbb instead of cmovs for
- conditional increment and other idioms.
-
-MVT::Flag renamed to MVT::Glue
-
-Removed the PartialSpecialization pass, it was unmaintained and buggy.
-
-SPARC: Many improvements, including using the Y registers for multiplications
- and addition of a simple delay slot filler.
-
-
-udiv, ashr, lshr, shl now have exact and nuw/nsw bits: PR8862 / LangRef.html
-
-lib/Object and llvm-objdump
-
-
- Target Independent Code Gen:
- The pre-register-allocation (preRA) instruction scheduler models register pressure
- much more accurately in some cases. This allows the adoption of more
- aggressive scheduling heuristics.
- The X86 backend has adopted a new preRA scheduling
- mode, "list-ilp", to shorten the height of instruction schedules
- without inducing register spills.
-
- The ARM backend preRA scheduler now models machine resources at cycle
- granularity. This allows the scheduler to both accurately model
- instruction latency and avoid overcommitting functional units.
-
-
-</pre></li>
-</ul>
-
-</div>
-
-<!--=========================================================================-->
-<h2>
-<a name="coreimprovements">LLVM IR and Core Improvements</a>
-</h2>
-
-<div class="doc_text">
-<p>LLVM IR has several new features for better support of new targets and that
-expose new optimization opportunities:</p>
-
-<ul>
-</ul>
-
-</div>
-
-<!--=========================================================================-->
-<h2>
-<a name="optimizer">Optimizer Improvements</a>
-</h2>
-
-<div class="doc_text">
-
-<p>In addition to a large array of minor performance tweaks and bug fixes, this
-release includes a few major enhancements and additions to the optimizers:</p>
-
-<ul>
- <li>TBAA.</li>
- <li>LTO has been improved to use MC for parsing inline asm and now
- can build large programs like Firefox 4 on both OS X and Linux.</li>
</ul>
<!--
@@ -547,6 +419,39 @@ in.</p>
<li>MC supports the <tt>.cfi_*</tt> directives for producing DWARF
frame information, but it is still not used by CodeGen by default.</li>
<li>COFF support?</li>
+
+
+ MC Assembler: X86 now generates much better diagnostics for common errors,
+ is much faster at matching instructions, is much more bug-compatible with
+ the GAS assembler, and is now generally useful for a broad range of X86
+ assembly.
+
+
+ ELF MC support: on by default in clang. There are still known missing features
+ for human written assembly.
+
+
+ Some basic <a href="CodeGenerator.html#mc">internals documentation</a> for MC.
+
+ MC Assembler support for .file and .loc.
+
+ tblgen support for assembler aliases: <a
+ href="CodeGenerator.html#na_instparsing">MnemonicAlias and InstAlias</a>
+
+ Win32 PE-COFF support in the MC assembler has made a lot of progress in the 2.9
+ timeframe, but is still not generally useful. Please see
+ "http://llvm.org/bugs/showdependencytree.cgi?id=9100&amp;hide_resolved=1" for open bugs?
+
+
+ lib/Object and llvm-objdump
+ Experimental format independent object file manipulation library.
+ * Supports PE/COFF and ELF.
+ * llvm-nm extended to work with object files. Exactly matches
+ binutils-nm for the files I've tested.
+ * llvm-objdump added with support for disassembly (no relocations displayed).
+
+
+
</ul>
<p>For more information, please see the <a
@@ -569,7 +474,34 @@ it run faster:</p>
<ul>
<!-- SplitKit -->
-FastISel for ARM.
+
+<li>The pre-register-allocation (preRA) instruction scheduler models register
+ pressure much more accurately in some cases. This allows the adoption of more
+ aggressive scheduling heuristics.
+</li>
+
+ LiveDebugVariables is a new pass that keeps track of debugging information for
+ user variables that are kept in registers in optimized builds.
+
+
+Scheduler now models operand latency and pipeline forwarding.
+
+Major regalloc rewrite, not on by default for 2.9 and not advised to use it.
+ * New basic register allocator that can be used as a safe fallback when
+ debugging. Enable with -regalloc=basic.
+ * New infrastructure for live range splitting. SplitKit can break a live
+ interval into smaller pieces while preserving SSA form, and SpillPlacement
+ can help find the best split points. This is a work in progress so the API
+ is changing quickly.
+ * The inline spiller has learned to clean up after live range splitting. It
+ can hoist spills out of loops, and it can eliminate redundant spills.
+ Rematerialization works with live range splitting.
+ * New greedy register allocator using live range splitting. This will be the
+ default register allocator in the next LLVM release, but it is not turned on
+ by default in 2.9.
+
+
+
</ul>
</div>
@@ -583,6 +515,30 @@ FastISel for ARM.
</p>
<ul>
+<li>
+X86: Reimplemented all of MMX to introduce a new LLVM IR x86_mmx type. Now
+ random types like &lt;2 x i32&gt; are not iseld to mmx without emms. The
+ -disable-mmx flag is gone now.
+</li>
+
+ <li>
+X86 support for FS/GS relative loads and stores using address space 256/257 are
+ reliable now.
+ </li>
+
+ <li>
+X86: Much better codegen for several cases using adc/sbb instead of cmovs for
+ conditional increment and other idioms.
+ </li>
+
+ <li>
+ The X86 backend has adopted a new preRA scheduling
+ mode, "list-ilp", to shorten the height of instruction schedules
+ without inducing register spills.
+ </li>
+
+ MC assembler support for 3dNow! and 3DNowA instructions.
+
<li>Several bugs have been fixed for Windows x64 code generator.</li>
</ul>
@@ -598,9 +554,39 @@ FastISel for ARM.
</p>
<ul>
+<li>ARM Fast ISel</li>
+<li>ARM: New code placement pass.</li>
+<li>ARM: Improved code generation for Cortex-A8 and Cortex-A9 CPUs.</li>
+<li>ARM: __builtin_prefetch turns into prefetch instructions.</li>
+<li>Countless ARM microoptimizations.</li>
+
+<li> The ARM backend preRA scheduler now models machine resources at cycle
+ granularity. This allows the scheduler to both accurately model
+ instruction latency and avoid overcommitting functional units.</li>
+
+
</ul>
</div>
+
+<!--=========================================================================-->
+<h2>
+<a name="OtherTS">Other Target Specific Improvements</a>
+</h2>
+<div class="doc_text">
+<ul>
+ PPC: Switched to MCInstPrinter, and MCCodeEmitter. Ready to implement support
+ for directly writing out mach-o object files, but noone seems interested.
+
+ MicroBlaze: major updates for aggressive delay slot filler, MC-based assembly
+ printing, assembly instruction parsing, ELF .o file emission, and MC
+ instruction disassembler.
+
+ SPARC: Many improvements, including using the Y registers for multiplications
+ and addition of a simple delay slot filler.
+
+</ul>
+</div>
<!--=========================================================================-->
<h2>
@@ -614,30 +600,78 @@ on LLVM 2.8, this section lists some "gotchas" that you may run into upgrading
from the previous release.</p>
<ul>
-</ul>
+ last release for llvm-gcc
+- DIBuilder provides simpler interface for front ends like Clang to encode debug info in LLVM IR.
+ - This interface hides implementation details (e.g. DIDerivedType, existence of compile unit etc..) that any front end should not know about.
+ For example, DIFactory DebugFactory;
+ Ty = DebugFactory.CreateDerivedType(DW_TAG_volatile_type,
+ findRegion(TYPE_CONTEXT(type)),
+ StringRef(),
+ getOrCreateFile(main_input_filename),
+ 0 /*line no*/,
+ NodeSizeInBits(type),
+ NodeAlignInBits(type),
+ 0 /*offset */,
+ 0 /* flags */,
+ MainTy);
+ can be replaced by
+ DbgTy = DBuilder.createQualifiedType(DW_TAG_volatile_type, MainTy);
+DIFactory is gone now.
+
+
-<p>In addition, many APIs have changed in this release. Some of the major LLVM
-API changes are:</p>
-<ul>
+
+
+ LoopIndexSplit pass was removed, unmaintained.
+ LiveValues, SimplifyHalfPowrLibCalls, and GEPSplitter were removed.
+ Removed the PartialSpecialization pass, it was unmaintained and buggy.
+
+ DIFactory removed, use DIBuilder instead.
+
+ Triple::normalize is new, llvm triples are always stored in normalized form internally.
+
+ Triple x86_64--mingw64 is obsoleted. Use x86_64--mingw32 instead.
+
+ PointerTracking has been removed from mainline, moved to ClamAV.
+
</ul>
</div>
<!--=========================================================================-->
<h2>
-<a name="devtree_changes">Development Infrastructure Changes</a>
+<a name="api_changes">Internal API Changes</a>
</h2>
<div class="doc_text">
-<p>This section lists changes to the LLVM development infrastructure. This
-mostly impacts users who actively work on LLVM or follow development on
-mainline, but may also impact users who leverage the LLVM build infrastructure
-or are interested in LLVM qualification.</p>
+<p>In addition, many APIs have changed in this release. Some of the major
+ LLVM API changes are:</p>
<ul>
+
+ include/llvm/System merged into include/llvm/Support.
+
+
+ APInt API changes, see PR5207.
+
+ MVT::Flag renamed to MVT::Glue
+
+
+ error_code + libsystem + PathV2 changes
+ The system_error header from C++0x was added.
+ * Use if (error_code ec = function()) to check for error conditions
+ from functions which return it.
+ * error_code::message returns a human readable description of the error.
+
+ PathV1 has been deprecated in favor of PathV2 (sorry I didn't finish
+ this before the release).
+ * No Path class, use a r-value convertible to a twine instead.
+ * Assumes all paths are UTF-8.
+
+
</ul>
</div>
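
Note on the error_code idiom: the "Internal API Changes" text added by this patch recommends checking results with `if (error_code ec = function())` and reading `error_code::message` for a description. A minimal sketch of that pattern follows, assuming the C++11 `std::error_code` from `<system_error>` as a stand-in for the class LLVM shipped in 2.9. The helpers `parse_positive` and `describe` are hypothetical names made up for illustration and are not LLVM APIs.

```cpp
#include <string>
#include <system_error>

// Hypothetical helper (not an LLVM API): reports failure through an
// error_code return value instead of throwing.
std::error_code parse_positive(int input, int &result) {
  if (input <= 0)
    return std::make_error_code(std::errc::invalid_argument);
  result = input;
  return std::error_code(); // a default-constructed code means "no error"
}

// Hypothetical helper: returns an empty string on success, or the
// human-readable error message on failure.
std::string describe(int input) {
  int value = 0;
  // The variable declared in the condition converts to true only when an
  // error is set, so the body runs only on failure.
  if (std::error_code ec = parse_positive(input, value))
    return ec.message();
  return std::string();
}
```

Declaring the code inside the condition keeps `ec` scoped to the failure branch, which is presumably why the note recommends that form; the exact message text returned by `message()` depends on the platform's standard library.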