whitequark
8b1c5e3a26
llvm_ir_generator: add fast-math flags to fcmp.
...
This is allowed in 3.8.
2016-04-05 00:48:41 +00:00
whitequark
ad83b092fa
llvm_ir_generator: change !{→unconditionally_}dereferenceable.
...
Since LLVM 3.8, !dereferenceable is weaker, so we introduce
!unconditionally_dereferenceable (http://reviews.llvm.org/D18738 )
to regain its functionality.
2016-04-05 00:48:41 +00:00
whitequark
72b952e850
llvm_ir_generator: fix DICompileUnit.language.
2016-04-05 00:48:41 +00:00
whitequark
a57aabb3ea
compiler: purge generated functions from backtraces.
2016-04-02 18:29:36 +00:00
whitequark
a1e98a4437
llvm_ir_generator: add TBAA metadata for @now.
2016-04-02 18:19:14 +00:00
whitequark
b1f371e578
llvm_ir_generator: update debug info emission for LLVM 3.8.
2016-04-02 18:05:40 +00:00
whitequark
df8018f0be
compiler: emit verbose assembly via ARTIQ_DUMP_ASM.
2016-04-01 09:34:51 +00:00
whitequark
d9dd7df3b2
compiler: add analysis passes from TargetMachine.
...
This doesn't have any effect right now, but is the right thing to do.
2016-04-01 09:34:36 +00:00
whitequark
10108e6d64
compiler: mark loaded pointers as !dereferenceable.
...
Also, lower the bound for test_pulse_rate_dds, since we generate
better code for it now.
2016-03-31 12:04:38 +00:00
whitequark
2c04979727
compiler: update for LLVM 3.7.
2016-03-31 11:23:47 +00:00
whitequark
b8bd344e54
compiler: use correct data layout.
2016-03-31 11:22:29 +00:00
whitequark
8a908a7656
llvm_ir_generator: do not use 'coldcc' calling convention.
...
First, this calling convention doesn't actually exist in OR1K
and trying to use it in Asserts build causes an UNREACHABLE.
Second, I tried to introduce it and it does not appear to produce
any measurable benefit: not only OR1K has a ton of CSRs but also
it is quite hard, if not realistically impossible, to produce
the kind of register pressure that would be relieved by sparing
a few more CSRs for our exception raising function calls, since
temporaries don't have to be preserved before a noreturn call
and spilling over ten registers across an exceptional edge
is not something that the code we care about would do.
Third, it produces measurable drawbacks: it inflates code size
of check:* functions by adding spills. Of course, this could be
alleviated by making __artiq_raise coldcc as well, but what's
the point anyway?
2016-03-29 15:19:46 +00:00
whitequark
586022023b
llvm_ir_generator: mark loads as non-null where applicable.
2016-03-29 13:34:37 +00:00
whitequark
f6b391bb4e
compiler: fix ARTIQ_DUMP_ELF.
2016-03-29 13:03:49 +00:00
whitequark
a9489246a4
llvm_ir_generator: don't mark non-constant attribute loads as invariant.
...
Oops.
2016-03-29 13:03:49 +00:00
whitequark
3ed852e077
Commit missing parts of 1d8b0d46
.
2016-03-28 21:44:31 +00:00
whitequark
1038f1321f
compiler: allow specifying per-function "fast-math" flags.
...
Fixes #351 .
2016-03-28 21:44:08 +00:00
whitequark
6f5332f892
compiler: allow flagging syscalls, providing information to optimizer.
...
This also fixes a crash in test_cache introduced in 1d8b0d46
.
2016-03-28 19:56:56 +00:00
whitequark
1d8b0d46bc
compiler: mark FFI functions as ModRef=Ref using TBAA metadata.
...
Fascinatingly, the fact that you can mark call instructions with
!tbaa metadata is completely undocumented. Regardless, it is true:
a !tbaa metadata for an "immutable" type will cause
AliasAnalysis::getModRefBehavior to return OnlyReadsMemory for that
call site.
Don't bother marking loads with TBAA yet since we already place
!load.invariant on them (which is as good as the TBAA "immutable"
flag) and after that we're limited by lack of !nonnull anyway.
Also, add TBAA analysis passes in our pipeline to actually engage it.
2016-03-28 02:52:15 +00:00
whitequark
418f0a5234
compiler: mark loads of kernel constant attributes as load invariant.
...
Also, enable LICM, since it can take advantage of this.
2016-03-28 00:44:32 +00:00
whitequark
ca7463a054
compiler: do not write back kernel constant attributes.
...
Fixes #322 .
2016-03-28 00:05:57 +00:00
whitequark
f4e6b18690
compiler: implement kernel constant attributes.
...
Part of #322 .
2016-03-27 16:10:07 +00:00
whitequark
f81930ffd6
compiler: run IPSCCP.
...
This doesn't do much, only frees some registers.
2016-03-27 02:00:05 +00:00
whitequark
7213984330
compiler: raise inliner threshold to the equivalent of -O3.
2016-03-27 01:08:13 +00:00
whitequark
e75ad3d1aa
compiler: extract runtime checks into separate cold functions.
...
This reduces register pressure as well as function size, which
favorably affects the inliner.
2016-03-27 01:02:15 +00:00
whitequark
20ad7629ac
llvm_ir_generator: generate code more amenable to LLVM's GlobalOpt.
...
This exposes almost all embedded methods to inlining, with massive
gains.
2016-03-26 21:35:48 +00:00
whitequark
186a564ba8
compiler: make quoted functions independent of outer environment.
2016-03-26 20:46:45 +00:00
whitequark
f5c720c3ee
compiler: tune the LLVM optimizer pipeline ( fixes #315 ).
2016-03-26 13:20:23 +00:00
whitequark
3ee9834197
compiler: significantly increase readability of LLVM and ARTIQ IRs.
2016-03-26 12:21:02 +00:00
whitequark
8d0566661a
compiler: quote functions directly instead of going through a local.
2016-03-25 22:35:55 +00:00
whitequark
39d23793a4
transforms.llvm_ir_generator: extract class function attributes.
...
This should give LLVM more visibility.
2016-03-25 19:01:39 +00:00
whitequark
39599d4508
compiler: get rid of the GetConstructor opcode.
2016-03-25 19:01:39 +00:00
whitequark
f4ab507ab2
Bring back target print function.
2016-03-18 04:42:06 +00:00
whitequark
ac5061c205
validators.escape: track region of arguments.
...
Fixes #232 .
2016-03-18 03:08:14 +00:00
whitequark
9492464ed9
Commit missing parts of 78fa5be
.
2016-03-18 02:15:36 +00:00
whitequark
78fa5becea
compiler: refuse to embed a function from another core device.
...
Fixes #332 .
2016-03-18 02:01:14 +00:00
whitequark
82ab21dbd5
transforms.inferencer: better message for BinOpT unification.
...
Fixes #311 .
2016-03-18 01:22:01 +00:00
whitequark
73bfbe51db
compiler: reject lambdas used as kernel functions ( fixes #313 ).
2016-03-03 08:33:28 +00:00
whitequark
7e16da4a77
transforms.llvm_ir_generator: ignore assignments of None ( fixes #309 ).
2016-03-01 12:26:42 +00:00
whitequark
dc70029b91
transforms.asttyped_rewriter: set loc for ForT ( fixes #302 ).
2016-03-01 05:22:12 +00:00
whitequark
8bbffab8c8
Fix tests.
2016-02-27 13:40:37 +00:00
whitequark
e421b22953
types.TypePrinter: don't waste screen space on empty attribute lists.
2016-02-27 13:29:47 +00:00
whitequark
82a8e819ac
transforms.llvm_ir_generator: use private linkage instead of internal.
...
This reduces the size of symbol tables (internal adds an STB_LOCAL
symbol, but private doesn't).
2016-02-25 20:15:40 +00:00
whitequark
f838b8be49
compiler.embedding: cache attribute types ( fixes #276 ).
2016-02-25 19:56:45 +00:00
whitequark
d899d7307e
compiler.types: TDelay is always unifiable with self.
2016-02-25 19:56:12 +00:00
whitequark
919a49b6bc
compiler: quell excessively detailed diagnostics.
2016-02-25 19:43:52 +00:00
whitequark
a1dd909bc4
Take alignment into account during attribute writeback ( fixes #293 ).
2016-02-25 01:44:05 +00:00
whitequark
914bc9f360
artiq_run: allow running LLVM IR/bitcode files, even with trivial RPCs.
2016-02-24 22:50:45 +00:00
whitequark
da31d29897
compiler: fix ARTIQ_DUMP_*.
2016-02-24 21:43:46 +00:00
whitequark
a829b8a6fc
Commit missing parts of cf41890
.
2016-02-24 17:53:13 +00:00
whitequark
cf41890255
Correctly display backtraces that contain inlined functions.
2016-02-24 17:44:19 +00:00
whitequark
442673076f
compiler: setting ARTIQ_DUMP_* dumps to a file instead of stderr.
2016-02-24 17:44:19 +00:00
whitequark
bc81be1345
Implement dumb 'with parallel' ( #265 ).
2016-02-22 13:51:08 +00:00
whitequark
51a5910002
Rename 'with parallel' to 'with interleave' ( #265 ).
2016-02-22 13:24:43 +00:00
whitequark
9db2be2b03
compiler: only use colors in diagnostics on POSIX ( fixes #272 ).
2016-02-22 11:27:45 +00:00
whitequark
1465fe6f81
Add channel name as the first argument to rtio_log ( #206 ).
2016-02-15 21:17:54 +00:00
whitequark
74a75841f6
Add rtio_log() and make print() an RPC ( #206 ).
2016-02-15 03:56:56 +00:00
89f68f3470
style
2016-02-02 21:57:25 +01:00
whitequark
0acc86b3b3
transforms.iodelay_estimator: make diagnostics much more clear.
2016-01-27 02:10:15 +00:00
whitequark
3573a8750a
transforms.inferencer: give a suggestion on "raise Exception".
2016-01-20 03:20:25 +00:00
whitequark
5c6b1517d0
Rigorously treat builtin core device exceptions.
2016-01-19 01:45:25 +00:00
whitequark
5f0e2bf9f9
analyses.domination: all blocks dominate themselves.
2016-01-18 21:33:14 +00:00
whitequark
cc22837627
transforms.inferencer: infer a monomorphic type for slice ":"
2016-01-16 03:09:03 +00:00
whitequark
85222783ee
transforms.llvm_ir_generator: fix memoize().
2016-01-16 00:38:57 +00:00
whitequark
bed62349d2
transforms.llvm_ir_generator: i64 doesn't need sret ( fixes #228 ).
2016-01-15 23:58:11 +00:00
whitequark
127b117113
Add @host_only function decorator ( #172 ).
2016-01-15 16:42:08 +00:00
whitequark
63f7899f93
Commit missing parts of 7f914a0
.
2016-01-10 20:25:58 +00:00
whitequark
1be9e7576d
transforms.llvm_ir_generator: use byval for FFI calls where appropriate.
2016-01-10 18:31:35 +00:00
whitequark
ee1ef551bc
Fix symbolizer invocation with no addresses.
2016-01-10 18:04:10 +00:00
whitequark
d436093317
transforms.llvm_ir_generator: use sret attribute on FFI calls too.
2016-01-10 17:42:40 +00:00
whitequark
7f914a057c
transforms.llvm_ir_generator: use sret attribute.
2016-01-10 17:31:59 +00:00
whitequark
edb7423a4f
transforms.llvm_ir_generator: use getelementptr inbounds.
2016-01-10 16:36:04 +00:00
whitequark
a2a00e8b35
Set LLVM inlining threshold explicitly.
...
llvmlite's pass manager builder does not use a default when it
is not specified explicitly, unlike the C++ one.
2016-01-10 16:15:23 +00:00
whitequark
f92ce291ef
transforms.llvm_ir_generator: call devirtualized functions directly.
2016-01-10 15:54:46 +00:00
whitequark
c46fccdf4b
transforms.llvm_ir_generator: annotate invariant loads as such.
2016-01-10 15:45:34 +00:00
whitequark
b298170973
transforms.artiq_ir_generator: annotate embedded functions as internal.
2016-01-10 15:45:20 +00:00
whitequark
211af77e77
validators.escape: cache_get result lives forever.
2016-01-10 14:43:21 +00:00
whitequark
f2f1deb9db
transforms.llvm_ir_generator: quote recrusive objects correctly ( fixes #213 ).
2016-01-07 19:52:31 +00:00
whitequark
e8bba0a7d6
Commit missing parts of 6a6d7d.
2016-01-05 04:21:46 +00:00
whitequark
6a6d7dab19
transforms.artiq_ir_generator: add support for user-defined context managers.
2016-01-05 04:10:40 +00:00
whitequark
2955f9a1e0
Commit missing parts of 5baf18b
.
2016-01-05 00:16:58 +08:00
whitequark
dfbf55fed2
transforms/inferencer: add support for user-defined context manager.
2016-01-05 00:11:03 +08:00
whitequark
5baf18ba0d
transforms.inferencer: factor out _unify_attribute.
2016-01-04 22:13:05 +08:00
whitequark
05fa80818a
transforms.inferencer: make sure parallel/sequential is lone manager.
2016-01-04 21:26:03 +08:00
whitequark
3aa5acbaf3
transforms.llvm_ir_generator: don't crash if embedding is not enabled.
2016-01-03 19:07:11 +08:00
whitequark
81427af506
Remove selective attribute writeback; always synchronize all attributes.
...
The reasons are:
1. Shadow memory manipulation added ~12 instructions to TTLOut.pulse
(without inlining), and it's already barely fast enough.
2. More importantly, code such as self.ts[1] did not trigger
attribute writeback, and there seems to be no easy way to fix
that.
2016-01-03 04:57:58 +08:00
whitequark
38a99fde52
Implement selective attribute writeback using shadow memory.
2016-01-02 22:51:04 +08:00
whitequark
2e33084a5f
transforms.llvm_ir_generator: implement instrumentation for attribute writeback.
2016-01-02 06:51:30 +08:00
whitequark
5f68cc6a21
transforms.artiq_ir_generator: handle raise
in except:
with finally:
.
2016-01-02 06:51:29 +08:00
whitequark
693a364d85
transforms.artiq_ir_generator: fix typo.
2015-12-31 22:39:33 +08:00
whitequark
ff0ab736e9
Commit missing parts of 8aa34ee9
.
2015-12-31 22:36:25 +08:00
whitequark
79d020dd3a
transforms.artiq_ir_generator: handle terminated try body.
2015-12-31 22:36:25 +08:00
whitequark
8aa34ee952
compiler: don't require exceptions to inherit ARTIQException.
2015-12-31 21:54:54 +08:00
whitequark
0b69e488e6
transforms.llvm_ir_generator: compare exn typeinfo using strcmp.
...
This is necessary to correctly catch exceptions from RPCs.
2015-12-31 19:53:28 +08:00
whitequark
9ed6b54249
transforms.cfg_simplifier: remove.
...
Specifically, the following idiom broke our CFGSimplifier:
b1:
branch_if %cond, %b2, %b3
b2:
branch %b3
b3:
%v = phi [ %b2 => 0, %b3 => 1 ]
This is very annoying to properly handle in general case, and also
this (specifically %b2) is exactly what broke LocalAccessValidator
in the first place. So, remove CFGSimplifier altogether.
Instead, teach LocalAccessValidator how to look through predecessors
in order to get a location.
2015-12-31 17:07:36 +08:00
whitequark
787a1d34fe
transforms.llvm_ir_generator: don't crash when quoting over than one list.
2015-12-30 16:22:26 +08:00
whitequark
25e2824709
validators.escape: infer correct region for arguments.
2015-12-30 16:19:35 +08:00
whitequark
72b6cca9c3
transforms.cfg_simplifier: correctly update phis.
2015-12-30 16:07:14 +08:00
whitequark
2a474b7166
ir: fix incoming_{blocks,values,value_for_block}.
2015-12-30 16:06:18 +08:00
whitequark
ac3a170c8f
transforms.llvm_ir_generator: don't crash when quoting empty lists.
2015-12-30 15:47:57 +08:00