8 votes

Enzyme: Automatic differentiation of LLVM IR

5 comments

  1. Wulfsta

    Found this the other day and wanted to share it here - doing automatic differentiation on code that has already been processed and optimized by the compiler is an amazing idea, and appears to work quite well.
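For readers unfamiliar with the technique: Enzyme itself works as an LLVM pass over optimized IR, but the core idea of forward-mode automatic differentiation can be sketched with dual numbers, which carry a value and its derivative through every arithmetic operation. This is an illustrative sketch of the general technique, not Enzyme's mechanism or API:

```python
from dataclasses import dataclass

@dataclass
class Dual:
    val: float  # function value
    dot: float  # derivative with respect to the input

    def __add__(self, other):
        # sum rule: (u + v)' = u' + v'
        return Dual(self.val + other.val, self.dot + other.dot)

    def __mul__(self, other):
        # product rule: (u * v)' = u'v + uv'
        return Dual(self.val * other.val,
                    self.dot * other.val + self.val * other.dot)

def f(x):
    # f(x) = x*x + x, so f'(x) = 2x + 1
    return x * x + x

# Seed the input's derivative with 1.0 to differentiate with respect to it.
y = f(Dual(3.0, 1.0))
print(y.val, y.dot)  # 12.0 7.0
```

Enzyme does the analogous transformation on LLVM instructions instead of overloaded operators, which is why running it after optimization pays off: the IR it differentiates is already simplified.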

    3 votes
    1. entangledamplitude

      Depending on whether this covers ALL LLVM code, that would mean autodiff for every language that compiles through LLVM — including C/C++!

      1 vote
      1. Wulfsta

        There is actually a thread somewhere in which the creators tried it on Rust, and it appeared to work!

        Edit: see here and here.

        3 votes
  2. vektor

    Since it's as on-topic here as it's ever going to be: does anyone know of research on, or have an opinion about, the limits of automatic differentiation when applied to programs in the general case? As in, what kinds of models can be usefully trained using AD, and what is their theoretical capability in terms of computability theory? My intuition says we can't use gradients to train things that are more advanced than propositional logic; FOL already involves structure search that is considered intractable, and gradients are not (imo) useful for that. From the lens of formal grammars/automata, it seems reasonable to assume we can train FSMs, maybe even PDAs (doubtful imo), but we're definitely hosed if we want to train Turing machines. (Perhaps I should clarify/generalize: what I mean is whether it's possible, using gradients, to train a model to perform the same tasks as such an automaton, i.e. accept/reject words from the respective grammar. That means I think we cannot use AD to find a model that accepts a Type-0 grammar, but Type-3 sounds very doable. In between I'm unsure, and I would like to know for sure whether my intuition here is right.)

    Relatedly, I can't seem to decide whether to use logical calculi or grammars/automata to quantify the (theoretical) power of a model. If someone could help me square those away, that would be cool too.
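As a small aid to intuition on the Type-3 end of the question: a finite automaton's transition function can be written as matrix products, which makes word acceptance a differentiable function of the transition entries. A minimal sketch (the FSM and names here are hypothetical illustrations, not taken from any of the linked work):

```python
import numpy as np

# A "soft" FSM over the alphabet {0, 1}: one row-stochastic transition
# matrix per symbol. The acceptance score of a word is
#   start @ T[w0] @ T[w1] @ ... @ accept,
# which is differentiable in the matrix entries, so gradient methods
# apply in principle.
def acceptance(word, T, start, accept):
    state = start
    for sym in word:
        state = state @ T[sym]
    return float(state @ accept)

# Hard (0/1) matrices encoding an FSM that accepts words ending in 1,
# a regular (Type-3) language:
T = {
    0: np.array([[1.0, 0.0], [1.0, 0.0]]),  # on '0', go to state 0
    1: np.array([[0.0, 1.0], [0.0, 1.0]]),  # on '1', go to state 1
}
start = np.array([1.0, 0.0])   # begin in state 0
accept = np.array([0.0, 1.0])  # state 1 is accepting

print(acceptance([0, 1, 1], T, start, accept))  # 1.0
print(acceptance([1, 0], T, start, accept))     # 0.0
```

Relaxing the 0/1 entries to, say, softmax-parameterized matrices gives a model trainable by gradient descent whose hard limit is a finite automaton, which at least supports the intuition that the regular end of the hierarchy is gradient-friendly; how far up the hierarchy this survives is exactly the open question above.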

    3 votes
  3. arghdos

    This is awesome... that CUDA-clang support woulda saved me a year of my PhD!

    2 votes