Compiler + other closely related crap like linker.

Linker (computing)

 0  0

Some linker related answers by Ciro Santilli:

Find UTF-8 strings with Binutils `strings` (Binutils)

 0  0

Not possible it seems:

Automatic programming

 0  0

We use the term "automatic programming" to mean "generating code from natural language".

The ultimate high level of which is of course to program with:

computer, make money

which is basically the goal of artificial general intelligence, especially according to The Employment Test definition of AGI.

The term has not always had that sense:

automatic programming has always been a euphemism for programming in a higher-level language than was then available to the programmer

sums it up.

But in the current AI boom, this is the sense that matters, so that's what we will go with.

Bibliography:

www.reddit.com/r/LocalLLaMA/comments/1d25arj/a_coding_llm_that_actually_tries_to_compile_andor/

AI code generation framework that tries to run code

 0  0

OpenAI's GPT-4-turbo can generate and run Python code if it detects that the prompt would be better answered by Python, e.g. maths

AlphaEvolve (2025)

 0  0

Blog post: deepmind.google/discover/blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/

Whitepaper: storage.googleapis.com/deepmind-media/DeepMind.com/Blog/alphaevolve-a-gemini-powered-coding-agent-for-designing-advanced-algorithms/AlphaEvolve.pdf

Basically they require users to hand-code a metric and provide a program skeleton with some parts of the code marked to be replaced, and then the system focuses on modifying the code regions in question to optimize the metric.

All the novel results they announced were in constraint satisfaction problems or optimization problem. Their results are still awesome, but it's not very different from AlphaGo style things.

AI code generation benchmark

 0  0

Bibliography:

www.reddit.com/r/LocalLLaMA/comments/1e4unuz/any_up_to_date_benchmarking_sites_for_coding_llms/

Can AI code

 0  0

Appears to be a very small number of newly created problems?

HumanEval (2021)

 0  0

The tests are present in a gzip inside the Git repo: github.com/openai/human-eval/blob/master/data/HumanEval.jsonl.gz These researchers.

To get a quick overview of the problems with jq:

jq -r '"==== \(.task_id) \(.entry_point)\n\(.prompt)"' <HumanEval.jsonl

The first two problems are:

==== HumanEval/0 has_close_elements
from typing import List


def has_close_elements(numbers: List[float], threshold: float) -> bool:
    """ Check if in given list of numbers, are any two numbers closer to each other than
    given threshold.
    >>> has_close_elements([1.0, 2.0, 3.0], 0.5)
    False
    >>> has_close_elements([1.0, 2.8, 3.0, 4.0, 5.0, 2.0], 0.3)
    True
    """

==== HumanEval/1 separate_paren_groups
from typing import List


def separate_paren_groups(paren_string: str) -> List[str]:
    """ Input to this function is a string containing multiple groups of nested parentheses. Your goal is to
    separate those group into separate strings and return the list of those.
    Separate groups are balanced (each open brace is properly closed) and not nested within each other
    Ignore any spaces in the input string.
    >>> separate_paren_groups('( ) (( )) (( )( ))')
    ['()', '(())', '(()())']
    """

so we understand that it takes as input an empty function with a docstring and you have to fill the function body.

The paper also shows that there can be other defined functions besides the one you have to implement.

BigCodeBench (2024)

 0  0

Their most interesting subset, the -hard one, appears to be present at: huggingface.co/datasets/bigcode/bigcodebench-hard in Parquet format. OMG why.

The tests make free usage of the Python standard library and other major external libraries, e.g. huggingface.co/datasets/bigcode/bigcodebench-hard/viewer/default/v0.1.0_hf?views%5B%5D=v010_hf&row=0 uses FTPlib. Kind of cool.

They even test graph plotting? huggingface.co/datasets/bigcode/bigcodebench-hard/viewer/default/v0.1.0_hf?views%5B%5D=v010_hf&row=11 How does it evaluate?

SWE-bench (2024)

 0  0

www.swebench.com/

By Princeton people.

This one aims to solve GitHub issues. It appears to contain 2,294 real-world GitHub issues and their corresponding pull requests

The dataset appears to be at: huggingface.co/datasets/princeton-nlp/SWE-bench in Parquet format.

SWE-Lancer (2025)

 0  0

Tasks from Upwork.

Lowering and raising

 0  0

Lowering means translating to a lower level representation.

Raising means translating to a higher level representation.

Decompilation is basically a synonym, or subset, of raising.

Lower (compilation)

 0  0

Decompiler

 0  0

 Tagged

Video game reverse engineering

List of compilers

 0  0

GNU Compiler Collection (gcc)

 0  0

gcc CLI option

 0  0

gcc `-save-temps`

 0  0

Saves preprocessor output and generated assembly to separate files.

LLVM

 0  0

LLVM Intermediate Representation (LLVM IR)

 0  0

Very hot stuff! It's like ISA-portable assembly, but with types! In particular it also it deals with calling conventions for us (since it is ISA-portable). TODO: isn't that exactly what C does? :-) LLVM IR vs C

Documentation: llvm.org/docs/LangRef.html

 Tagged

Quantum Intermediate Representation

LLVM IR vs C

 0  0

LLVM IR hello world

 0  0

Example: llvm/hello.ll adapted from: llvm.org/docs/LangRef.html#module-structure but without double newline.

To execute it as mentioned at github.com/dfellis/llvm-hello-world we can either use their crazy assembly interpreter, tested on Ubuntu 22.10:

sudo apt install llvm-runtime
lli hello.ll

This seems to use puts from the C standard library.

Or we can Lower it to assembly of the local machine:

sudo apt install llvm
llc hello.ll

which produces:

hello.s

and then we can assemble link and run with gcc:

gcc -o hello.out hello.s -no-pie
./hello.out

or with clang:

clang -o hello.out hello.s -no-pie
./hello.out

hello.s uses the GNU GAS format, which clang is highly compatible with, so both should work in general.

clang

 1  0

LLVM front-end for C and related language like C++ etc.

Reproducible builds

 0  0

Reproducible builds allow anyone to verify that a binary large object contains what it claims to contain!

Bibliography:

Compiler

Compiler toolchain

Linker (computing)

Binutils

Binutils utility

GNU ld

strings (Binutils)

Find UTF-8 strings with Binutils `strings` (Binutils)

Automatic programming

AI code generation framework that tries to run code

AlphaEvolve (2025)

AI code generation benchmark

Can AI code

HumanEval (2021)

BigCodeBench (2024)

SWE-bench (2024)

SWE-Lancer (2025)

Lowering and raising

Lower (compilation)

Decompiler

List of compilers

GNU Compiler Collection (gcc)

gcc CLI option

gcc `-save-temps`

LLVM

LLVM Intermediate Representation (LLVM IR)

LLVM IR vs C

LLVM IR hello world

clang

Reproducible builds

Source-to-source compiler

 Ancestors (6)

 Incoming links (4)

 Synonyms (2)

 Discussion (0)

 Articles by others on the same topic (0)

 Discussion (0)  Subscribe (1)

 Discussion (0)