Ekaitz's tech bloghttps://ekaitz.elenq.tech/2024-03-28T00:00:00+02:00I make stuff at ElenQ Technology and I talk about itGCC 4.6.4 with RISC-V support2024-03-28T00:00:00+02:002024-03-28T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2024-03-28:/bootstrapGcc10.html<p>We built <span class="caps">GCC</span> 4.6.4 with <span class="caps">RISC</span>-V support and C++ and all that in a Debian
machine, in a VisionFive board. Here is how.</p><p>I already mentioned at the <a href="https://fosdem.org/2024/schedule/event/fosdem-2024-1755-risc-v-bootstrapping-in-guix-and-live-bootstrap/"><span class="caps">FOSDEM</span>-2024</a> that we built <span class="caps">GCC</span> 4.6.4 in
a Debian machine in real <span class="caps">RISC</span>-V hardware, but I didn’t explain the specifics.
Since then we’ve been working on many other parts and trying to package it for
Guix, which happened to be harder than we thought (more on that later).</p>
<p>Today I decided to build it again, to make sure it was still possible to do, including
<span class="caps">GCC</span>’s bootstrapping process that was giving us headaches in Guix, and to write the
process down, because I remember that the last time I tried, it failed and I was
worried. Maybe I did something I forgot? Or was my brain playing tricks on me?</p>
<p>That’s why I’m writing this down. You already know how it works. It has
happened to you, and if it didn’t it probably will.</p>
<h5>Debian</h5>
<p>Install some simple deps:</p>
<pre><code class="language-bash">$ sudo apt install build-essential git
</code></pre>
<p>Clone the repo and jump to it, to the <code>working-compiler-c++</code> tag:</p>
<pre><code class="language-bash">$ git clone https://github.com/ekaitz-zarraga/gcc.git
$ cd gcc
$ git checkout working-compiler-c++
</code></pre>
<p>In Debian the <code>riscv64-linux-gnu</code> standard library is installed in a weird
location, so we need to make the compilation process find it.</p>
<pre><code class="language-bash">$ export C_INCLUDE_PATH=/usr/include/riscv64-linux-gnu/
$ export CPLUS_INCLUDE_PATH=/usr/include/riscv64-linux-gnu/
$ export LIBRARY_PATH=/usr/lib/riscv64-linux-gnu/
</code></pre>
<p>We need to patch a couple of things, which we’ll also set up in the Guix recipe. I
decided to keep them out of the codebase because I want to keep the code
consistent with the past.</p>
<p>This first patch renames <code>struct ucontext</code> to the modern name:
<code>ucontext_t</code>. This change is only needed to make <span class="caps">GCC</span> compilable with a modern toolchain;
in other contexts you might want to keep the old name.</p>
<pre><code class="language-bash">$ sed -i 's/struct ucontext/ucontext_t/g' gcc/config/*/linux-unwind.h
</code></pre>
<p>Next, to avoid an error related to pthread, you have to apply this diff, which
just removes a pthread reference from <code>gcc/config/riscv/linux.h</code>:</p>
<pre><code class="language-diff">diff --git a/gcc/config/riscv/linux.h b/gcc/config/riscv/linux.h
index cd027813b41..d7d2b0978de 100644
--- a/gcc/config/riscv/linux.h
+++ b/gcc/config/riscv/linux.h
@@ -23,16 +23,6 @@ along with GCC; see the file COPYING3. If not see
#define GLIBC_DYNAMIC_LINKER "/lib/ld-linux-riscv" XLEN_SPEC "-" ABI_SPEC ".so.1"
-/* FIXME */
-/* Because RISC-V only has word-sized atomics, it requries libatomic where
- others do not. So link libatomic by default, as needed. */
-#undef LIB_SPEC
-#ifdef LD_AS_NEEDED_OPTION
-#define LIB_SPEC GNU_USER_TARGET_LIB_SPEC \
- " %{pthread:" LD_AS_NEEDED_OPTION LD_NO_AS_NEEDED_OPTION "}"
-#else
-#endif
-
#define LINK_SPEC "\
-melf" XLEN_SPEC "lriscv \
%{shared} \
</code></pre>
<p>Now, configure and build, classic <span class="caps">GNU</span> build-system:</p>
<pre><code class="language-bash">$ ./configure --build=riscv64-linux-gnu --enable-languages=c,c++ \
--disable-shared --disable-gomp --prefix=/data/prefix
$ make -j4
</code></pre>
<p>This should finish properly and give you a working <span class="caps">GCC</span>.</p>
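<p>As a quick sanity check (a sketch of mine, not part of the original notes), you can
compile a trivial program. Substitute the freshly built compiler,
<code>/data/prefix/bin/gcc</code> after a <code>make install</code>, for the plain <code>gcc</code> shown
here, which is just whatever compiler is on your <code>PATH</code> so the example runs anywhere:</p>
<pre><code class="language-bash"># Hypothetical smoke test: compile and run a hello-world program.
# Replace `gcc` with `/data/prefix/bin/gcc` (the --prefix above) to
# exercise the freshly built compiler instead.
cat > hello.c <<'EOF'
#include <stdio.h>
int main(void) { puts("hello"); return 0; }
EOF
gcc hello.c -o hello
./hello
</code></pre>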
<h5>Guix</h5>
<p>In the <code>riscv</code> branch you can see more work by Efraim Flashner, trying to make a
Guix package we can use later, but it’s not as easy as it looks. Guix is not
like Debian in many ways, and that makes the process a little bit harder.</p>
<p>Efraim managed to fix the package I made (see <code>guix.scm</code>) to work on i386 but
the very same package didn’t work for <span class="caps">RISC</span>-V without changes.</p>
<p>The main problem has to do with <span class="caps">GCC</span>’s bootstrapping process.</p>
<h6><span class="caps">GCC</span>’s bootstrapping process</h6>
<p>When we talk about <span class="caps">GCC</span>’s bootstrapping process we don’t mean whole-distribution
bootstrapping, which is what we are trying to achieve in this very
project, but the process the compiler uses to check itself.</p>
<p>When you build <span class="caps">GCC</span> from source, it is first built with the compiler you have on your
machine. The resulting compiler is then used to compile <span class="caps">GCC</span>’s
source code again, and that compiler builds <span class="caps">GCC</span> once more. The binaries
generated by the last two stages are compared to each other, and if they are
not identical (bit by bit) the build process is considered a failure.</p>
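<p><span class="caps">GCC</span>’s <code>make compare</code> step is, in essence, a byte-by-byte <code>cmp</code> of the stage 2
and stage 3 object files. A toy sketch of that check, with made-up file names:</p>
<pre><code class="language-bash"># Toy illustration of the stage comparison: two files standing in for the
# same object file as produced by stage 2 and stage 3. If they are not
# byte-identical, the bootstrap is considered a failure.
printf 'same bits' > stage2-foo.o
printf 'same bits' > stage3-foo.o
if cmp -s stage2-foo.o stage3-foo.o; then
    echo "bootstrap comparison OK"
else
    echo "bootstrap comparison FAILED"
fi
</code></pre>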
<p>This is giving us some headaches in Guix. We managed to make it finish, but the
last stages are slightly different, for reasons we are not yet sure about.</p>
<h5>So…</h5>
<p>The reason I built this thing in Debian again was to remind myself that it is
possible to build it right, passing the comparison step, so I could gather a
little more motivation to make it build properly in Guix.</p>
<p>Let’s see if this serves its purpose and we manage to make it soon.</p>FOSDEM and Guix Days 20242024-02-12T00:00:00+02:002024-02-12T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2024-02-12:/fosdem-2024.html<p>About my personal <span class="caps">FOSDEM</span> 2024 experience and Guix Days</p><p>This year I gave a talk at <span class="caps">FOSDEM</span>, summarizing our work on the Guix
bootstrapping for <span class="caps">RISC</span>-V, so I decided to take a couple of extra days off and
also visit the Guix Days, which were happening before <span class="caps">FOSDEM</span> itself. Let’s
do a short summary of my experience here.</p>
<h3>Guix Days</h3>
<p>I visited the Guix Days but not full-time because I wanted to spend some time
in Brussels itself, rather than being locked in a place for the whole day.</p>
<p>We had some interesting discussions about Guix. The most interesting for me was
the one on Guix governance, where we discussed how Guix is managed and how it works
internally at a social level. This discussion was especially important for me as
an external contributor, because I believe Guix has a complex but opaque
internal structure that is difficult to grasp from the outside. Being in places
like the Guix Days lets you understand it, but it’s our
responsibility to make Guix accessible to people who don’t have time to
come to these kinds of events.</p>
<p>I say all this because I’m basically that person. I don’t enjoy these kinds of
events that much, and I felt a little bit forced to be there, just to be
something more than a random string in the <span class="caps">IRC</span> chat.</p>
<p>I had the chance to be there, but it was an effort for me. I’d like it not to be
the same for others.</p>
<p>It’s not like I’m antisocial; in fact I feel I’m a very social person. But I
don’t like the politics of things, and this event, more than anything else,
felt like a political event where I had to be present just to show up.</p>
<p>This is not just a Guix problem. Most sufficiently large projects fall into this kind
of dynamic, where people who show up are better regarded than those who
don’t. It makes sense (this is how the world works), but at the same time it
doesn’t (I don’t like how the world works).</p>
<h3><span class="caps">FOSDEM</span> 2024</h3>
<p>We arrived at <span class="caps">FOSDEM</span> on Saturday. It was literally impossible to do anything
there. It was simply overcrowded. We tried to watch some talks; all were
full. We waited in a long queue to enter one, didn’t get in
in the end, and decided to leave. We had a nice day in the city instead.</p>
<p><a href="https://fosdem.org/2024/schedule/event/fosdem-2024-1755-risc-v-bootstrapping-in-guix-and-live-bootstrap/">Sunday morning I gave a talk</a>. Sunday was way better: we could walk
around and do things but we spent the morning in the Declarative and
Minimalistic Computing Devroom until my talk happened. After my talk, we
watched a couple more there and left for a very good lunch.</p>
<p>I think the talk went well, but I’ll leave further judgement to you. Feel free
to watch it and send me feedback if you’d like.</p>
<h3>My feelings</h3>
<p>On a personal level, traveling (taking flights and all that…) is mentally
exhausting for me, and it’s also expensive. I don’t feel like I will do this
often in the future, just as I didn’t in the past.</p>
<p>Also, I don’t enjoy geeky events like these that much. I don’t use Guix or any
other software as part of a tribe; I just think it’s useful. I enjoy other
kinds of social interaction more. I felt like an outsider at both events,
but I don’t really want to become anything other than that. I don’t feel
comfortable becoming “part” of anything. I believe software is not a cult,
and everyone should have the freedom to contribute and enjoy it in a purely
practical way, with no identities involved.</p>
<p>Also, I felt that people around those higher latitudes are colder: they laugh
less than I do and they don’t have the boiling blood that I have. Maybe that’s
why my talk made people laugh and react. There’s nothing wrong with that,
culture is always cool, but the cultural mismatch felt like a bit of
a barrier.</p>
<p>Apart from all that, I had the chance to visit a cool city with my significant
other and with my friends, who came to support me in my talk and enjoy a
conference. We probably didn’t enjoy the conference that much, but I
experienced being surrounded by people who love me and had a lot of fun with
them, and that’s more valuable than anything else.</p>
<p>On the other hand, I don’t need <span class="caps">FOSDEM</span> for that. I feel grateful for my people
every single day of the year.</p>
<p>Don’t expect to find me at many geek events like these, but I don’t totally
rule out showing up from time to time.</p>Guix + Zig + NSIS for the win…DOWS?2023-12-15T00:00:00+02:002023-12-15T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2023-12-15:/windows2.html<p>How I made a program for Windows and <span class="caps">GNU</span>/Linux without touching any Windows
machine. The tools and the tricks to be effective (Zig and <span class="caps">NSIS</span> for the win).</p><p>Some months later, it’s time to talk about <a href="https://ekaitz.elenq.tech/windows.html">the post I made about writing
software for windows, without windows</a>, because I put some of those things
into practice.</p>
<p>I made a simple application with one external audio library (OpenAL) and simple
networking. No <span class="caps">GUI</span> this time, but some complexity was there.</p>
<h3>The program</h3>
<p>The program I’ll discuss here is Karkarkar, a tool to read a Twitch chat out
loud in Basque. I made this for the Basque streaming community of gamers, which
is really cool but had to rely on Text-To-Speech systems for other
languages, as the main services for streamers don’t include Basque in their
offering, and the closest language, Spanish, is not good at pronouncing some Basque words.</p>
<p>Most of these people are gamers, and they use Windows, which I don’t like, but
in the end they are also people and they deserve Free Software regardless of
the Operating System they decide to (or are forced to) use.</p>
<p>I took this as a cool exercise to test all we talked about in the past, and as
a learning experience for the times I need to do something for a client that
requires Windows support or anything like that.</p>
<h4>The Text-To-Speech system</h4>
<p>I used a <span class="caps">TTS</span> library by Aholab, a research group from the Bilbao School of
Engineering, the university where I studied. They released AhoTTS (<em>ahots</em>
means <em>voice</em> in Basque) on GitHub some years ago: a working <span class="caps">TTS</span> library for
Basque and Spanish, with all the extra data it needs to work.</p>
<p>The <em>only</em> problem is that the AhoTTS codebase is a mess, with tons of
horrible decisions and hidden bugs. I had to fork the lib and make it work
more or less before adding it to the project (I won’t discuss the issues here),
but I did.</p>
<h4>Connection to the Twitch chat</h4>
<p>This simply uses <span class="caps">IRC</span> to connect to the Twitch chat. The protocol is simple, and I
only implemented an embarrassingly minimal amount of it: enough to make it kind
of work. I hope to implement more of it in the future.</p>
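<p>As a rough sketch of what “minimal” means here (the nick and channel are made
up, and this is not the program’s actual code), the whole login boils down to a
couple of <span class="caps">CRLF</span>-terminated lines; Twitch’s <span class="caps">IRC</span> gateway even accepts anonymous
read-only logins with a <code>justinfan</code> nick:</p>
<pre><code class="language-bash"># Hypothetical sketch of the minimal IRC handshake the program needs.
# IRC messages are terminated with CRLF; nick and channel are made up.
twitch_handshake() {
    printf 'NICK %s\r\nJOIN #%s\r\n' "$1" "$2"
}
twitch_handshake justinfan12345 somechannel
</code></pre>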
<h4>Playing the audio</h4>
<p>I started with <code>libao</code>, a simple audio library and then moved to OpenAL for
reasons I’ll mention next.</p>
<h4>All together</h4>
<p>The program listens to <span class="caps">IRC</span>; when a message arrives it sends it to AhoTTS,
receives the samples to play, and hands them to OpenAL, which does its magic to
play them out loud.</p>
<p>Everything is the simplest thing possible, as I didn’t have a lot of time to
spend on this and wanted to focus on the release process: being able to
make a package for Windows and some Linux distributions. <strong>Adding more code on
top of that is easy once the problem of distribution is solved.</strong></p>
<h3>The tooling</h3>
<p>My strongest dependency<sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup>, AhoTTS, is written in C++, so I decided to go for
binary distribution. And as <a href="https://ekaitz.elenq.tech/windows.html#zig">I discussed in the aforementioned post</a>,
I was looking for an excuse to make a project in Zig, and its cross-compilation
capabilities could help me in this case, so I went for it. I made everything
with Zig 0.10.1, as that was the latest version packaged for Guix, but I was later
forced to move to my own Zig 0.11.0 package (see <a href="#testing">Testing</a>).</p>
<p>For binary distribution, I didn’t have many ideas when I wrote the post, but a
person at the Lisp Game Jam of that time suggested I use <span class="caps">NSIS</span> for the
Windows installer. It was already packaged in Guix, my distro of choice, so I
just went for it, as it looked like the simplest way to solve this.</p>
<p>For testing all this, I relied on <code>wine</code>; what else could I do?</p>
<p>So tl;dr:</p>
<ul>
<li>I’m coding on <a href="https://guix.gnu.org/">Guix</a>.</li>
<li>Programming in <a href="https://ziglang.org/">Zig</a> 0.10.1 but then moved to
<a href="https://ziglang.org/">Zig</a> 0.11.0.</li>
<li>Installer using <a href="https://nsis.sourceforge.io/Main_Page"><span class="caps">NSIS</span></a>.</li>
<li>Testing done with <a href="https://www.winehq.org/">Wine64</a>.</li>
</ul>
<h3>Keeping it small</h3>
<p>First of all, publishing software for Windows from Linux is painful if you have
to compile everything for it yourself. Guix helps a little bit with that, as you can use
<code>--target=</code> with mingw and have some luck. Sadly, many packages (most of them)
need <code>bash-minimal</code>, which <a href="https://issues.guix.gnu.org/62226">is not buildable for mingw at the
moment</a>.</p>
<p>In many software projects cross-compilation is not even possible.</p>
<p>Knowing that, I decided to keep my project as small as possible, because I
wanted to actually deliver something without losing my sanity.</p>
<p>Every library you depend on, you also have to compile and deliver to your target.
It’s not that common for Windows users to install things by themselves the way a
<span class="caps">GNU</span>/Linux person would.</p>
<h4>Audio library</h4>
<p><code>libao</code> is the smallest audio library I could find, but I didn’t manage to
build it for Windows myself, so I had to rely on something else. The
OpenAL-Soft maintainers provide a binary distribution for Windows, so I decided to
go for that one instead. It’s way harder to use, and I had to do some weird
stuff to make it work, but it’s easier for me: no need to build it myself.</p>
<p>This matters even more for an audio library: interacting
with the system is always a pain in the ass, and trying to cross-compile a
library like this is not always easy, as its configure scripts are complex and
need to check for many things.</p>
<p>This is why I also avoided a <span class="caps">GUI</span> for the moment. Too much.</p>
<h4>AhoTTS</h4>
<p>AhoTTS is written in C++, has zero dependencies, and uses CMake as its build system.</p>
<p>At the beginning I cross-compiled it to mingw using Guix; because it doesn’t have
dependencies, it just worked. Later I encountered some issues though:
<code>libstdc++</code> and <code>libgcc</code> were not found at runtime and I had to statically link
them (<code>-static-libgcc -static-libstdc++</code> ftw).</p>
<p>Later I started digging into its code, fixing some <strong>horrible</strong> things inside, and
then I managed to add it as a submodule (yes, I hate that too) and compile it
directly with Zig. This is the best option so far: you can statically link the
library, it is built with the same process, and it is cross-compiled by Zig. No
more missing-library problems.</p>
<blockquote>
<p>Extra: In order to interact with AhoTTS from Zig, I had to write <a href="https://github.com/ekaitz-zarraga/karkarkar/blob/master/ahotts_c/htts.cpp">a small C++
to C bridge</a>, something I had never done before. It’s easy stuff: just simplify
the <span class="caps">API</span> a little bit, put some <code>extern "C"</code> on it and everything will go
fine. <code>void *</code> is your friend.</p>
</blockquote>
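<p>A minimal sketch of that pattern (the class and function names are made up, not
AhoTTS’s real <span class="caps">API</span>): hide the C++ object behind a <code>void *</code> handle and expose
<code>extern "C"</code> functions that Zig can call:</p>
<pre><code class="language-cpp">#include <cstdio>
#include <string>

// Stand-in for the C++ TTS engine (hypothetical, not AhoTTS's real API).
class Tts {
public:
    std::string speak(const std::string& text) { return "samples:" + text; }
};

// The C bridge: an opaque handle plus extern "C" wrappers, callable from Zig.
extern "C" {
    void* tts_new(void) { return new Tts(); }
    void  tts_free(void* h) { delete static_cast<Tts*>(h); }
    const char* tts_speak(void* h, const char* text) {
        static std::string out;  // sketch only: not thread-safe
        out = static_cast<Tts*>(h)->speak(text);
        return out.c_str();
    }
}

int main() {
    void* h = tts_new();
    std::puts(tts_speak(h, "kaixo"));
    tts_free(h);
}
</code></pre>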
<h4>The rest of it</h4>
<p>That’s just writing some actual code and making the program run. That’s the
easiest part.</p>
<h3>Bringing it to the users</h3>
<p>One thing is making something that builds and runs on your computer, and a very
different thing is making it work on other people’s computers. And this is the
main problem I wanted to discuss here.</p>
<p>I have some requirements:</p>
<ul>
<li>I made a terminal application, but I don’t expect my users to know how to run
it. I need to make something that is click and run.</li>
<li>I need it to have some icon in the desktop/startup-menu.</li>
<li>The AhoTTS library needs some extra data files. These are searched by the
library and need to have some specific structure. These need to be installed
properly, too.</li>
<li>I need to provide a simple way to uninstall the application.</li>
</ul>
<h4>Windows</h4>
<p>The cool thing about this is that I haven’t owned a Windows machine since ~2010 and I have
no access to one (living the good life!). I have no plans to change that,
nor do I have plans to really learn about Windows. So we have to be clever to
solve this.</p>
<p>First things first:</p>
<pre><code class="language-bash">zig build -Dcpu=baseline -Dtarget=x86_64-windows-gnu
</code></pre>
<p>Damn! That was actually very easy to do. Why isn’t this the norm in other languages?</p>
<p>Of course, in order to do this I needed to provide a proper <code>build.zig</code> file
that was able to find the DLLs I was linking against and the header files.
That’s not that difficult after all<sup id="fnref:dll-names"><a class="footnote-ref" href="#fn:dll-names">2</a></sup>, but it has to be done. Still,
easy. Kudos to Zig.</p>
<p>Now, knowing where Windows searches for the DLLs we depend on is important, as our
installation process will depend on it. There is <a href="https://learn.microsoft.com/en-us/windows/win32/dlls/dynamic-link-library-search-order#standard-search-order-for-unpackaged-apps">some good documentation for
that</a>, with a very interesting point:</p>
<blockquote>
<p>Standard search order for unpackaged apps:<br>
…<br>
7. The folder from which the application loaded.</p>
</blockquote>
<p>This seems good enough for my purpose, so I can just install my binary and the
libraries in the very same folder. I thought I would need to install stuff
scattered somewhere else and learn a lot about Windows, but I didn’t have to. Simple!</p>
<p>The extra data can be stored anywhere I want, because I am the one who
controls the search algorithm, but I still need easy access to it. I
decided to go for <code>LOCALAPPDATA</code>, but I’m thinking of putting it in the same
folder as the rest of the program. At the time of writing, that’s not done yet.</p>
<h5><span class="caps">NSIS</span></h5>
<p>Once it’s clear where everything should be installed, it’s time to
actually install it.</p>
<p><span class="caps">NSIS</span> is a great tool. The look of the vanilla installer is old-school, but I
didn’t even bother to use the modern interface because that required me to
think, an activity I like to reserve for special occasions (like when I’m paid
extreme amounts of money for it, or the time I’m in bed before falling asleep).</p>
<p><span class="caps">NSIS</span> in a nutshell: you write a script, run <code>makensis</code> with the script as an
input and, boom! You have a <code>.exe</code> that installs your stuff.</p>
<p>It took me a little while to understand the structure of the script, but once
you learn it, it is really easy (for the basic things; after that you can go as
hard as you want). There are two concepts you need to understand: Pages and Sections.</p>
<ul>
<li>Pages: define the different <em>pages</em> the user will navigate through during the
process. There are many pages pre-defined, and they cover all the basic
functionality: license agreement, component selection, installation directory selection…</li>
<li>Sections: define the different parts of your installation process. You can
mark some as optional or include them in different installation profiles
(all, minimal, recommended… you’ve seen this before). Then, if you use the
<code>components</code> page, the sections will be listed so the user can choose which
ones they want to install. You can also add sections for the uninstaller,
which will only run when the user uninstalls the program.</li>
</ul>
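<p>A minimal sketch of how the two concepts fit together (file names and paths
here are illustrative, not my actual installer script):</p>
<pre><code class="language-nsis">; Hypothetical minimal installer: three predefined pages, two sections.
Name "Karkarkar"
OutFile "karkarkar-installer.exe"
InstallDir "$PROGRAMFILES64\Karkarkar"

Page directory    ; let the user pick the installation folder
Page components   ; list the sections so the user can choose
Page instfiles    ; run the actual installation

Section "Program"
  SetOutPath $INSTDIR
  File "karkarkar.exe"
  File "OpenAL32.dll"   ; DLLs next to the .exe (see the search-order trick above)
SectionEnd

Section /o "Sources"    ; /o: optional, unchecked by default
  SetOutPath "$INSTDIR\src"
  File /r "src\*"
SectionEnd
</code></pre>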
<blockquote>
<p><span class="caps">EXAMPLE</span>: I decided to also include the sources in the installer, but they are
not required to run the program. It’s as simple as adding a Section with the
<code>/o</code> flag, and it won’t be automatically checked in the components step.
Really cool stuff!</p>
</blockquote>
<p>The rest of it is just commands you can read about in the documentation. You
can do many, many things with it. It has built-in variables for almost
anything you’ll need, so you don’t need to hardcode things in the script.</p>
<p>In my case I decided to ask the user for some configuration (the Twitch username)
during the installation using a custom page (this requires some digging in the
plugins’ documentation, but it’s not hard either), and created a launcher that
automagically inserts the username in the call to the program (not great for
later reconfiguration, I know). This is done with a Batch file, which also does
the dirty job of opening the terminal when it’s double-clicked.</p>
<p>Here’s the installer script I did, if you want to read it:</p>
<p><a href="https://github.com/ekaitz-zarraga/karkarkar/blob/master/windows/installer.nsi">https://github.com/ekaitz-zarraga/karkarkar/blob/master/windows/installer.nsi</a></p>
<h4><span class="caps">GNU</span>/Linux</h4>
<p>The <span class="caps">GNU</span>/Linux world is really diverse, so it’s not easy to know every single
system’s requirements. For the moment I stayed with Guix and Debian, because they
are the only distros I use and the ones I’m most familiar with.</p>
<p>On Windows I was asking the user for their username during the installation
process, but on Linux I don’t have any simple (for the user) way to do it, so
for the moment I ask for the username when the program is called with no input
arguments. Ugly, but it works for now. The goal was to deliver this thing,
not to make it perfect. I can do that later.</p>
<p>The cool part is that they allow different approaches to packaging: source vs.
binary distribution.</p>
<h5><span class="caps">XDG</span> standard: Desktop file and icons</h5>
<p>We need, of course, to arrange the same things we arranged in Windows: the
desktop icon and launching the terminal automatically. That’s not hard to do
using <span class="caps">XDG</span> specification!</p>
<p>Just add the <code>Terminal=true</code> line and it should open a terminal emulator when
clicked<sup id="fnref:should"><a class="footnote-ref" href="#fn:should">3</a></sup>. The <code>Exec=</code> line in the desktop file holds the program you want
to run, and it has to point to it correctly.</p>
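<p>A minimal sketch of such a <code>.desktop</code> file (paths and names are illustrative):</p>
<pre><code class="language-ini">[Desktop Entry]
Type=Application
Name=Karkarkar
Comment=Listen to a Twitch chat in Basque
Exec=/usr/bin/karkarkar
Icon=karkarkar
Terminal=true
Categories=AudioVideo;
</code></pre>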
<p>Once the <code>.desktop</code> file is done, we need to deal with the icons. I
went for just an <span class="caps">SVG</span> icon, but I could add the rest. The only thing I needed to
do was put everything in the correct folder. Something like this:</p>
<pre><code class="language-bash"> usr
└── share
├── applications
│ └── karkarkar.desktop
└── icons
└── hicolor
└── scalable
└── apps
├── karkarkar.svg
└── karkarkar-symbolic.svg
</code></pre>
<p>Both Guix and Debian detect them properly once they are installed in the folder
where they expect them.</p>
<h5>Guix</h5>
<p>I wrote quite a few Guix packages lately so I’m pretty comfortable with this.</p>
<p>I just need to tell Guix how it should build the program and put some files in
the proper directory.</p>
<p>Also, I added the <code>zig-build-system</code> to Guix myself not that long ago. It’s
pretty straightforward to use.</p>
<p>For the icons and the desktop file, everything needs to go in the correct place,
<code>#$output/share/...</code>, and the desktop file has to be patched to point to the correct
binary, <code>#$output/bin/...</code> that is. For this, I kept a desktop file as a
template, with some reasonable defaults, and just patched it in the Guix
package. That’s easy.</p>
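<p>As a hedged sketch of what that patching phase can look like (names and paths
are illustrative, not the actual <code>guix.scm</code>), inside the package’s
<code>modify-phases</code>:</p>
<pre><code class="language-scheme">;; Install the template desktop file and point Exec= at the store path.
(add-after 'install 'install-desktop-file
  (lambda _
    (let ((apps (string-append #$output "/share/applications")))
      (mkdir-p apps)
      (copy-file "linux/karkarkar.desktop"
                 (string-append apps "/karkarkar.desktop"))
      (substitute* (string-append apps "/karkarkar.desktop")
        (("Exec=.*")
         (string-append "Exec=" #$output "/bin/karkarkar\n"))))))
</code></pre>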
<p>Not the best Guix package ever, but it simply works, and that’s all I
want at this point.</p>
<h5>Debian</h5>
<p>Debian packages can be exported from Guix package definitions, but that exports
the file structure with every single dependency. In our case, that meant
hundreds of megabytes. That’s too much, so I did it manually, and the final size
was around 10 megabytes. Not bad.</p>
<p>I never made a Debian package before, and I did the most minimalistic thing I
could think of.</p>
<p>First, we need to build the thing:</p>
<pre><code class="language-bash">zig build -Dcpu=baseline
</code></pre>
<p>Then just place everything in a folder and call <code>dpkg-deb</code> on top
of it. In order to do that, I wrote a bash script that does this whole
thing; it explains what I did way better than I can write in English:</p>
<pre><code class="language-bash"># Run me in the linux/ folder
# I need the version in an argument, which should be Major.Minor-Revision
# for example 0.1-3.
version=$1
outfolder="karkarkar-$version"
mkdir "$outfolder"
mkdir -p "$outfolder/usr/bin"
mkdir -p "$outfolder/usr/share/applications"
mkdir -p "$outfolder/usr/share/AhoTTS/"
mkdir -p "$outfolder/usr/share/icons/hicolor/scalable/apps"
cp -r "../AhoTTS/data_tts" "$outfolder/usr/share/AhoTTS/"
cp "karkarkar.desktop" "$outfolder/usr/share/applications"
cp "../icons/karkarkar.svg" "$outfolder/usr/share/icons/hicolor/scalable/apps"
cp "../zig-out/bin/karkarkar" "$outfolder/usr/bin/"
patchelf --set-interpreter "/lib64/ld-linux-x86-64.so.2" "$outfolder/usr/bin/karkarkar"
mkdir -p "$outfolder/DEBIAN"
cat > $outfolder/DEBIAN/control <<EOF
Package: karkarkar
Version: $version
Section: base
Priority: optional
Architecture: amd64
Depends: libopenal1 (>=1.19.1)
Maintainer: Ekaitz Zarraga <blablablah>
Description: Karkarkar
Listen to a Twitch chat in Basque.
EOF
dpkg-deb --root-owner-group --build "$outfolder"
rm -rf "$outfolder"
</code></pre>
<p>I need to highlight here that the <code>zig build</code> I did automatically adds the Guix
dynamic linker to the binary, but that is not where the dynamic linker lives in
Debian. I decided to patch the binary (<code>patchelf</code>) instead of trying to
configure the compilation process; I thought this would be easier. I don’t know
if it was easier or not, but it was easy, so that’s OK.</p>
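<p>To see the field <code>patchelf</code> rewrites, you can ask <code>readelf</code> for the requested
program interpreter. Shown here on a copy of a system binary; in my case the
original binary requested the Guix store path instead:</p>
<pre><code class="language-bash"># Inspect which dynamic linker an ELF binary requests; this is the field
# that patchelf --set-interpreter rewrites. /bin/ls is just a handy example.
cp /bin/ls demo-binary
readelf -l demo-binary | grep interpreter
</code></pre>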
<p>Also note that the <code>DEBIAN/control</code> file has the bare minimum fields, but it’s
enough to work. In Debian, everything is installed in <code>/usr/whatever</code>, but
that’s the only detail I changed.</p>
<p>Something I want to write down to remember later: Debian packages (<code>.deb</code>
files) can be extracted with <code>ar -x</code>, and they have a couple of <code>tar.xz</code> files
inside. The <code>data.tar.xz</code> file has the file structure that will later be installed
on the system.</p>
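<p>The layout can be reproduced with a dummy archive (the member contents here are
empty placeholders; a real <code>.deb</code> is produced by <code>dpkg-deb</code> as above):</p>
<pre><code class="language-bash"># A .deb is an `ar` archive with a fixed set of members. Build a dummy
# one with the same member names, then list and extract it.
echo '2.0' > debian-binary
tar -cJf control.tar.xz -T /dev/null   # empty stand-in archives
tar -cJf data.tar.xz -T /dev/null
ar rc dummy.deb debian-binary control.tar.xz data.tar.xz
ar t dummy.deb    # lists the members
ar x dummy.deb    # extracts them, as you would with a real .deb
</code></pre>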
<h3 id="testing">Testing</h3>
<p>Yeah, you have to do it too. I tested it in Wine, but it still needed to be tested
on Windows. I had a working version done in Zig 0.10.0 that was running well in
Wine, but it exploded on Windows because of <a href="https://github.com/ziglang/zig/issues/8943">this issue</a>, which didn’t
happen in Wine. I needed to use Zig 0.11 because of this error, which wasn’t a
big deal anyway, because I already had it more or less packaged.</p>
<p>So, yes, you have to test on Windows just in case, if you can. I have to thank
my friend (you know who you are!) for testing this program on his computer and
reporting the error.</p>
<h3>Conclusions</h3>
<p>The program itself is really underdeveloped, but I actually made a lot of
progress in understanding how to make a program reach users on different systems
easily. In the end, <strong>being a solo developer forces me to be clever, and do
everything as simply as I can</strong>.</p>
<p>I don’t normally have many dependencies, because I believe software has to be
simple and tailor-made. This lets me control every aspect of the projects I do,
but it also greatly simplifies distribution. That’s my context.</p>
<p>I already talked about this in the <a href="https://ekaitz.elenq.tech/windows.html">previous post on the subject</a>, but in
the end, being able to do this frees me from the “Web is cross-platform”
mentality, which I don’t think is a silver bullet, even though in some cases it
might be a simple solution.</p>
<p>I think developers today tend to avoid making native applications, and software
quality suffers from that. There are many reasons for this (corporate control,
subscription systems, easy deployments, controlled environments…), but one
might be that the code people are used to writing has too many dependencies, and
it is hard to package and distribute. I don’t usually have that problem; I
already said I don’t like having many dependencies.</p>
<p>Now that I have an easy way to do this, I can simply focus on writing the actual
code, which is, in the end, the most important part.</p>
<p>Key points:</p>
<ul>
<li>
<p><strong>Zig</strong> happened to make this process really simple, just because the
developers decided to make it simple and they had the engineering skill to
back that decision.</p>
</li>
<li>
<p><strong><span class="caps">NSIS</span></strong> gave me all I needed to put my application on a Windows machine
without learning almost anything about Windows. I have just enough information
to make the thing work, no more. <strong><span class="caps">NSIS</span></strong> was the key I was missing.</p>
</li>
<li>
<p><strong>Guix</strong> lets me cross-compile some of the dependencies to mingw, and I did
that at the beginning (and it worked!) so it’s a powerful tool even for that.</p>
</li>
<li>
<p><strong>Wine</strong> is a good way to check that everything works, but I found some
discrepancies between it and Windows.</p>
</li>
</ul>
<p>The full problem is not completely solved yet. Finding a <strong><span class="caps">GUI</span> toolkit</strong> I can
cross-compile easily is still on the <span class="caps">TODO</span> list. But I’m pretty satisfied with
the result so far.</p>
<blockquote>
<p>My colleague Andrius Štikonas talked to me about
<a href="https://github.com/mxe/mxe"><span class="caps">MXE</span></a>. I may try it in the future but I don’t
like the fact that it downloads things by itself. I leave it here just in
case I need it in the future.</p>
</blockquote>
<p>In fact, I think I can try other programming languages, even interpreted ones.
<strong><span class="caps">NSIS</span></strong> is surely capable of dealing with them. This opens a whole world of
possibilities for me.</p>
<p>In the end, this has been a very interesting process. Making applications
targeted at the non-programmer (and non-GNU/Linux) user has always been on my
mind. Now I can say I have almost solved the problem, and I have a base to
build on in the future.</p>
<p>Finally, the program is released in a very alpha stage in
<a href="https://ekaitz-zarraga.itch.io/karkarkar">itch.io</a> where you can find the
installers and packages and the code is hosted in
<a href="https://github.com/ekaitz-zarraga/karkarkar/">Github</a> until I get completely
mad and delete my account entirely (which I have no doubt will happen one day
or another).</p>
<p>And that’s mostly it, now I can focus on extending the behavior.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>There are not that many Basque <span class="caps">TTS</span> systems out there… You know? <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:dll-names">
<p>I still had weird issues with this. In Zig 0.10.1, DLLs were
found even if their names started with <code>lib</code>, but after moving to Zig
0.11.0, libraries starting with <code>lib</code> were no longer found. Wine did find
them, though, so I didn’t know what to do. When I added AhoTTS as a submodule
the problem disappeared, but not because it was solved: it was just avoided. <a class="footnote-backref" href="#fnref:dll-names" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:should">
<p><em><span class="dquo">“</span>Should”</em> is the best word choice here, because it doesn’t work for
some people, like in <span class="caps">GNOME</span>, as the terminals are hardcoded:<br>
<a href="https://gitlab.gnome.org/GNOME/glib/-/blob/main/gio/gdesktopappinfo.c#L2685">https://gitlab.gnome.org/<span class="caps">GNOME</span>/glib/-/blob/main/gio/gdesktopappinfo.c#L2685</a> <a class="footnote-backref" href="#fnref:should" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
</ol>
</div>Bye Protonmail2023-11-21T00:00:00+02:002023-11-21T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2023-11-21:/bye-protonmail.html<p>I left Protonmail. Here is why. I still like them to some degree though.</p><p>The other day in the fediverse a friend of mine asked me about Protonmail.
I explained my feelings a little and Protonmail jumped in, which made me
explain further. I think the conversation is interesting enough to
share here<sup id="fnref:deleted"><a class="footnote-ref" href="#fn:deleted">1</a></sup>.</p>
<section class="masto-thread">
<article class="masto-toot">
<a href="https://mastodon.social/@ekaitz_zarraga"> Ekaitz Zárraga 👹 </a>
<p>I really like <code>@protonmail</code> but they are always getting in the way
with their non-standard things and their bridge which is <span class="caps">FULL</span> of dependencies
and it’s impossible to package for some systems.</p>
<p>They are pushing me away from them too hard…</p>
<p>I won’t be surprised if they finally push me away from their service in the mid
term… after many years of trusting them for my business and personal email…
It’s a real shame.</p>
</article>
<article class="masto-toot">
<a href="https://mastodon.social/@protonmail"> Proton Mail</a>
<p><code>@ekaitz_zarraga</code> Can you let us know what kind of dependencies you’re
referring to?</p>
</article>
<article class="masto-toot">
<a href="https://mastodon.social/@ekaitz_zarraga"> Ekaitz Zárraga 👹 </a>
<p><code>@protonmail</code> the Proton bridge has many dependencies:<br>
<a href="https://github.com/ProtonMail/proton-bridge/blob/master/go.mod"> https://github.com/ProtonMail/proton-bridge/blob/master/go.mod</a></p>
<p>Packaging all of them for a distribution is a huge effort. I don’t think you
are really aware of the level of work it requires. Also, you have a .deb and a
.rpm package, which are precompiled… forcing your users to trust those.</p>
<p>My distro and my work are focused on reproducible builds and
bootstrappability… some serious concerns you don’t take in account.</p>
</article>
<article class="masto-toot">
<a href="https://mastodon.social/@ekaitz_zarraga"> Ekaitz Zárraga 👹 </a>
<p><code>@protonmail</code> Also, don’t get me wrong. I love protonmail and its
ideas but I think you are too focused on “normal” users and breaking other
people’s setups without giving much in exchange. I feel like a second class
user in protonmail, as my distro doesn’t support .deb or .rpm packages… and I
need to use plain text email pretty often (which you don’t really support in
the web either).</p>
</article>
<article class="masto-toot">
<a href="https://mastodon.social/@ekaitz_zarraga"> Ekaitz Zárraga 👹 </a>
<p><code>@protonmail</code> I love protonmail, and I’d love to fix these issues, I
would even make a reproducible bridge for you if you ask me to. But I don’t
have the energy to do it by myself. It’s simply not possible to package.</p>
<p>So, here we are. As much as I’d like to continue to work with you and support
you, I don’t feel I can do it anymore</p>
</article>
</section>
<style>
.masto-toot {
border: 2px solid var(--border-color);
border-radius: 10px;
margin: 1rem;
padding: 1rem;
}
</style>
<p>Not long after, I simply moved my email out of Protonmail to a different
platform: a Europe-based email provider whose interaction is based on
standards. Standards I can use with <strong>any</strong> setup, on <strong>any</strong> machine.</p>
<p>I wouldn’t mind having a non-standard solution if the Protonmail Bridge
application worked for me, but they only provide <code>.deb</code> and <code>.rpm</code> packages. I
can’t package the app myself, because it has too many dependencies to do so in
an acceptable amount of time.</p>
<p>Also, the Web client is getting more and more complex. My anti-tracking plugins
(like jShelter) tell me they are fingerprinting me when I reach the login
screen. Why? Who knows. I contacted them about this and, of course, I didn’t
talk to a technical person, because you are not supposed to do that, so I
don’t think my words reached anyone who could understand or consider them.</p>
<p>Maybe it’s me who changed. I don’t need the default <span class="caps">PGP</span> configuration anymore
because I can configure it myself, and I realized I need to be able to easily
<code>git send-email</code> more than I need a beautiful Web <span class="caps">UI</span> that tracks me or
uses modern <span class="caps">JS</span> features. I use a weird distro now, which shouldn’t be a problem
but happens to be one, and I realized that having too many dependencies in the
code is often a problem in several dimensions.</p>
<p>So, something that happens too often in my life happened again: being a
technical user was punished once more in favour of the concept of <em>dumb
users</em>. The funny thing about all this is that I don’t think <em>dumb users</em>
exist. We should discuss that another time.</p>
<p>They are a company, they want to grow, so they must try to sell to the
<em>baseline</em> user: the minimal amount of knowledge a person can have. Selling a
product for “expert” users is lost money; there are not that many “experts” in
this world after all. So it’s easier to add layers and layers of complexity to
your software in order to provide a <em>dumb-proof</em> interface than to
educate your userbase, or let the educated ones customize their stuff.
Don’t get me wrong, it makes perfect sense: Protonmail’s mission is to
provide default encryption to the largest possible proportion of email users, so
the decision fits their mission<sup id="fnref:good"><a class="footnote-ref" href="#fn:good">2</a></sup>.</p>
<p>The default encryption and the “easy” <span class="caps">PGP</span> key setup Protonmail offers are really
cool for users who don’t require more customization. I still like the
goals of the company, but I could’ve used a simpler way to customize my
experience: maybe a simpler bridge? Maybe something else… I don’t know.</p>
<p>In the end, they pushed me away from their service.</p>
<p>So long, Protonmail. It’s been a good time together.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:deleted">
<p>My posts on Mastodon are automatically deleted, so if you try
to read the thread there later you might not find it. I’m copying it here as a
reference. <a class="footnote-backref" href="#fnref:deleted" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:good">
<p>Regardless of anything I said here, they are making many people
encrypt their email, one way or another, and I’ll continue to do so. That
is valuable. <a class="footnote-backref" href="#fnref:good" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
</ol>
</div>Mes released and bootstrappable TCC merged2023-11-16T00:00:00+02:002023-11-16T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2023-11-16:/bootstrapGcc9.html<p>Some merging and releasing has been done. So here we are.</p><p>So, some merging and releasing has been done so we need to update a little bit
on what we talked about in the previous blog post.</p>
<h3>Mes</h3>
<p>We spent some more time testing what we shared with you in the previous post,
and now we can proudly say our work has been merged into Mes, and has been
released with it in <span class="caps">GNU</span> Mes 0.25.</p>
<p>You can read the <span class="caps">GNU</span> Mes 0.25 release notes in Janneke’s blog in the following link:</p>
<p><a href="http://joyofsource.com/gnu-mes-025-released.html">http://joyofsource.com/gnu-mes-025-released.html</a></p>
<h3>Bootstrappable TinyCC</h3>
<p>We are also very happy to announce that our changes to the bootstrappable
TinyCC have been merged into Janneke’s repository, which is used for the official
Guix bootstrapping process. You can see the changes<sup id="fnref:changes"><a class="footnote-ref" href="#fn:changes">1</a></sup> being included here:</p>
<p><a href="https://gitlab.com/janneke/tinycc/-/tree/mes-0.25?ref_type=heads">https://gitlab.com/janneke/tinycc/-/tree/mes-0.25?ref_type=heads</a></p>
<h3>Some words about it</h3>
<p>All of us are of course very happy about this, but it didn’t make us relax:
we continued to push fixes and test all this in more ways, always looking
for the next challenge.</p>
<p>We should enjoy this moment a little bit more, and that’s why I am posting this.</p>
<p>I also want to thank again the people who took part in this, especially Andrius
for his help, for all the hours of sleep he lost during the process, and for
giving a second life to this effort when I thought I was too tired to
continue; and Janneke, who very patiently reviewed every single contribution
and has been pushing me since the very beginning of this adventure, when
I was deciding whether to accept the challenge or continue with my life. I’m glad
I chose the adventure.</p>
<p>Of course, this is a cool milestone for all of us; we worked hard for it. But
for me especially it means a lot. I’ve been working on this for almost two
years now, and my changes to the bootstrappable TinyCC had been
sitting in my repository since last year, when I finished the previous
NLnet grant. In fact, everything I did in that grant was sitting there; nothing
was merged into the actual Guix bootstrap, as what I did were very specific parts
of the chain that lacked the connection with the other steps.</p>
<p>When you work on a project like that there’s almost no satisfaction. No
releases, no upstreaming and, in my case, almost no help and no company.
Everything I did could have sat there in my repos forever.</p>
<p>At the time I did the backport of TinyCC I was unsure of what I had done, and I
was exhausted after all the work on <span class="caps">GCC</span>. When we started this second
round I thought everything was going to be broken. And it was, but it was much
better than I thought!</p>
<p>Now, being part of the official Bootstrappable TinyCC means I can finally close
that chapter, which was full of uncertainty, and actually get some interesting
feedback on all that work, which seemed useless at the time I did it.</p>
<p>It happened to be useful after all.</p>
<p>Now let’s see if <span class="caps">GCC</span> happens to be as useful as this was.</p>
<p>Cheers, dear reader. We deserve to celebrate.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:changes">
<p>The commits we had have been reordered and squashed, as the changes
were split into around 40 different commits made as we found the
errors. I managed to rearrange them into a few commits that make much more
sense. I mention it just in case you go looking for the independent commits:
they are gone. My repository still keeps the branches and tags we
mentioned before, so you can still go there to find the changes the way we
did them. <a class="footnote-backref" href="#fnref:changes" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
</ol>
</div>Milestone — MesCC builds TinyCC and fun C errors for everyone2023-10-30T00:00:00+02:002023-10-30T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2023-10-30:/bootstrapGcc8.html<p>We spent the last months making MesCC able to compile TinyCC and making the
result of that compilation able to compile TinyCC. Many cool problems
appeared; this is a summary of our work.</p><p>It’s been a while since the latest technical update on the project, and I am
fully aware that you were missing it, so it’s time to recap with a really cool announcement:</p>
<p><span style="font-size: larger">
<strong>We finally made a self-hosted Bootstrappable TinyCC in <span class="caps">RISC</span>-V</strong>
</span></p>
<p>Most of you probably remember I <a href="bootstrapGcc6.html">already backported</a> the
Bootstrappable TinyCC compiler, but I didn’t test it in a proper environment.
Now, we can confidently say it is able to compile itself, a “large” program
that makes use of more complex C features than my tests did.</p>
<p>All this work was done by Andrius Štikonas and myself. Janneke helped us a lot
with Mes-related parts, too. The work this time was pretty hard, honestly. Most
of the things we did here are not obvious, even for C programmers.</p>
<p>I’m not used to this kind of quirk of the C language. Most of them are really
specific, related to the standards, and many others are just things that were
missing. I hope the ones I chose to discuss here help you understand your
computing better, as they did for me.</p>
<p>This is going to be a very long post, so here’s a ToC to help you out:</p>
<ol>
<li><a href="#context">Context</a><ol>
<li><a href="#why-important">Why is this important?</a></li>
</ol>
</li>
<li><a href="#problems">Problems fixed</a><ol>
<li><a href="#tinycc-missing-instructions">TinyCC misses assembly instructions needed for MesLibC</a></li>
<li><a href="#tcc-assembly">TinyCC’s assembly syntax is weird</a></li>
<li><a href="#extended-assembly">TinyCC does not support Extended Asm in <span class="caps">RV64</span></a></li>
<li><a href="#main-args">MesLibC <code>main</code> function arguments are not set properly</a></li>
<li><a href="#dollars">TinyCC says <code>__global_pointer$</code> is not a valid symbol</a></li>
<li><a href="#tcc-casting-issues">Bootstrappable TinyCC’s casting issues</a></li>
<li><a href="#long-double">Bootstrappable TinyCC’s <code>long double</code> support was missing</a></li>
<li><a href="#mescc-struct-init">MesCC struct initialization issues</a></li>
<li><a href="#size-problems">MesCC vs TinyCC size problems</a></li>
<li><a href="#mes-signed-shift">MesCC add support for signed shift operation</a></li>
<li><a href="#broken-case">MesCC switch/case falls-back to default case</a></li>
<li><a href="#got">Boostrappable TinyCC problems with <span class="caps">GOT</span></a></li>
<li><a href="#wrong-conditionals">Bootstrappable TinyCC generates wrong assembly in conditionals</a></li>
<li><a href="#varargs">Support for variable length arguments</a></li>
<li><a href="#int8">MesLibC use <code>signed char</code> for <code>int8_t</code></a></li>
<li><a href="#jmp">MesLibC Implement <code>setjmp</code> and <code>longjmp</code></a></li>
<li><a href="#more">More</a></li>
</ol>
</li>
<li><a href="#reproducing">Reproducing what we did</a><ol>
<li><a href="#live-bootstrap">Using live-bootstrap</a></li>
<li><a href="#guix">Using Guix</a></li>
</ol>
</li>
<li><a href="#conclusions">Conclusions</a></li>
<li><a href="#next">What is next?</a></li>
</ol>
<h3 id="context">Context</h3>
<p>There are many blog posts in the series where you can find some context about the
project, and even a <span class="caps">FOSDEM</span> talk about it, but they all give a very broad
explanation, so let’s focus on what we are doing right now.</p>
<p>Here we have Mes, a Scheme interpreter, which runs MesCC, a C compiler, which
compiles our simplified fork of TinyCC; let’s call that Bootstrappable TinyCC.
That Bootstrappable TinyCC compiler then tries to compile its own code. It
compiles its own code because the goal is to add more flags in each
compilation, so it has more features in each round<sup id="fnref:rounds"><a class="footnote-ref" href="#fn:rounds">1</a></sup>. We do all this
because TinyCC is way faster than MesCC and it’s also more complex, but MesCC
is only able to build a simple TinyCC with few features enabled.</p>
<p>During all this process we use a standard library provided by the Mes project;
we’ll call it MesLibC, because we can’t build glibc at this point and TinyCC
does not provide its own C standard library.</p>
<p>With all this well understood, this is the achievement:</p>
<p><strong>We made MesCC able to compile the Bootstrappable TinyCC, using MesLibC, to an
executable that is able to compile the Bootstrappable TinyCC’s codebase to a
binary that works and has all the features we need enabled.</strong><sup id="fnref:self-hosted"><a class="footnote-ref" href="#fn:self-hosted">2</a></sup></p>
<p>The process affected all the pieces in the system. We added changes in MesCC,
MesLibC and the Bootstrappable TinyCC.</p>
<h4 id="why-important">Why is this important?</h4>
<p>We already talked at length about the bootstrapping issue, the trusting trust attack
and all that. I won’t repeat it here. What I’ll do instead is be specific.
This step is a big deal because it allows us to go much further in the chain.</p>
<p>All the steps before Mes were already ported to <span class="caps">RISC</span>-V, mostly thanks to Andrius
Štikonas, who worked on <a href="https://github.com/oriansj/stage0-posix">Stage0-<span class="caps">POSIX</span></a> and the rest of the glue projects
that are needed to reach Mes.</p>
<p>Mes had been ported to <span class="caps">RISC</span>-V (64 bit) by <span class="caps">W. J.</span> van der Laan, and some patches
were added on top of it by Andrius Štikonas himself before our current effort started.</p>
<p>At that moment in time, Mes was unable to build our bootstrappable TinyCC in
<span class="caps">RISC</span>-V, the next step in the process, and the bootstrappable TinyCC itself was
unable to build itself either. This was a very limiting point, because TinyCC
is the first “proper” C compiler in the chain.</p>
<p>When I say “proper” I mean fast and fully featured as a C compiler. On x86,
TinyCC is able to compile old versions of <span class="caps">GCC</span>. If we manage to port it to
<span class="caps">RISC</span>-V we will eventually be able to build <span class="caps">GCC</span> with it, and with that the world.</p>
<p>In summary, TinyCC is a key step in the bootstrapping chain.</p>
<h3 id="problems">Problems fixed</h3>
<p>This work can be easily followed in the commits in my <span class="caps">TCC</span> fork’s
<a href="https://github.com/ekaitz-zarraga/tcc/tree/riscv-mes"><code>riscv-mes</code></a> branch, and in my Mes clone’s <a href="https://github.com/ekaitz-zarraga/mes/tree/riscv-tcc-boot"><code>riscv-tcc-boot</code></a>
branch. We are also identifying the contents of this blogpost in the git
history by adding the git tag <code>self-hosted-tcc-rv64</code> to both of my forks. We
will try to keep both for future reference.</p>
<p>In Mes the process might be a little harder to follow because we sent most
of the patches to Janneke and he merged them, so when we were about to publish
this post I continued from Janneke’s branch to avoid divergences (I had some
problems with that before). In any case, the code is there, and searching by
author (Andrius and myself) will guide you to the changes we made.</p>
<p>Many commits have a long message you can go read there, but this post was born
to summarize the most interesting changes we made and present them in a more
digestible way. Let’s see if I manage to do that.</p>
<p>The following list is not ordered in any particular way, but we hope the
selection of problems is interesting for you. We found some more errors,
but these are the ones we consider most relevant.</p>
<h4 id="tinycc-missing-instructions">TinyCC misses assembly instructions needed for MesLibC</h4>
<p>TinyCC is not like <span class="caps">GCC</span>: TinyCC generates binary code directly, with no assembly code
in between. TinyCC has a separate assembler that doesn’t follow the path that C
code follows.</p>
<p>It works the same in all architectures, but we can take <span class="caps">RISC</span>-V as an example:</p>
<p>TinyCC has <code>riscv64-gen.c</code>, which generates the binary code, while the
<code>riscv64-asm.c</code> file parses assembly code and also generates binary. As you can
see, binary generation is somewhat duplicated.</p>
<p>In the <span class="caps">RISC</span>-V case, the C part has had support for mostly everything since my
backport, but the assembler did not support many instructions (which, by the
way, are supported by the C part).</p>
<p>MesLibC’s <code>crt1.c</code> is written in assembly code. Its goal is to prepare the
<code>main</code> function and call it. For that it needs the <code>jalr</code> instruction and
others that were not supported by TinyCC, neither upstream nor in our
bootstrappable fork.</p>
<p>These changes appear in several commits because I didn’t really understand how
the TinyCC assembler worked, and some instructions need to use relocations
which I didn’t know how to add. The following commit can show how it feels to
work on this, and shares how relocations are done:</p>
<p>There you can see we started to understand things in TinyCC, but some other
changes came after this.</p>
<p>A very important note here is that upstream TinyCC does not have support for these
instructions yet, so we need to patch upstream TinyCC when we use it, contribute
the changes, or find some other kind of solution. Each solution has its downsides
and upsides, so we will need to make a decision about this later.</p>
<h4 id="tcc-assembly">TinyCC’s assembly syntax is weird</h4>
<p>Following with the previous fix, TinyCC does not support <span class="caps">GNU</span>-Assembler’s syntax
in <span class="caps">RISC</span>-V. It uses a simplified assembly syntax instead.</p>
<p>Where we would write:</p>
<pre><code class="language-asm">sd s1, 8(a0)
</code></pre>
<p>in TinyCC’s assembly we have to write:</p>
<pre><code class="language-asm">sd a0, s1, 8
</code></pre>
<p>This required changes in MesLibC, and made us create a separate folder for
TinyCC in MesLibC. See <code>lib/riscv64-mes-tcc/</code> and <code>lib/linux/riscv64-mes-tcc</code>
for more details.</p>
<h4 id="extended-assembly">TinyCC does not support Extended Asm in <span class="caps">RV64</span></h4>
<p>Way later in time we also found TinyCC does not support <a href="https://gcc.gnu.org/onlinedocs/gcc/Extended-Asm.html">Extended Asm</a>
in <span class="caps">RV64</span>. The functions that manage that are simply empty.</p>
<p>It took us some time to realize what was going on here, for two reasons.
First, there are few cases of Extended Asm in the code we were compiling.
Second, it was failing silently.</p>
<p>Extended Asm is important because it lets you tell the compiler you are going
to touch some registers in the assembly block, so it can protect variables and
apply optimizations properly.</p>
<p>In our case, our assembly blocks were clobbering some variables that would have
been protected by the compiler if Extended Asm support had been implemented.</p>
<p>Andrius found all the places in MesLibC where Extended Asm was used and rewrote
the assembly code to keep variables safe in the cases it was needed.</p>
<p>The other option was to add Extended Asm support to TinyCC, but we would need
to add it to the Bootstrappable TinyCC and also upstream. That also means
understanding the TinyCC codebase very well and making the changes without errors,
so we decided to simplify MesLibC, because that is easier to get right. We are
probably going to need to do this later anyway, but we’ll try to delay it
as much as possible.</p>
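<p>As a rough illustration (this is not code from the Mes tree), this is what Extended Asm looks like in GNU C. The template is left empty so the sketch stays portable across architectures; the interesting part is the colon-separated sections, which declare outputs, inputs and clobbers, exactly the information a backend without Extended Asm support silently drops:</p>
<pre><code class="language-clike">/* Hypothetical example: "+r"(x) declares x as read and written by the
   assembly block, and the "memory" clobber tells the compiler the block
   may touch memory, so it cannot cache values across it. */
static int touch(int x)
{
    __asm__ volatile ("" : "+r" (x) : : "memory");
    return x;
}

int main(void)
{
    return touch(41) == 41 ? 0 : 1;
}
</code></pre>
<p>Without those declarations (a plain <code>asm("...")</code>), the compiler assumes the block touches nothing, which is exactly how variables end up clobbered.</p>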
<h4 id="main-args">MesLibC <code>main</code> function arguments are not set properly</h4>
<p>Following the previous problem with assembly, we later found that the input
arguments of the <code>main</code> function, which come from the command line, were not
properly set by our MesLibC. Andrius also took care of that in
<a href="https://github.com/ekaitz-zarraga/mes/commit/4f4a11745d1c7ed0995e9d31c7994abfb4a60b25">4f4a1174</a> in Mes.</p>
<p>This error was easier to find than others because when we found issues with
this we already had a compiled TinyCC. So we just needed to fix simple things
around it.</p>
<h4 id="dollars">TinyCC says <code>__global_pointer$</code> is not a valid symbol</h4>
<p>This is a small issue that was a headache for a while, but it turned out to be
very simple.</p>
<p>In <span class="caps">RISC</span>-V there’s a symbol, <code>__global_pointer$</code>, that is used for dynamic
linking, defined in the <span class="caps">ABI</span>. But TinyCC had trouble parsing code around it, and
it took us some time to realize it was the dollar sign (<code>$</code>) that was causing
the issues at this point.</p>
<p>TinyCC does not process dollars in identifiers unless you specifically set a
flag (<code>-fdollars-in-identifiers</code>) when running it. In the <span class="caps">RISC</span>-V case, that
flag must always be active, because otherwise <code>__global_pointer$</code> can’t be processed.</p>
<p>We tried to set that flag on the command line, but we had other issues in the
command-line argument parsing (we found and fixed them later), so we just
hardcoded it.</p>
<p>This issue is interesting because it’s an extremely simple problem, but its
effect appears in weird ways and it’s not always easy to know where the problem
is coming from.</p>
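<p>For comparison, GCC and Clang accept dollars in identifiers by default on most targets, while TinyCC requires the flag. A contrived sketch (the identifier below is made up, not part of the ABI):</p>
<pre><code class="language-clike">/* Compiles as-is with GCC/Clang on most targets; TinyCC needs
   -fdollars-in-identifiers to accept the '$' in the name. */
int global$value = 7;

int main(void)
{
    return global$value == 7 ? 0 : 1;
}
</code></pre>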
<h4 id="tcc-casting-issues">Bootstrappable TinyCC’s casting issues</h4>
<p>This one was a really hard one to fix.</p>
<p>When running our Bootstrappable TinyCC to build MesLibC we found this error:</p>
<pre><code class="language-nothing"> cannot cast from/to void
</code></pre>
<p>We managed to isolate a piece of C code that was able to replicate the
problem.<sup id="fnref:reproducer"><a class="footnote-ref" href="#fn:reproducer">3</a></sup></p>
<pre><code class="language-clike">long cast_charp_to_long (char const *i)
{
return (long)i;
}
long cast_int_to_long (int i)
{
return (long)i;
}
long cast_voidp_to_long (void const *i)
{
return (long)i;
}
void main(int argc, char* argv[]){
return;
}
</code></pre>
<p>Compiling this file raised the same issue, but then I realized I could remove
two of the functions at the top and the error didn’t happen. Adding one of
those functions back raised the error again.</p>
<p>I tried to change the order of the functions and the functions I chose to add,
and I could reproduce it: if there were two functions it failed but it could
build with only one.</p>
<p>Andrius found that the function type was not properly set in the <span class="caps">RISC</span>-V code
generation and its default value was <code>void</code>, so it only failed when it compiled
the second function.</p>
<p>Knowing that, we could take other architectures as a reference to fix this, and
so we did.</p>
<p>See <a href="https://github.com/ekaitz-zarraga/tcc/commit/6fbd17852aa11a2d0bc047183efaca4ff57ab80c">6fbd1785</a>.</p>
<h4 id="long-double">Bootstrappable TinyCC’s <code>long double</code> support was missing</h4>
<p>When I backported the <span class="caps">RISC</span>-V support to our Bootstrappable TinyCC I missed the
<code>long double</code> support, and I didn’t realize it because I never tested large
programs with it.</p>
<p>The C standard doesn’t define a size for <code>long double</code> (it just says it has to
be at least as long as <code>double</code>), but its size is normally set to 16 bytes.
All this is weird in <span class="caps">RV64</span>, because it doesn’t have 16-byte registers. It
needs some extra support.</p>
<p>Before we fixed this, the following code:</p>
<pre><code class="language-clike">long double f(int a){
return a;
}
</code></pre>
<p>Failed with:</p>
<pre><code class="language-nothing"> riscv64-gen.c:449 (`assert(size == 4 || size == 8)`)
</code></pre>
<p>Because it was only expecting to use <code>double</code>s (8 bytes) or <code>float</code>s (4 bytes).</p>
<p>In upstream TinyCC there were some commits that added <code>long double</code> support
using, and I quote, a <em>mega hack</em>, so I just copied that support to our
Bootstrappable TinyCC.</p>
<p>See <a href="https://github.com/ekaitz-zarraga/tcc/commit/a7f3da33456b4354e0cc79bb1e3f4c665937395b">a7f3da33456b</a>.</p>
<p>After this commit, some extra problems appeared with missing symbols. But
these were link-time problems, because TinyCC has the floating-point
helper functions needed for <span class="caps">RISC</span>-V defined in <code>lib/lib-arm64.c</code>, as they
were reusing aarch64 code for them.</p>
<p>After this, we also compile and link <code>lib-arm64.c</code>, and we have <code>long double</code>
support.</p>
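<p>The only portable guarantee here is the relative one, checkable with any C compiler; the 16-byte width is an ABI decision (on riscv64-linux it is the IEEE binary128 format), not something the standard mandates:</p>
<pre><code class="language-clike">int main(void)
{
    /* The C standard only requires long double to be at least as wide
       as double; the actual width (16 bytes on riscv64-linux) is a
       platform choice. */
    return sizeof (long double) >= sizeof (double) ? 0 : 1;
}
</code></pre>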
<h4 id="mescc-struct-init">MesCC struct initialization issues</h4>
<p>This one was a lot of fun. Our Bootstrappable TinyCC exploded with random
issues: segfaults, weird branch decisions…</p>
<p>After tons of debugging, Andrius found some values in <code>struct</code>s were not set
properly. As we don’t know TinyCC’s codebase very well, that was hard
to follow, and we couldn’t really tell where the value was coming from.</p>
<p>Andrius finally realized some <code>struct</code>s were not initialized properly. Consider
this example:</p>
<pre><code class="language-clike">typedef struct {
int one;
int two;
} Thing;
Thing a = {0};
</code></pre>
<p>That’s supposed to initialize <em>all</em> fields in the <code>Thing</code> <code>struct</code> to <code>0</code>,
according to the C standard<sup id="fnref:cppref"><a class="footnote-ref" href="#fn:cppref">4</a></sup>.</p>
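<p>A tiny self-check of that rule, runnable with any conforming compiler, makes the expectation explicit:</p>
<pre><code class="language-clike">typedef struct {
    int one;
    int two;
} Thing;

int main(void)
{
    Thing a = {0};  /* the standard zero-initializes every member,
                       not only the first one */
    return (a.one == 0 && a.two == 0) ? 0 : 1;
}
</code></pre>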
<p>As a first solution we set struct fields manually to <code>0</code>, to make sure they
were initialized properly. See <a href="https://github.com/ekaitz-zarraga/tcc/commit/29ac0f40a7afba6a2d055df23a8ee2ee2098529e">29ac0f40a7afb</a>.</p>
<p>After some debugging we found that the fields that were not explicitly set were
initialized to <code>22</code>. So I decided to go to MesCC and see if the struct
initialization was broken.</p>
<p>This was my first dive in MesCC’s code, and I have to say it’s really easy to
follow. It took me some time to read through it because I’m not that used to
<code>match</code>, but I managed to find the struct initialization code.</p>
<p>What I found in MesCC was a <code>22</code> hardcoded in the struct
initialization code, probably left over from some debug code that was never
removed. As no part of the x86 bootstrapping relied on that kind of
initialization, the error went unnoticed.</p>
<p>I set that to <code>0</code>, as it should be, and we continued with our lives.</p>
<h4 id="size-problems">MesCC vs TinyCC size problems</h4>
<p>The C standard does not set an exact size for integers. It mostly sets
relative sizes: a <code>short</code> can’t be wider than an <code>int</code>, an <code>int</code> can’t be wider
than a <code>long</code>, and so on, plus some minimum ranges. If your platform wants,
<code>int</code> can be 16 bits wide while <code>long</code> is 64, and that’s fine by the C standard.</p>
<p>TinyCC’s <span class="caps">RISC</span>-V backend was written under the assumption that <code>int</code> is 32 bits
wide. You can see this happening in <code>riscv64-gen.c</code>, for example, here:</p>
<pre><code class="language-clike"> EI(0x13, 0, rr, rr, (int)pi << 20 >> 20); // addi RR, RR, lo(up(fc))
</code></pre>
<p>The bit shifting there is done to keep only the lower 12 bits of the <code>pi</code>
variable (sign-extended). This code’s behavior changes from one platform to
another: on a platform where <code>int</code> is 64 bits wide, for instance, these shifts
would keep the lower 44 bits instead of the lower 12.</p>
<p>In our case, MesCC was using the whole register width, 64 bits, for temporary
values, so the lowest 44 bits were kept and the next assertion, which checked
that the immediate fit in 12 bits, didn’t pass.</p>
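<p>This is easy to check on a regular machine. The following sketch
(hypothetical <code>low12</code>/<code>low44</code> helpers and values; note that shifting bits out of
a signed <code>int</code> is strictly implementation-defined, but behaves arithmetically
in <span class="caps">GCC</span>) shows how the width of the temporary changes the result:</p>
<pre><code class="language-clike">/* 32 bit temporary: only the low 12 bits survive, sign-extended */
int low12(long long pi) { return (int)pi << 20 >> 20; }

/* 64 bit temporary: the low 44 bits survive instead */
long long low44(long long pi) { return pi << 20 >> 20; }

/* low12(0x123456789) == 0x789
   low44(0x123456789) == 0x123456789: the value passes through untouched */
</code></pre>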
<p>This is a huge problem, as most of the code in the <span class="caps">RISC</span>-V generation is written
using this style.</p>
<p>There are other ways to do the same thing (<code>pi & 0xFFF</code> maybe?) in a more
portable way, but we don’t know why upstream TinyCC decided to do it this way.
Probably they did because <span class="caps">GCC</span> (and TinyCC itself) use 32 bit integers, but they
didn’t handle other possible cases, like the one we had here with MesCC.</p>
<p>In any case, this made us rethink MesCC, dig into how its integers are
defined, how to make them compatible with TinyCC and so on, but I finally
decided to add casts in the middle to make sure all this compiled as expected.</p>
<p>It was a good reason to re-think MesCC’s integers, but dealing with this took
a very long time that could have been spent on something else. Now we are all
paranoid about integers, and we still think more errors will arise from them in
the future. Integers are hard.</p>
<h4 id="mes-signed-shift">MesCC add support for signed shifting</h4>
<p>Integers were in our minds for long, as described in the previous block, but I
didn’t talk about signedness in that one.</p>
<p>Following one of the crazy errors we had in TinyCC, I somehow realized (I don’t
remember how!) that we were missing signed shifting support in MesCC. I think I
found it while researching the code MesCC was outputting: I spotted some bit
shifts done with unsigned instructions on signed values and started digging in
MesCC to find out why. I finally realized there was no support for signed
shifts at all; the shift instruction wasn’t selected depending on the
signedness of the value being shifted.</p>
<p>Let’s see this with an example:</p>
<pre><code class="language-clike">signed char a = 0xF0;   // -16
unsigned char b = 0xF0; // 240
// What is this? (Answer: -1, the 0xFF bit pattern)
a >> 4;
// And this? (Answer: 0x0F => 15)
b >> 4;
</code></pre>
<p>In the example you can see the shifting operation does not work the same way
for signed and unsigned values: a signed shift copies the sign bit in from the
left, while an unsigned shift fills with zeros. If you always use the unsigned
version of the <code>>></code> operation, you don’t get the results you expect. Signs are
also hard.</p>
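<p>Compiled with a regular compiler, the difference is easy to verify (a minimal
sketch with hypothetical helper names; right-shifting negative values is
implementation-defined, though mainstream compilers shift arithmetically):</p>
<pre><code class="language-clike">int shift_signed(void) {
    signed char a = 0xF0;   /* -16 */
    return a >> 4;          /* arithmetic shift: -1 */
}

int shift_unsigned(void) {
    unsigned char b = 0xF0; /* 240 */
    return b >> 4;          /* zeros come in from the left: 15 */
}
</code></pre>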
<p>In this case, like in many others, the fix was easier than realizing what was
going wrong. I just added support for the signed shifting operation, not only
for <span class="caps">RISC</span>-V but for all architectures, and I added the correct signedness check
to the shifting operation to select the correct instruction. The patch (see
<a href="https://github.com/ekaitz-zarraga/mes/commit/88f24ea8661dd279c2a919f8fbd5f601bb2509ae">88f24ea8</a> in Mes) is very clean and easy to read, because
MesCC’s codebase is really well ordered.</p>
<blockquote>
<p><span class="caps">EDIT</span>: Someone on the web noted I called the <em>bit-shift</em> operations
<em>rotation</em> operations. I normally use both words interchangeably but it is
true they don’t mean the same thing: in a shift the bits that fall off are
lost, while in a rotation they come back in from the other side of the
register. I edited the article to use the correct word.</p>
</blockquote>
<h4 id="broken-case">MesCC switch/case falls through to the default case</h4>
<p>In the early bootstrap runs, our Bootstrappable TinyCC did weird things.
After many debugging sessions we realized the <code>switch</code> statements in
<code>riscv64-gen.c</code>, more specifically in <code>gen_opil</code>, were broken. The
fall-throughs in the <code>switch</code> always ended up in the <code>default</code> case. Weird!</p>
<p>MesCC has many tests, so I read all the ones related to <code>switch</code> statements.
The ones that exercised fall-through all fell through into the <code>default</code> case,
so our weird behavior wasn’t covered.</p>
<p>I added tests for our case and was reading the disassembly of simple examples
when I realized the problem.</p>
<p>Each <code>case</code> block has two parts: the clause that checks whether the value of
the expression matches the case, and the body of the case itself.</p>
<p>The <code>switch</code> statement generation was doing some magic to deal with <code>case</code>
blocks, but it failed on complex fall-through schemes: when execution fell
through into a <code>case</code>, that case’s clause was evaluated again. The clause was
necessarily false (the one that matched was the case we fell through from), so
the code jumped to the <code>default</code> case.</p>
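<p>A minimal reproducer of the behavior we needed (a hypothetical <code>classify</code>
function, not TinyCC code) looks like this:</p>
<pre><code class="language-clike">int classify(int v) {
    int r = 0;
    switch (v) {
    case 1:
        r += 10;        /* no break: must fall through into case 2 */
    case 2:
        r += 100;
        break;
    default:
        r = -1;
    }
    return r;           /* classify(1) == 110, not -1 */
}
</code></pre>
<p>With the bug, the fall-through from <code>case 1</code> re-evaluated <code>case 2</code>’s clause,
which was false, so <code>classify(1)</code> returned <code>-1</code>.</p>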
<p>Fixing this had its own problems, as NyaCC (MesCC’s C parser) returns
<code>case</code> blocks nested when they don’t have a <code>break</code> statement:</p>
<pre><code class="language-lisp">(case testA
(case testB
(case testC BODY)))
</code></pre>
<p>Instead of doing this, I decided to flatten the <code>case</code> blocks with empty
bodies. This way we can deal with the structure in a simpler way.</p>
<pre><code class="language-lisp">((case testA (expr-stmt))
(case testB (expr-stmt))
(case testC BODY))
</code></pre>
<p>Once this is done, I expanded each <code>case</code> block into a jump that skips its own
clause, then the clause, and then its body. Code falling through from the
previous case hits the jump and lands directly on the body, so the clause isn’t
re-evaluated, as it doesn’t need to be. The generated code looks like this in
pseudocode:</p>
<pre><code class="language-assembly"> ;; This doesn't have the jump because it's the first
CASE1:
testA
CASE1_BODY:
...
goto CASE2_BODY
CASE2:
testB
CASE2_BODY:
...
goto CASE3_BODY
CASE3:
    testC
CASE3_BODY:
...
</code></pre>
<p>If one of the <code>case</code>s has a <code>break</code>, it’s treated as part of its body, and it
ends the execution of the <code>switch</code> statement normally, with no fall-through.</p>
<p>This results in much simpler <code>case</code> block control. The previous approach dealt
with nested <code>case</code> blocks and tried to be clever about them, unsuccessfully.
The best thing about this commit is that most of the cleverness was simply
removed by a simple solution (flatten all the things!).</p>
<p>It wasn’t that easy to implement, but I first built a simple prototype and
Janneke’s Scheme magic made my approach usable in production.</p>
<p>All this is in Mes’s codebase in several commits, as we needed some iterations
to get it right. <a href="https://github.com/ekaitz-zarraga/mes/commit/22cbf823582e3699b6a21ee0cf74c2dbf0a6a4e9">22cbf823582</a> has the base of this change, but
there were some more iterations in Mes.</p>
<h4 id="got">Bootstrappable TinyCC problems with <span class="caps">GOT</span></h4>
<p>The Global Offset Table is a table of addresses that lets relocatable binaries
reach their global symbols. Our Bootstrappable TinyCC segfaulted because it was
generating an empty <span class="caps">GOT</span>.</p>
<p>Andrius debugged upstream TinyCC alongside ours and realized there was a
missing check in an <code>if</code> statement. He fixed it in
<a href="https://github.com/ekaitz-zarraga/tcc/commit/f636cf3d4839d1ca3f5af9c0ad9aef43a4bfccd9">f636cf3d4839d1ca</a>.</p>
<p>The problem with this kind of error is that TinyCC’s codebase is really hard to
read. It’s a very small compiler, but it’s not obvious how things are done in
it, so we had to spend many hours in debugging sessions that went nowhere. With
a compiler that was easier to read and change, the fix would have been much
simpler and the whole experience better.</p>
<h4 id="wrong-conditionals">Bootstrappable TinyCC generates wrong assembly in conditionals</h4>
<p>We spent a long time debugging a bug I introduced during the backport when I
tried to undo some optimization upstream TinyCC applied to comparison operations.</p>
<p>Consider the following code:</p>
<pre><code class="language-clike">if ( x < 8 )
whatever();
else
whatever_else();
</code></pre>
<p>Our Bootstrappable TinyCC was unable to compile this code correctly: instead,
it output code that always took the same branch, regardless of the value of
<code>x</code>.</p>
<p>In TinyCC, a conditional like <code>if (x < CONSTANT)</code> has a special treatment, and
it’s converted to something like this pseudoassembly:</p>
<pre><code class="language-pseudo">load x to a0
load CONSTANT to a1
set a0 if less than a1
branch if a0 not equal 0 ; Meaning it's `set`
</code></pre>
<p>This behaviour uses the <code>a0</code> register as a flag, emulating the condition flags
other CPUs use for comparisons. <span class="caps">RISC</span>-V doesn’t need that, but it’s still done
here, probably for consistency with the other architectures. In <span class="caps">RISC</span>-V it
could look like this:</p>
<pre><code class="language-pseudo">load x to a0
load CONSTANT to a1
branch if a0 less than a1
</code></pre>
<p>You can easily see the <code>branch</code> <span class="dquo">“</span>instruction” does a different comparison in
each version: in the first it checks whether <code>a0</code> is set, and in the second
whether <code>a0</code> is smaller than <code>a1</code>.</p>
<p>TinyCC handles this case in a very clever way (maybe too clever?). When they
emit the <code>set a0 if less than a1</code> instruction they replace the current
comparison operation with <code>not equal</code> and they remove the <code>CONSTANT</code> and
replace it with a <code>0</code>. That way, when the <code>branch</code> instruction is generated,
they insert the correct clause.</p>
<p>In my code I forgot to replace the comparison operator, so the branch checked
<code>if a0 is less than 0</code>, which was always false: the <code>set</code> operation writes a
<code>0</code> or a <code>1</code>, and neither is less than <code>0</code>.</p>
<p>The commit <a href="https://github.com/ekaitz-zarraga/tcc/commit/5a0ef8d0628f719ebb01c952797a86a14051228c">5a0ef8d0628f719</a> explains this in a more technical way,
using actual <span class="caps">RISC</span>-V instructions.</p>
<p>This was also hard to fix, because TinyCC’s variable names (<code>vtop->c.i</code>) are
really weird and they are used for many different purposes.</p>
<h4 id="varargs">Support for variable length arguments</h4>
<p>In C you can define functions that take a variable number of arguments. In
<span class="caps">RISC</span>-V those arguments are passed in registers, while on other architectures
they are passed on the stack. This makes the <span class="caps">RISC</span>-V case a little more complex
to deal with, and it needs special treatment.</p>
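<p>For context, this is what a variadic function looks like in C (a standard
<code>stdarg.h</code> sketch with a hypothetical <code>sum</code> helper, not the TinyCC internals):</p>
<pre><code class="language-clike">/* Sums `n` ints passed as variable arguments. On RISC-V the first
   arguments arrive in registers a0..a7, so va_start/va_arg have to
   know how to walk registers, not just the stack. */
int sum(int n, ...) {
    va_list ap;
    int total = 0;
    va_start(ap, n);
    while (n-- > 0)
        total += va_arg(ap, int);
    va_end(ap);
    return total;       /* sum(3, 1, 2, 3) == 6 */
}
</code></pre>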
<p>Andrius realized our Bootstrappable TinyCC had issues with variable length
arguments, especially in the most famous function that uses them: <code>printf</code>. He
also found the cause: the arguments were not being set up properly.</p>
<p>Reading upstream TinyCC we found they use a really weird system for the defines
that deal with this. They have a header file, <code>include/tccdefs.h</code>, which is
included in the codebase but also processed by a tool that generates strings
which are later injected into TinyCC at execution time.</p>
<p>This was too much for us so we just extracted the simplest variable arguments
definitions for <span class="caps">RISC</span>-V and introduced that in MesLibC and our Bootstrappable TinyCC.</p>
<h5>Extra: files generated with no permissions</h5>
<p>The Bootstrappable TinyCC built using MesCC generated files with no
permissions. Andrius found that this problem also came from the variable length
argument support definitions, so he fixed that too<sup id="fnref:stikonas"><a class="footnote-ref" href="#fn:stikonas">5</a></sup>.</p>
<p>The macro that defined <code>va_start</code> had broken pointer arithmetic. At the
beginning he thought it was related to MesCC’s internals, but he later tested
in <span class="caps">GCC</span> and realized the problem was in the macro definition itself. That’s why
the commit currently says “workaround” in its name, but it’s more than a
workaround: it’s a proper fix. We are rewording that, but that will happen
after this post is released.</p>
<h4 id="int8">MesLibC use <code>signed char</code> for <code>int8_t</code></h4>
<p>We already had a running Bootstrappable TinyCC compiled using MesCC when we
stumbled upon this issue. Somehow, when assembling:</p>
<pre><code class="language-asm">addi a0, a0, 9
</code></pre>
<p>The code was trying to read <code>9</code> as a register name, and (of course) failed to
do it. It was weird to realize that the following code (in <code>riscv64-asm.c</code>)
always took the true branch of the <code>if</code> statement, even when
<code>asm_parse_regvar</code> returned <code>-1</code>:</p>
<pre><code class="language-clike">int8_t reg;
...
if ((reg = asm_parse_regvar(tok)) != -1) {
...
} else ...
</code></pre>
<p>I disassembled and saw something like this:</p>
<pre><code class="language-pseudoassembly">call asm_parse_regvar ;; Returns value in a0
reg = a0
a0 = a0 + 1
branch if a0 equals 0
</code></pre>
<p>This looks ok; it does some magic with the <code>-1</code>, but it makes sense anyway. The
problem is that it didn’t branch, because <code>a0</code> held <code>256</code> even when
<code>asm_parse_regvar</code> returned <code>-1</code>.</p>
<p>During some of the <code>int</code> related problems someone on the Fediverse told me that
<code>char</code><span class="quo">‘</span>s default signedness is not defined by the C standard. I read MesLibC
and, exactly: <code>int8_t</code> was defined as an alias of <code>char</code>.</p>
<p>In <span class="caps">RISC</span>-V <code>char</code> is <code>unsigned</code> by default (don’t ask me why) but we are used to
x86, where it’s <code>signed</code> by default. Plain <code>char</code> is simply not portable.</p>
<p>Replacing:</p>
<pre><code class="language-clike">typedef char int8_t;
</code></pre>
<p>With:</p>
<pre><code class="language-clike">typedef signed char int8_t;
</code></pre>
<p>Fixed the issue.</p>
<p>From this you can learn several things:</p>
<ol>
<li>Don’t assume <code>char</code><span class="quo">‘</span>s signedness in C</li>
<li>If you design a programming language, be consistent with your decisions. In
C <code>int</code> always means <code>signed int</code>, but <code>char</code> doesn’t act like that. Don’t do this.</li>
</ol>
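<p>The whole class of bugs can be reproduced on any machine by picking the
signedness explicitly (types renamed here for illustration):</p>
<pre><code class="language-clike">typedef unsigned char broken_int8;  /* what `typedef char int8_t` meant on RISC-V */
typedef signed char fixed_int8;     /* the fix */

int broken(void) {
    broken_int8 reg = -1;   /* stored as 255 */
    return reg != -1;       /* 1: promotes to 255, the check never fails */
}

int fixed(void) {
    fixed_int8 reg = -1;    /* stays -1 */
    return reg != -1;       /* 0: the check works as intended */
}
</code></pre>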
<h4 id="jmp">MesLibC Implement <code>setjmp</code> and <code>longjmp</code></h4>
<p>Those who are not that versed in C, as I wasn’t before we found this issue,
won’t know about <code>setjmp</code> and <code>longjmp</code>. Simplifying a lot, they are like a
<code>goto</code> you can use from any part of the code: <code>setjmp</code> takes a buffer and
stores the state of the program in it, and <code>longjmp</code> restores the state of the
program from that buffer, jumping back to the position where <code>setjmp</code> was
called.</p>
<p>Both functions are part of the C standard library and they need specific
support for each architecture because they need to know which registers are
considered part of the state of the program. They need to know how to store the
program counter, the return address, and so on, and how to restore them.</p>
<p>In their simplest form they are a set of stores in the case of the <code>setjmp</code> and
a set of loads in the case of <code>longjmp</code>.</p>
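<p>Their use looks like this (a minimal sketch of the standard <code>setjmp.h</code>
interface; the <code>fail</code>/<code>run</code> helpers are mine, for illustration):</p>
<pre><code class="language-clike">static jmp_buf env;

void fail(void) {
    longjmp(env, 42);         /* jump back to the setjmp call site */
}

int run(void) {
    int code = setjmp(env);   /* returns 0 on the direct call */
    if (code == 0) {
        fail();
        return -1;            /* never reached */
    }
    return code;              /* returns 42 after the longjmp */
}
</code></pre>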
<p>In <span class="caps">RISC</span>-V they only need to store the <code>s*</code> registers (plus the return address
and stack pointer), as the rest are treated as temporary. It’s simple, but it
needs to be done, and it hadn’t been done for <span class="caps">RISC</span>-V in MesLibC.</p>
<p>Andrius is not convinced by our commit here, and I agree with his
concerns. We added full <code>setjmp</code> and <code>longjmp</code> implementations directly
<del>stolen from</del> inspired by the ones in Musl<sup id="fnref:stolen"><a class="footnote-ref" href="#fn:stolen">6</a></sup>, but they also include
floating point register support, using instructions that are not implemented in
TinyCC yet. This is going to be a problem in the future, because later
iterations will try to execute instructions they don’t actually understand.</p>
<p>There are two (or three) possible solutions here. The first is to remove the
floating point instructions for now (another flavor of this solution is to
hide them behind an <code>#ifdef</code>). The second is to implement the floating point
instructions in TinyCC’s <span class="caps">RISC</span>-V assembler, which sounds great but forces us to
upstream the changes, a process that may take a long time, so we’d need to
patch it in our bootstrapping scripts until it happens.</p>
<p>We just added the <code>#ifdef</code>s because our code is full of them anyway and sent it
to Mes: <a href="https://github.com/ekaitz-zarraga/mes/commit/0e2c55697df285250c8a24442f169bc52d729c31">0e2c5569</a>.</p>
<h4 id="more">More</h4>
<p>Those are mostly the coolest errors we needed to deal with, but we stumbled
upon many more.</p>
<p>Before this effort started, Andrius added support for 64 bit instructions in
Mes and fixed some issues 64 bit architectures had in M2.</p>
<p>I found a <a href="https://issues.guix.gnu.org/65225">bug in Guix shell</a> (it’s still
open) and had to fix some <span class="caps">ELF</span> headers in MesCC generated files because objdump
and gdb refused to work on them.</p>
<p>Andrius also found issues with weak symbols in MesLibC, triggered because <span class="caps">TCC</span>
didn’t have support for them. Thankfully upstream <span class="caps">TCC</span> had that issue fixed and
we just cherry-picked it, for the win.</p>
<p>He even had the energy to test all this on real <span class="caps">RISC</span>-V hardware we
specifically acquired for this task.</p>
<p>There are many more things to tell, but this is already getting too long and if
I continue writing we’ll probably end up fixing some stuff more.</p>
<p>In the end, a project like this is like hitting your head against a wall until
one of them breaks. Sometimes it feels like the head did, but it’s all good.</p>
<h4 id="reproducing">Reproducing what we did</h4>
<p>All we did means nothing if you can’t reproduce it. We provide two ways to
reproduce this process: live-bootstrap and Guix.</p>
<p>Both provide a similar thing, but there are some high-level differences worth
mentioning now.</p>
<p>Compared with <code>live-bootstrap</code>, Guix helps because it reuses previous steps if
they didn’t change. This results in shorter waits once Mes is sorted out.</p>
<p>On the other hand, I have had issues with failed builds in Guix (in emulated
systems). It was hard to jump inside the build container and play around, so
the development cycle suffered a lot. In <code>live-bootstrap</code>, if you are good with
<code>bwrap</code>, you can jump in and tweak things with no issues.</p>
<p>For those who enjoy digging in the code and trying to follow the process, I
recommend following <code>live-bootstrap</code><span class="quo">‘</span>s scripts. The directory structure is a
little bit confusing, but the scripts are very plain and linear. The ones in
the Guix process come from previous bootstrap efforts and are designed to do
many things automagically, which makes them hard to follow.</p>
<h5 id="live-bootstrap">Using live-bootstrap</h5>
<p>Andrius is part of the <code>live-bootstrap</code> effort and he’s doing all the scripting
there to keep the process reproducible.</p>
<p><a href="https://github.com/fosslinux/live-bootstrap">Live-bootstrap</a> is…</p>
<blockquote>
<p>An attempt to provide a reproducible, automatic, complete end-to-end
bootstrap from a minimal number of binary seeds to a supported fully
functioning operating system.</p>
</blockquote>
<p>That’s the official description of the project. From a more practical
perspective, it’s a set of scripts that build the whole operating system from
scratch, depending on a few binary seeds.</p>
<p>That’s not very different from what Guix provides from a bootstrapping
perspective. Guix is “just” an environment where you can run “scripts” (the
packages define how they are built) in a reproducible way. Of course, Guix is
way more than that, but if we focus on what we are doing right now it acts like
the exact same thing.</p>
<blockquote>
<p><span class="caps">NOTE</span>: <code>live-bootstrap</code><span class="quo">‘</span>s project description is a little bit outdated. If you
read its comparison with Guix, you’d be reading old information. For more
up-to-date information about Guix’s bootstrapping process I suggest this page
of the Guix manual:
<a href="https://guix.gnu.org/manual/devel/en/html_node/Full_002dSource-Bootstrap.html">https://guix.gnu.org/manual/devel/en/html_node/Full_002dSource-Bootstrap.html</a></p>
</blockquote>
<p>They are very different projects, but at a practical level the main difference
between them is that <code>live-bootstrap</code> is probably easier for you to test if you
are working on any <span class="caps">GNU</span>/Linux distribution<sup id="fnref:in-guix"><a class="footnote-ref" href="#fn:in-guix">7</a></sup>.</p>
<p>If you want to reproduce this exact point in time you only need to use my fork
of <a href="https://github.com/ekaitz-zarraga/live-bootstrap/">live-bootstrap</a>, branch
<code>riscv-tcc-boot</code>. I also made a tag on it, <code>self-hosted-tcc-rv64</code>, to make it
easier to remember when this post was released. Andrius made all the magic to
set that process up to take the inputs for Mes and TinyCC from the correct tag.</p>
<p>Clone the repository, set up the dependencies and run this (if you are not in a
<span class="caps">RISC</span>-V host you need to configure Qemu and binfmt):</p>
<pre><code class="language-bash"> ./rootfs.py --bwrap --arch riscv64 --preserve
</code></pre>
<p>That should, after a long time, reach a point where there’s a properly compiled
bootstrappable TinyCC.</p>
<h4 id="guix">Using Guix for a reproducible environment</h4>
<p>I made a Guix recipe that can replicate the whole process, too. It took me a
long time to make it work, but it finally does.</p>
<p>From my <span class="caps">TCC</span> fork, reproducing this should be easy for people versed in Guix.
There’s a <code>guix</code> folder with some files (most of them broken, not gonna lie),
but there are two you should pay attention to:</p>
<ul>
<li>
<p><code>channels.scm</code> stores the state of my Guix checkout so you can reproduce it
in the future using <code>guix time-machine</code>. At the moment it doesn’t feel
necessary but if something fails when you try it, please refer to that.</p>
</li>
<li>
<p><code>commencement.scm</code> is an edited copy of the Guix bootstrapping process,
directly obtained from <code>gnu/packages/commencement.scm</code> from Guix’s codebase.
I patched this to make it work for <span class="caps">RISC</span>-V, using some more modern commits in
the dependencies.</p>
</li>
</ul>
<p>In order to reproduce all our work in Guix you just need to build the
<code>tcc-boot0</code> package from the <code>commencement.scm</code> file using <code>riscv64-linux</code> as
your <code>--system</code>. I’m a nice guy, so I added a script you can use for this; just
run:</p>
<pre><code class="language-bash">./tcc-boot0-from-source.sh
</code></pre>
<p>And that should build the whole thing. It takes hours, you have been warned.</p>
<p>Also it adds <code>--no-grafts</code> (thanks Efraim), because if you keep the grafts it
compiles the world from scratch (curl, x11… not good).</p>
<p>If you just want to build <code>mes-boot</code> as an intermediate step, I also made a
file for that:</p>
<pre><code class="language-bash">./mes-boot-from-source.sh
</code></pre>
<p>Both scripts load variables from the provided <code>commencement.scm</code> module. The
module is not complex if you are used to Guix, but it calls some complex shell
scripts in both Mes and TinyCC to build. Those contain all the magic.</p>
<h3 id="conclusions">Conclusions</h3>
<p>Of course, the problems we fixed now look easy and simple. This blog post
doesn’t really do justice to the countless debugging hours and all the nights
we, Andrius and I, spent thinking about where the issues could be coming from.</p>
<p>The debugging setup wasn’t as good as you might imagine. The early steps of the
bootstrap don’t have the debug symbols a “normal” userspace program would have.
In many cases, function names were all we had.</p>
<p>I have to thank my colleague Andrius here, because he did a really good
debugging job and provided me with small reproducers I could finally fix. Most
of the time he made the assist and I scored the goal.</p>
<p>He also did a great job with the testing, which I couldn’t do because I was
struggling with Guix from the early days, trying to make the compilers find the
header files and libraries.</p>
<p>On the emotional side, it is also a great improvement to have someone to rely
on. Andrius, Janneke and I had good teamwork and we supported each other when
our faith started to crumble. And believe me, it does crumble when a new bug
appears right after you fixed one that took you a week. There were times this
summer I thought we would never reach this point.</p>
<p>It’s also worth mentioning that the bootstrapping process is extremely slow:
it takes hours. This kills the responsiveness and makes testing way harder than
it should be. Not to mention that we are working on a foreign architecture,
which has its own problems too.</p>
<p>If you have to take some lesson from something like this, here you have a
suggestion list:</p>
<ul>
<li>The simplest error can take ages to debug if your code is crazy enough.</li>
<li>Don’t be clever. It sets a very high standard for your future self and people
who will read your code in the future.</li>
<li>I guess we can summarize the previous two points in one: If we could remove
TinyCC from the chain, we would. It’s a source of errors and it’s hard to
debug. The codebase is really hard to read for no apparent reason.</li>
<li>When build times are long, small reproducers help.</li>
<li>Add tests for each new case you find.</li>
<li>Don’t trust, disassemble and debug.</li>
<li>Be careful with C and standards and undefined behavior.</li>
<li>Integers are hard. Signedness makes them harder.</li>
<li>Being surrounded by the correct people makes your life easier.</li>
</ul>
<p>Also, as a personal note, I noticed I’m a better programmer than I was at the
previous post in this series. I feel way more comfortable with complex
reasoning and even with writing new programs in other languages, even though I
spent almost no time coding anything from scratch. It’s like dealing with this
kind of internals gives you a level of awareness that is useful in a more
general way than it looks. Crazy stuff.</p>
<p>If you can, try to play with the internals of things from time to time. It
helps. At least it helped me.</p>
<h3 id="next">What is next?</h3>
<p>Now that we have a fully featured Bootstrappable TinyCC, we need to decide what to do next.</p>
<p>In the short term, all this has to be released in the original projects: Mes,
M2, and so on. That’s the easy part, as everything has proved to be ready.</p>
<p>In the mid term, it’s not very clear what to do first. We suspect we’ll need
upstream TinyCC for the next steps, because we need many different tools to
continue the bootstrapping chain, and the Bootstrappable TinyCC might not be
enough to build them. On the other hand, when we go for a standard library
we’ll miss the extended assembly support we already mentioned. There’s some
uncertainty in the next step.</p>
<p>The long term is pretty clear, though: the goal is <span class="caps">GCC</span>. First <span class="caps">GCC</span> for C, and
then for C++, to be able to build <span class="caps">GCC</span> 7.5, which should enable the rest of the
chain pretty easily (famous last words). I anticipate we are going to have
problems with <span class="caps">GCC</span> (I know this because I left them there last time) so we’ll
need to fix those, too. Once that is done, we would use <span class="caps">GCC</span> to compile more
recent versions of <span class="caps">GCC</span> until we compile the world.</p>
<p>That’s more or less the description of what we will do in the next months.</p>
<p>And this is pretty much it. I hope you learned something new about C, the
Bootstrapping process or at least had a good time reading this wall of text.</p>
<p>We’ll try to work less for the next one, but we can’t promise that. 😉</p>
<p>Take care.</p>
<hr>
<!--
MANY OF THIS ARE REALLY HARD TO REASON ABOUT!!!!
WITH THIS WE START PASSING MANY MORE TESTS IN MESCC AND ALSO ADDED SOME EXTRA
TESTS THAT CHECK COMPLEX BEHAVIOR HERE AND THERE
- `int`s are 64 bit in MesCC and TinyCC is written like they are 32 bit.
- TinyCC's assembly for RISC-V is not complete and we need some of that in
meslibc. We implemented the missing instructions (jal, jalr, lla and some
pseudoinstructions).
- TinyCC's assembler for RISC-V uses a simplified syntax, so we need to rewrite
our meslibc according to that.
- RISC-V uses a `__global_pointer$` symbol, but TinyCC does not allow dollars
in identifiers by default. The `-fdollars-in-identifiers` flag exploded when
used so we hardcoded the flag to true.
- We backported the `long double` support from TinyCC's `mob` branch.
- And large constant generation.
- Fixed some weird casting issues in TinyCC (see Fix casting issues (missing
func_vt in riscvgen.c)
- MesCC produced binaries that were impossible to debug with GDB and OBJDUMP
complained about them. We fixed those too (some archs are missing)
- MesCC's struct initialization to zeroes like `Whatever a = {0};` initialized
everything to `22` and is now working as expected.
- `switch/case` statements in MesCC fallback always to default because they
check the fallback clause and then jump to default.
- Mes had some incompatibilities with Guile that prevented us from running the
code fast. Fixed those.
- Added support for RISC-V instruction formats in MesCC
(https://git.savannah.gnu.org/cgit/mes.git/commit/?h=wip-riscv&id=e42cf58d14520a5360d7d527d1c2c18c0a498c28)
- Added support for signed rotation in MesCC. (all arches affected)
- And also fixed some M2 things that allow all this 64 bit support happen in
MesCC, which didn't have 64 bit support before. Stikonas?
- Stikonas also fixed problems in M2:
https://github.com/oriansj/M2-Planet/commit/85dd953b70c5f607769016bbf2a0aa3de7e41b6c
- Fix Bootstrappable TinyCC's GOT (global offset table). It was just a broken
condition in an if (stikonas dealt with that)
- Meslibc again! Tinycc does not support [extended
asm](https://gcc.gnu.org/onlinedocs/gcc/Extended-Asm.html) in RV64 but
stikonas fixes it replacing the extended asm by abi-compatible handwired asm.
The good fix would be to implement it, but upstream doesn't have it either...
- `int size = 0; if (size < 8) size = 8;` does not work because TCC generated
wrong assembly and it jumps over the true branch even if it checks the
condition is ok. (reproducer in `C_TESTS/if.c`)
- Variable length arguments were broken in Bootstrappable TCC. Upstream TCC
does some string magic to support them (c2str) where the same header file is
used twice: one in the binary and one in runtime. That functionality was lost
in the ~translation~ backport. We had to push some defines to Meslibc that
support that.
- Meslibc had `typedef char int8_t` in `stdint.h` but that's not reliable,
because the C standard doesn't define the signedness of the `char`. In RISC-V
the signedness of the char is `unsigned` by default, so we have to be
explicit and say `signed char`, to avoid issues.
- Remove some 0bXXXX literals I introduced in the assembler to simplify
things... They happen not to be standard C but a GCC extension.
- Add a setjmp and longjmp implementation to meslibc that also support tinycc
assembler syntax. (copy from musl but with our syntax)
-->
<div class="footnote">
<hr>
<ol>
<li id="fn:rounds">
<p>There are many rounds. Like 7 or so. <a class="footnote-backref" href="#fnref:rounds" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:self-hosted">
<p>So it can compile itself again and again, but who would want to
do that? <a class="footnote-backref" href="#fnref:self-hosted" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:reproducer">
<p>This is how we managed to fix most of the problems in our code:
make a small reproducer we can test separately so we can inspect the
process and the result easily. <a class="footnote-backref" href="#fnref:reproducer" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:cppref">
<p>You can see an explanation in the (1) case at
<a href="https://en.cppreference.com/w/c/language/struct_initialization">cppreference.com</a> <a class="footnote-backref" href="#fnref:cppref" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
<li id="fn:stikonas">
<p>He is like that. <a class="footnote-backref" href="#fnref:stikonas" title="Jump back to footnote 5 in the text">↩</a></p>
</li>
<li id="fn:stolen">
<p>Yo, if it’s free software it’s not stealing! Please steal my code.
Make it better. <a class="footnote-backref" href="#fnref:stolen" title="Jump back to footnote 6 in the text">↩</a></p>
</li>
<li id="fn:in-guix">
<p>If you run it in Guix or in a distribution that doesn’t follow <span class="caps">FHS</span>
you’d probably need to touch the path of your Qemu installation or be
careful with the options you send to the <code>rootfs.py</code> script. <a class="footnote-backref" href="#fnref:in-guix" title="Jump back to footnote 7 in the text">↩</a></p>
</li>
</ol>
</div>More work, more people, more energy — thanks NlNet2023-07-17T00:00:00+03:002023-07-17T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2023-07-17:/bootstrapGcc7.html<p>Now it’s time to focus on combining all the previous work and making it
production ready. NlNet to the rescue, again.</p><p>I might become a little bit famous in this small world of Guix, <span class="caps">RISC</span>-V and
bootstrapping after my <a href="https://fosdem.org/2023/schedule/event/guixriscv/"><span class="caps">FOSDEM</span> talk of this year</a> and the work I did
during 2021 and 2022, which you can follow in this <a href="https://ekaitz.elenq.tech/tag/bootstrapping-gcc-in-risc-v.html">series of posts</a> I'm
continuing right now.</p>
<p>Nothing I write about here was done by me alone. Many people helped make
this happen, but in the end I'm the one writing here and making the noise. So,
before explaining anything else, I want to thank everyone involved in the
process.</p>
<p>I also want to thank <a href="https://nlnet.nl/project/GNUMes-RISCV/">NLNet / <span class="caps">NGI</span>-Assure</a> for funding the project.
Without them there wouldn’t be anything to discuss here. Their funds enabled
this work.</p>
<p>That work was funded and it is finished: I backported <span class="caps">RISC</span>-V
support to <span class="caps">GCC</span> and also to the bootstrappable
TinyCC, but that’s not enough. Everything I did has to be combined with the whole
bootstrapping toolchain, so it’s time for more.</p>
<h3>A new way</h3>
<p>Even with all the help, during the project I felt alone. The codebases are huge
(<span class="caps">GCC</span> is millions of lines of code) or very badly written (tcc, I’m looking at you), and there
are tons of moving parts (Hex0, M1, M2-Planet, Mes, tcc, bootstrappable tcc,
gcc, all the libcs…). It’s really hard to know everything, and none of us knows
the whole ecosystem deeply, so often there’s no one to ask for help. You are alone.</p>
<p>This might seem like a good thing, a challenge, and it is, but it’s also very
energy consuming. I did all I could, and I’m not sure I can take this much
further by myself.</p>
<p>Now, the project has evolved. We have most of the dots and it’s time to draw
the line that connects them.</p>
<p>In order to do that we need more collaboration as each of us has become an
expert in a different part of the chain. Also, many new problems will arise
from the interaction between the different parts.</p>
<p>Knowing that, this time I proposed something else: I wanted to make a larger
project where more people would collaborate and I asked NlNet for the funds to
continue the work from that perspective.</p>
<p>That’s good because we can pay every person involved according to their
contribution<sup id="fnref:surprise"><a class="footnote-ref" href="#fn:surprise">1</a></sup>.</p>
<h3>NlNet / <span class="caps">NGI</span> Assure</h3>
<p>Of course, I wouldn’t be writing this if NlNet hadn’t given us the funds.</p>
<p>So, yes: NlNet decided to fund us. Big thanks to them and to <span class="caps">NGI</span> Assure.</p>
<style>
.container{
display: flex;
flex-flow: row wrap;
justify-content: center;
gap: 40px;
}
.no-side-margin{
margin: 0px;
}
</style>
<div class="container">
<img class="no-side-margin" src="https://ekaitz.elenq.tech/nlnet.svg" width=200px>
<img class="no-side-margin" src="https://ekaitz.elenq.tech/NGIAssure.svg" width=200px>
</div>
<h3>The work</h3>
<p>As I introduced in the <a href="https://fosdem.org/2023/schedule/event/guixriscv/">Fosdem talk</a>, there’s a lot of integration
work to do.</p>
<p>During the last year I focused on backporting the <span class="caps">RISC</span>-V support to <span class="caps">GCC</span> and the
Bootstrappable TinyCC. I did that because I knew that would enable more work on
the whole chain of compilers we use in the bootstrapping process. But also
because I could just focus on a very specific part, forgetting about the whole
chain for a moment.</p>
<p>Now it’s time to start combining all the work together.</p>
<p>The funding includes enough tasks to make the full source bootstrapping for
<span class="caps">RISC</span>-V. This is a summary of the tasks:</p>
<ul>
<li>Finish <span class="caps">GNU</span> Mes’ <span class="caps">RISC</span>-V support</li>
<li>Build the Bootstrappable TinyCC using <span class="caps">GNU</span> Mes’ <span class="caps">RISC</span>-V support added in the
first task</li>
<li>Fix the backported <span class="caps">GCC</span> 4.6.4 package to include C++ support and fix missing functionality</li>
<li>Build the backported <span class="caps">GCC</span> 4.6.4</li>
<li>Build the upstream <span class="caps">GCC</span> 7.5 or higher with the backported <span class="caps">GCC</span> 4.6.4</li>
<li>Package the whole process and include it in Guix’s commencement module</li>
<li>Review the associated projects and fix the possible issues</li>
<li>Document all the process</li>
</ul>
<p>You are probably not familiar enough with the whole thing to know what they
really mean, but some of them are <strong>really hard</strong>.</p>
<p>I’ll go into more detail on all of them as we work on them, so don’t worry at
the moment.</p>
<h3>The feelings</h3>
<p>I’m not very excited with this project anymore. The tasks you can see in the
previous block are not good for a person like me. I really struggle with them:
configuring development environments, fixing weird imports, etc. It’s hard for
me, intellectually and emotionally.</p>
<p>I already did the parts that interested me the most and I want to move on to
something else.</p>
<p>Why ask for more funds then?</p>
<p>Well, let’s say it plainly: the funds are not for me. I’m using the success I
had with the previous project and the interest NlNet has in it to fund other
people to finish the work.</p>
<p>Whoever does a task will get the budget associated with it.</p>
<p>The plan here is to coordinate other people doing the tasks rather than
doing them myself. I don’t rule it out, though: I’ll probably need to work on
some of them.</p>
<p>The fact that I’m the one that presented the proposal doesn’t mean the proposal
is for me. The proposal is for <strong>you</strong>.</p>
<h3>The people</h3>
<p>I already managed to involve several fellow hackers and I made the proposal with
them as collaborators:</p>
<ul>
<li><strong>Efraim Flashner</strong>, who has been working on the <span class="caps">RISC</span>-V port of most of the
Guix packages, is going to take part in this second stage of the project, as he
knows the status of <span class="caps">RISC</span>-V in Guix better than anyone else.</li>
<li><strong>Danny Milosavljevic</strong>, who worked on the bootstrapping process for <span class="caps">ARM</span>, also
agreed to get involved in this.</li>
<li><strong>Jan Nieuwenhuizen</strong> (Janneke) is the Mes author and maintainer.</li>
<li><strong>Andrius Štikonas</strong> is deeply involved in the bootstrapping process too, making
a lot of patches to live-bootstrap, Mes, Hex0, M2-Planet and so on.</li>
<li><strong>Juliana Sims</strong> has also shown interest in the project because she has been
involved in <span class="caps">RISC</span>-V related projects before.</li>
</ul>
<p>You can also collaborate with us, if your contributions are good we can even
add you to the official team.</p>
<p>If you want to join us, feel free to contact me to <code>riscv-effort@elenq.tech</code> or
join <code>#bootstrappable</code> in <code>libera.chat</code> and ping me there.</p>
<p>I’m sure we all will learn a lot together during the process.</p>
<h3>Closing words</h3>
<p>So, in summary, I’m just introducing a new part of this adventure. Thanks to
NlNet, we can take all the work we have been doing separately on Mes, <span class="caps">GCC</span>,
TinyCC, Hex0, M2-Planet and so on and finally combine it all together.</p>
<p>This is a huge effort, but hopefully we’ll manage to do it, learn a lot in
the process and get paid.</p>
<p>We’ll see how it goes. I’ll keep you all informed.</p>
<p>Take care.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:surprise">
<p>It might sound really surprising to some, but some of the people
involved in this are paid zero money for their time at the moment, and they
are doing great improvements. This is a topic for a huge discussion but, in
summary: work is work, and you should get paid for it. <a class="footnote-backref" href="#fnref:surprise" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
</ol>
</div>Support Windows not supporting Windows2023-03-18T00:00:00+02:002023-03-18T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2023-03-18:/windows.html<p>About the possibility of having Windows users as clients being a software
developer who doesn’t use Windows, and how to solve that technically.</p><p>I hate Windows. I don’t like it, I don’t support the software practices of
Microsoft and I probably never will. That doesn’t mean I don’t live in this
world, where unfortunately most people use Windows. Many people have no
choice other than to use Windows, and they still deserve to have some good free
software on their computers.</p>
<p>Of course, I always try to encourage my clients (and everyone around me!) to
use free software and stop using Windows, but sometimes it’s impossible, and
it’s always better for them to use Windows with some free software made by
me<sup id="fnref:money"><a class="footnote-ref" href="#fn:money">1</a></sup> than using Windows with more proprietary bloatware done by any
garbage corporation that doesn’t care about their freedom.</p>
<p>Since I started with ElenQ Technology I always had this issue in mind, and the
time to tackle it has come, so:</p>
<blockquote>
<p>How could I make software for Windows users if I don’t use Windows, I don’t
have any machine that runs Windows and I don’t support Windows in any way?</p>
</blockquote>
<p>Until recently, most of my clients asked me for Web-based tools, so I dodged
that ball without even realizing it, but I always had the impression that I
would have to tackle the issue someday and that I was just postponing the
research.</p>
<p>During the last couple of weeks, in my spare time, I did some of that
research, and this is a simple high-level summary of the results.</p>
<p>Needless to say, this research is for myself, and I have a very strong
background to take into consideration (read the blog and you’ll see!), so it
probably won’t fit your needs in any way, but it does fit mine and probably
some of my close colleagues’.</p>
<ol>
<li><a href="#web">Web-based</a><ol>
<li><a href="#webext">Web Extensions</a></li>
</ol>
</li>
<li><a href="#jvm">Java Virtual Machine</a><ol>
<li><a href="#jvmgui">GUIs</a></li>
<li><a href="#jvmlangs">Interesting <span class="caps">JVM</span> languages</a><ol>
<li><a href="#clojure">Clojure</a></li>
<li><a href="#kawa">Kawa</a></li>
</ol>
</li>
<li><a href="#jar">Distribution: <span class="caps">JAR</span> files and UberJAR</a></li>
</ol>
</li>
<li><a href="#native">Native Binaries</a><ol>
<li><a href="#mingw">MinGW</a></li>
<li><a href="#zig">Zig</a></li>
<li><a href="#staticbin">Distribution: statically built binaries</a></li>
<li><a href="#guilibs">GUIs</a></li>
</ol>
</li>
<li><a href="#wininst">Distribution with Windows Installers</a></li>
<li><a href="#conclusion">Conclusion</a></li>
<li><a href="#finalwords">Final words</a></li>
</ol>
<h3 id="web">Web-based</h3>
<p>Most of the time I am asked to do Web stuff, because most people just
use the Web for everything<sup id="fnref:ask-vs-need"><a class="footnote-ref" href="#fn:ask-vs-need">2</a></sup>.</p>
<p>Clients are used to working with websites, the UIs are easy to make, they work on
any device… They are cool for many things, but when you need to do any
interesting native operation (e.g. reading or creating files locally) they
don’t make sense, and they require deployment, one way or another, which
may carry extra costs, effort and maintenance.</p>
<h4 id="webext">Web Extensions</h4>
<p>Another interesting option is Web Extensions (browser extensions). They are
kinda easy to make, making them work in several browsers<sup id="fnref:web-ext"><a class="footnote-ref" href="#fn:web-ext">3</a></sup> takes almost
no effort, they don’t require deployment (no servers, no pain) and they have
more permissions than a regular website.</p>
<p>The problem is that the browser is still a constrained environment, so you
might not be able to do everything you’d like in there, and they force users
to keep the browser open in order to run them.</p>
<h3 id="jvm"><span class="caps">JVM</span> runs everywhere!</h3>
<p>I’ve never been a Java fan. I don’t really like the language or the fact that
it assumes you are using an <span class="caps">IDE</span> to code in it, but I have to say the <span class="caps">JVM</span> is a
really interesting environment.</p>
<p>The main problem is that it’s pretty large, but it’s not a huge deal to tell
my clients to install it (if they don’t have it already), and it provides most
of the functionality I’d ever need out of the box.</p>
<h4 id="jvmgui"><span class="caps">GUI</span></h4>
<p>It comes with <span class="caps">GUI</span> stuff by default (Swing and <span class="caps">AWT</span><sup id="fnref:swing"><a class="footnote-ref" href="#fn:swing">4</a></sup>) and there are more
modern ways to make GUIs, like JavaFX, which I didn’t manage to make work on
my Guix machine.</p>
<h4 id="jvmlangs">Interesting <span class="caps">JVM</span> languages</h4>
<p>The best thing about the Java Virtual Machine is you don’t need to use Java for
it. There are some cool languages full of parenthesis you can use in it.</p>
<h5 id="clojure">Clojure</h5>
<p>I have some Clojure past. It’s a language I love. There were a couple of things I
didn’t like about it, though:</p>
<ul>
<li>The startup time of Clojure made me feel uncomfortable.</li>
<li>I was worried about the size of the <span class="caps">JVM</span>.</li>
<li>Most of my code relied too heavily on Leiningen for everything and I didn’t
know very well what was going on internally or which libraries were being
used (a little bit of an <span class="caps">NPM</span> effect), and I was worried about the maintenance
of the software if I was asked to make changes in the
future<sup id="fnref:maintenance-future"><a class="footnote-ref" href="#fn:maintenance-future">5</a></sup>.</li>
<li>The Java interaction is really well designed at a language level, but
integrating Clojure’s functional programming with heavily imperative Java
code (like GUIs) feels uncomfortable.</li>
</ul>
<p>I have to say I haven’t coded in Clojure for a long time, and I wasn’t that
good a programmer back when I played around with it. I would probably make way
better use of it right now, and things that felt weird then might feel way more comfortable.</p>
<p>None of these issues is a big deal anyway. For this kind of Windows project
it might be a great choice, as most of the problems don’t matter much
any more in this context. I may give Clojure another go.</p>
<h5 id="kawa">Kawa</h5>
<p>Recently I discovered Kawa, and it looks great.
Kawa programs are easy to build with no additional tools, it’s a scheme, it’s
<strong>fast</strong>, and its Java interaction feels natural.</p>
<p>Of course, there are almost no libraries written in Kawa, no tutorials, no
learning resources beyond the documentation (which is very good, by the way).</p>
<p>It’s minimalistic, it’s easy to set up and is really fast: it might be a good
choice for many projects.</p>
<h4 id="jar">Distribution: <span class="caps">JAR</span> files and UberJARs</h4>
<p>The Java world might be “too enterprisey” for someone like me, but it has
interesting features. The <code>jar</code> files are just zip files that have Java
bytecode and resources inside.</p>
<p>On any machine with Java installed, they launch automatically when
clicked. There’s no need to unpack them or anything like that.</p>
<p>There are several ways to make those <code>jar</code> files, and one is to simply insert all
the dependencies of the Java application inside the <code>jar</code>. That’s called an UberJAR.</p>
<p>Doing this ensures that all the dependencies and resource files (such
as icons and stuff like that) are inside one file that you can distribute, and it will
always run if there’s a <span class="caps">JVM</span> installed on the target machine. Simple distribution!</p>
<p>The only problem is that they are not placed in the correct system folder,
so they don’t appear in the application launchers<sup id="fnref:jpackage"><a class="footnote-ref" href="#fn:jpackage">6</a></sup>.</p>
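<p>As a sketch of the idea (the class name <code>tech.example.Main</code> and the paths are made-up placeholders), building an UberJAR by hand boils down to unpacking your dependencies’ classes into one directory and repacking everything with a manifest that points at the entry point:</p>

```shell
# Hypothetical layout: build/classes/ holds your compiled .class files
# plus the classes extracted from every dependency jar.
mkdir -p build/classes

# The manifest tells the JVM which class holds main():
printf 'Main-Class: tech.example.Main\n' > build/MANIFEST.MF

# With any JDK installed, one jar(1) call bundles it all together:
#   jar cfm app.jar build/MANIFEST.MF -C build/classes .
# and users launch the result with a double click or:
#   java -jar app.jar

cat build/MANIFEST.MF
```

<p>Build tools like Leiningen automate exactly this step with <code>lein uberjar</code>.</p>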
<h3 id="native">Native binaries</h3>
<p>Sharing prebuilt binaries is also feasible if you are using the right tools,
but they might be tricky to distribute.</p>
<p>The first thing you have to do if you want to share binaries is cross-compile
them for Windows. I found a couple of tools that fit very well with my style
here: MinGW and Zig.</p>
<h4 id="mingw">MinGW</h4>
<p>Recently I discovered this and it happens to be great. Simply put, MinGW is a
cross-compiler toolchain for Windows. It has everything you need to build your
C/C++ software for Windows: gcc, binutils, libraries and header files.</p>
<p>Pretty straightforward.</p>
<p>Guix also supports this as a target so I can even <code>guix build --target=</code> with
it and have all the fun.</p>
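<p>To illustrate, this is roughly what the workflow looks like; the cross-compile and Wine invocations are left as comments since they assume MinGW (and Wine) are installed, and the file is just a toy example:</p>

```shell
# A trivial program to cross-compile:
cat > hello.c <<'EOF'
#include <stdio.h>
int main(void) { puts("Hello, Windows!"); return 0; }
EOF

# On Debian the toolchain comes from the gcc-mingw-w64 package.
# The cross-compiler then produces a Windows PE executable:
#   x86_64-w64-mingw32-gcc -o hello.exe hello.c
# Statically linking avoids shipping the MinGW runtime DLLs:
#   x86_64-w64-mingw32-gcc -static -o hello.exe hello.c
# Wine lets you smoke-test it without a Windows machine:
#   wine hello.exe

cat hello.c
```

<p>Guix users get the same toolchain through the <code>guix build --target=x86_64-w64-mingw32</code> route mentioned above.</p>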
<h4 id="zig">Zig</h4>
<p>Zig is a great programming language and the tooling around it is absolutely
fantastic. It’s designed to make cross-compiling easy, and you don’t need anything
other than the Zig compiler itself to build your Zig, C or C++
software for a Windows machine. Just change the target and boom, it works!</p>
<p>Zig also comes with a build system that lets you describe how to build your
whole project and build it with a single command. No need to use external tools
like <span class="caps">GNU</span> Autotools, Make, CMake, Meson or anything like that. One Zig
installation comes with everything you need.</p>
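<p>As a rough sketch (assuming a <code>zig</code> binary is installed; the file names are hypothetical), the whole cross-compilation story reduces to one flag:</p>

```shell
# One Zig toolchain can emit code for many targets; no extra packages.
# A few example target triples, as accepted by `zig cc -target ...`:
printf '%s\n' x86_64-windows-gnu x86_64-linux-gnu aarch64-macos > targets.txt

# Cross-compile C for Windows with nothing but the Zig compiler:
#   zig cc -target x86_64-windows-gnu -o hello.exe hello.c
# Or build a Zig program the same way:
#   zig build-exe hello.zig -target x86_64-windows

cat targets.txt
```

<p>The triples follow Zig’s <code>arch-os-abi</code> scheme; <code>zig targets</code> prints the full list the compiler supports.</p>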
<blockquote>
<p>Another advantage of Zig is that I love the language and I’m looking for an
excuse to learn it. I think it’s very well designed and I love the community
it has. I don’t like the syntax that much but I think I’ll get used to it.</p>
</blockquote>
<h4 id="staticbin">Distribution: statically built binaries</h4>
<p>In order to distribute the binaries, the only obvious choice is to statically
link everything and hand my clients one <code>.exe</code> file. I can’t really
trust non-tech users to install all the dependencies in place, and I can’t guide
them through the process because I don’t know how to do it myself,
and I can’t try, as I don’t own any Windows machine.</p>
<p>Statically built binaries require no installation, which is great, but they are
not placed in the correct system folder and they don’t support resources such as
icons and stuff like that<sup id="fnref:resources-bin"><a class="footnote-ref" href="#fn:resources-bin">7</a></sup>. They might be ok for many things, but
they are not a perfect solution either.</p>
<h4 id="guilibs">GUIs</h4>
<p>These clients require GUIs most of the time. I can’t imagine them
running a script from the shell.</p>
<p>There are many <span class="caps">GUI</span> libraries I could use, but I’d like something that is
small and easy to build for any target. That leaves most of them out.</p>
<ul>
<li>
<p>I tried to build <a href="https://www.tecgraf.puc-rio.br/iup/"><span class="caps">IUP</span></a> myself but the
build process is basically broken. It looks good and uses native <span class="caps">GUI</span>
components, but if the build process is broken I can’t really trust it. I
could just use the binaries they provide but I don’t like that.</p>
</li>
<li>
<p>I built <a href="https://www.fltk.org/index.php"><span class="caps">FLTK</span></a> successfully without problems
and I even <a href="http://git.elenq.tech/guix-packages/commit/?id=b952e7778843adbebae0d12d6f1601e4594313eb">packaged it in my personal Guix
channel</a>.
It’s not beautiful, but it works, and I don’t expect it to be hard to build
for Windows either.</p>
</li>
<li>
<p>I could go for something like Dear ImGui, but immediate-mode GUIs are better suited for
programs that render continuously, like games and such.</p>
</li>
<li>
<p>Qt, <span class="caps">GTK</span>, wxWidgets and those are great too, but probably too much for a
simple man like me<sup id="fnref:kde"><a class="footnote-ref" href="#fn:kde">8</a></sup>.</p>
</li>
</ul>
<h3 id="wininst">Distribution with Windows installers</h3>
<p>Software distribution can be eased using a Windows installer like <span class="caps">MSI</span> or <span class="caps">MSIX</span>.
Those packages know where to install everything and do it automagically, as
Windows users are used to.</p>
<p>They require extra tools, but those might be simple enough to deal with, and they
help a lot in removing the downsides of the distribution methods described previously.</p>
<ul>
<li>
<p>The <span class="caps">GNOME</span> project has a tool called <code>msitools</code>, which exposes a similar interface
to the WiX Toolset, a popular Windows installer generator. I can use it to build
and inspect <span class="caps">MSI</span> installers.</p>
</li>
<li>
<p>Microsoft also provides <a href="https://github.com/microsoft/msix-packaging">a tool for <span class="caps">MSIX</span>
installers</a> that is Open Source
but it happens to insert
<a href="https://github.com/microsoft/msix-packaging/issues/569">telemetry</a> (what a surprise!).</p>
</li>
<li>
<p>There’s a <a href="https://github.com/jpakkane/msicreator">Python package called
<code>msicreator</code></a> that simplifies the use
of <code>msitools</code> with a simpler approach that might be more than enough for my needs.</p>
</li>
</ul>
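<p>For illustration, here is a minimal, stripped-down sketch of driving <code>wixl</code> from <code>msitools</code>; the product name, version and manufacturer are placeholders, and a real installer needs more elements (Directory, Component, Media and so on):</p>

```shell
# Minimal WiX-style description that wixl (GNOME msitools) understands.
# All Product/Package attributes below are illustrative placeholders.
cat > hello.wxs <<'EOF'
<?xml version="1.0" encoding="utf-8"?>
<Wix xmlns="http://schemas.microsoft.com/wix/2006/wi">
  <Product Id="*" Name="Hello" Version="1.0.0"
           Manufacturer="ElenQ" Language="1033">
    <Package InstallerVersion="200" Compressed="yes"/>
    <!-- Directory/Component entries pointing at hello.exe go here -->
  </Product>
</Wix>
EOF

# With msitools installed, build and inspect the MSI:
#   wixl -o hello.msi hello.wxs
#   msiinfo suminfo hello.msi

cat hello.wxs
```

<p>The nice part is that this all runs on <span class="caps">GNU</span>/Linux, so the installer can be produced without ever touching Windows.</p>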
<p>There are also some language-specific tools like PyInstaller, but that forces me
to use Python, which I like but don’t know if I want to keep using for
everything. Also, it includes the interpreter in the installer, which feels
like a bit too much.</p>
<blockquote>
<p><span class="caps">EDIT</span>: <a href="https://ekaitz-zarraga.itch.io/bee/comments">Someone</a> mentioned the
existence of <a href="https://nsis.sourceforge.io/Main_Page"><span class="caps">NSIS</span></a>, which looks
pretty promising. I add it here for future reference. It’s packaged in Guix
so it might be a good idea to give it a go.</p>
</blockquote>
<h3>Conclusion</h3>
<p>Each choice comes with its downsides and shines in specific scenarios. Working
in a classic C/C++ setup with MinGW might be great, but I have to make sure I
keep the dependency tree simple, as anything complex might fail to compile or
distribute for Windows.</p>
<p>I want to learn Zig. It’s really cool and I think it will ease the process
significantly. I’m not sure about the C integration but I need to give it a go
first. It might become my go-to language for these kind of applications.</p>
<p>In these cases I still need to find the best <span class="caps">GUI</span> library to use (suggestions welcome!).</p>
<p>The <span class="caps">JVM</span> case is also interesting. It comes with batteries included and has Swing by
default, which is horrible, but it’s something. For faster development I could
use some flexible language like Clojure or Kawa there, the second being way
faster than the first but also way less known (which shouldn’t be a problem, as I
don’t want to rely on many external libraries).</p>
<p>All these options look feasible, so I could just go for any of them. Obviously,
some have way better performance than others (C/C++/Zig vs Clojure), but the
ease of development is also something I have to take into account. That’s
something I’ll need to think about when the projects come.</p>
<p>We’ll see…<sup id="fnref:chicken"><a class="footnote-ref" href="#fn:chicken">9</a></sup></p>
<h3 id="finalwords">Final words</h3>
<p>It’s obvious that I left many options out, and I can’t wait to get some emails
from people recommending that I learn Go or Rust<sup id="fnref:rust"><a class="footnote-ref" href="#fn:rust">10</a></sup>, but this non-exhaustive
research is mostly based on my personal (and current) preferences.</p>
<p>Surely you’ll think I’m making it way more difficult than it actually is:
<em>“just install a virtual machine and build there! Buy a Windows machine if you
really want to solve this issue!!”</em> and you’d probably be right, but it doesn’t
feel right to me. So I won’t.</p>
<p>If you have other ideas or success stories that fit this line of thinking don’t
hesitate to <a href="https://ekaitz.elenq.tech/pages/about.html">contact me</a>.</p>
<p>Stay safe!</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:money">
<p>And I can earn some money in the process. <a class="footnote-backref" href="#fnref:money" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:ask-vs-need">
<p>Often, what clients want or ask for is not what they
need, so be careful with that. <a class="footnote-backref" href="#fnref:ask-vs-need" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:web-ext">
<p><span class="caps">API</span> support in all the browsers is not the same, be careful with
that. <a class="footnote-backref" href="#fnref:web-ext" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:swing">
<p>Welcome back to 2001. <a class="footnote-backref" href="#fnref:swing" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
<li id="fn:maintenance-future">
<p>Guix can help here! <a class="footnote-backref" href="#fnref:maintenance-future" title="Jump back to footnote 5 in the text">↩</a></p>
</li>
<li id="fn:jpackage">
<p>The Java ecosystem provides a tool to solve this <a href="https://openjdk.org/jeps/343">called
<code>jpackage</code></a> but unfortunately it doesn’t
cross-compile (Wine for the win?) and it would add a full Java runtime to the
installer. Maybe it’s too much. <a class="footnote-backref" href="#fnref:jpackage" title="Jump back to footnote 6 in the text">↩</a></p>
</li>
<li id="fn:resources-bin">
<p>The problem with the resources can be bypassed by a
self-expanding executable: a program that unpacks its binary contents in the
current folder. They can be made by 7zip and other tools like this, but I
don’t really like it, as they pollute the current folder and you might not
expect that to happen. Some games are distributed like this. <a class="footnote-backref" href="#fnref:resources-bin" title="Jump back to footnote 7 in the text">↩</a></p>
</li>
<li id="fn:kde">
<p>Probably you didn’t know but my first free software contribution was
for <span class="caps">KDE</span> and I had to deal with Qt. It was right in the migration from Qt4 to
Qt5. Good times. <a class="footnote-backref" href="#fnref:kde" title="Jump back to footnote 8 in the text">↩</a></p>
</li>
<li id="fn:chicken">
<p>I can always chicken out and go for PyInstaller + PyQt when the
projects come. 😐 <a class="footnote-backref" href="#fnref:chicken" title="Jump back to footnote 9 in the text">↩</a></p>
</li>
<li id="fn:rust">
<p>I don’t like Go, but I’m open to learning Rust, even if it’s a little bit
more complex than I’d like it to be (and I don’t like the syntax). It would
require me to allocate a long time for it, which is not a problem, but I need
to be sure that I will be able to get some benefit from it. <a class="footnote-backref" href="#fnref:rust" title="Jump back to footnote 10 in the text">↩</a></p>
</li>
</ol>
</div>Milestone – RISC-V support in Mes’s bootstrappable TinyCC2022-09-22T00:00:00+03:002022-09-22T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2022-09-22:/bootstrapGcc6.html<p>Bringing <span class="caps">RISC</span>-V support to the bootstrappable TinyCC Mes forked. Some
problems and a look into the future.</p><p>In the <a href="https://ekaitz.elenq.tech/tag/bootstrapping-gcc-in-risc-v.html">series</a> we already introduced <span class="caps">GCC</span>,
TinyCC, Mes and Mes’s TinyCC fork that is designed to be bootstrappable. In
this post we are going to deal with the latter, explain how we made it work for
<span class="caps">RISC</span>-V and the challenges we encountered.</p>
<h3>The non-bootstrappable nature of TinyCC</h3>
<p>As we introduced in the previous post, TinyCC is not compilable by very simple
compilers like Mes’s <code>mescc</code>. So the Mes project decided to make a <a href="https://gitlab.com/janneke/tinycc">fork that
<code>mescc</code> was able to compile</a>. Mes calls it a
<em>bootstrappable tinycc</em>.</p>
<blockquote>
<p>There’s an uninteresting philosophical debate about what
<em>bootstrappable</em> means, which leads to many errors and
misunderstandings<sup id="fnref:misunderstandings"><a class="footnote-ref" href="#fn:misunderstandings">1</a></sup>. Many compilers call themselves
bootstrappable if they can be compiled with themselves. When <strong>we</strong> talk
about this, we are looking for a <em>full-source bootstrappability</em>, that is,
that the compilers can be compiled from <em>source</em>, or from a <em>full-source
bootstrappable</em> compiler.</p>
</blockquote>
<p>TinyCC is supposed to be compilable by itself, but who compiles the version
that compiles TinyCC? Another TinyCC? And who compiles that?</p>
<p>It’s the yogurt problem we always run into: how do you make yogurt? Take
yogurt, mix it with milk and in a few hours you’ll get yogurt. See the problem?</p>
<p>If you are a culinary maniac, as I am, you can stretch this metaphor further.
If you know what you are doing, you can obtain yogurt from raw milk<sup id="fnref:kefir"><a class="footnote-ref" href="#fn:kefir">2</a></sup>.</p>
<p>That’s what our project is doing: make yogurt from raw milk at some point.</p>
<p>So compilers normally only care about the latest yogurt, but we, the
saviors of the ancient milk, those who can acidify the raw pureness, can make
yogurt starter from raw milk.</p>
<p>That’s the kind of magic nobody cares about, not in the compiler world nor in
real life.</p>
<p>The yogurt starter does not make the best yogurt, by the way; it takes
generations and generations of yogurt to make the best. That’s what our
project does: start simple (stage-0 and Mes) and go enriching the product
(TinyCC) until reaching a mature yogurt (<span class="caps">GCC</span>).</p>
<p>TinyCC does not really care about this bootstrappability concept. They only
want to be compilable with themselves. Nothing else.</p>
<p>That’s why <a href="http://joyofsource.com/">Jan</a>, the inventor of this metaphor I just
stretched to the infinite, had to fork the project. He had another option:
simplify TinyCC’s code upstream so it could be compiled from a simpler
starting point, but his ideas were rejected and some weird animosity I don’t understand
started. More on that later.</p>
<h3>The <span class="caps">RISC</span>-V support</h3>
<p>When the previous blogpost was written, TinyCC had a <span class="caps">RV64</span> backend, but the
TinyCC fork did not have <span class="caps">RISC</span>-V support.</p>
<p>My job here was to take the backend from the official TinyCC and bring it to
the bootstrappable one, Jan’s fork. I can say that it’s done. Good for me.</p>
<h4>The process</h4>
<p>I followed the cross-compiler trick again, in order to make this process easier
on my computer and because Mes doesn’t support <span class="caps">RISC</span>-V output yet. Making a
TinyCC for my x86_64 machine that had <span class="caps">RISC</span>-V output sounded more than
reasonable to me. Later I could always move to a full <span class="caps">RISC</span>-V machine to make
sure the backend was working.</p>
<p>So first I made a guix package for upstream <a href="https://github.com/ekaitz-zarraga/tcc/blob/guix_package/guix.scm#L85">TinyCC cross-compiler (for
<span class="caps">RISC</span>-V)</a>
with <span class="caps">GCC</span>. This wasn’t really obvious, because there were some variables to set
correctly. I tested whether everything compiled and worked as expected and,
apart from a couple of issues later corrected upstream, it did.</p>
<p>Next, I made a guix package for <a href="https://github.com/ekaitz-zarraga/tcc/blob/riscv-mes/guix.scm#L83">the forked TinyCC with
<span class="caps">GCC</span></a>. This
also needed some changes, as the forked one is quite an old version of TinyCC.
The build here needs a <code>libtcc1.a</code>, which can be empty if the project is
compiled with <span class="caps">GCC</span> (<code>libgcc</code> provides that functionality), but the compilation
process doesn’t mention anything about this, and coming up with that by
yourself is hard.</p>
<p>Now that the project was compilable, it was time to code. You can see this part in
the <code>riscv-mes</code> branch:</p>
<p><a href="https://github.com/ekaitz-zarraga/tcc/commits/riscv-mes">https://github.com/ekaitz-zarraga/tcc/commits/riscv-mes</a></p>
<p>I took the backend from the upstream and inserted it in the fork. Of course, it
didn’t compile. Many internal structures and APIs changed, so after trying to
stitch it all together myself, I headed to the mailing list. At the beginning I
wanted to think the answers I was getting were due to me not explaining my
doubts properly, but what was actually happening was that the animosity
towards our fork (a decision I didn’t make) surfaced, and someone tried to
ridicule me on the mailing list for no reason at all.</p>
<p>The funny thing is I’d never have needed to contact the mailing list if the
project were as well written as they claim it to be. It’s full of one-character
functions and variables, and the code is mixed together in a very aggressive
way… It’s supersmall, tiny even, but really hard to read. Also, the commits
are not very descriptive for anyone who is not the main maintainer, who,
surprise! is the same person who gives aggressive answers on the mailing
list… I hope it’s only my perception and they are nice to their friends and
family, but the interaction made me feel uncomfortable and I don’t want to
touch this code again.</p>
<p>It was a sad moment, I must admit. But I decided I was going to do this with
help or without it. And I think I did. I removed references here and there and
finally it looks like I got somewhere.</p>
<p>There are some differences to point out: one of the commits that made me ask on
the mailing list was a huge change in the way conditionals are handled in
TinyCC. Our fork didn’t have that, so I needed to split the code into several
pieces, and the benefits of that commit (some instruction optimization) are
lost in the backport. Still, the branching and jumping are correct, just less
optimal. Not bad.</p>
<p>Code added and compiled, it was time for testing. I made a little script (I
didn’t share it, but it’s not really relevant either) and a small test case
of simple C files, compiled (not linked) them with both the upstream version of
the compiler and the forked one, then disassembled them and compared the differences.</p>
<p>You can try it yourself: build the upstream TinyCC and the fork and make them
compile (<code>-c</code>) some files. Use <code>objdump --disassemble</code> and compare the results.
It’s not really hard to test. Here’s an example of a program you can build:</p>
<pre><code class="language-clike">// Example file to build
int main (int argc, char *argv[]){
int a = 19, b = 90;
if (a && b){
return 1;
} else {
return 45 + 90 << 8;
}
}
</code></pre>
<p>And the result it should give in both versions, optimized (upstream) and
unoptimized (our fork):</p>
<pre><code class="language-text">OPTIMIZED VERSION || UNOPTIMIZED VERSION
===============================================||==================================================
0000000000000000 <main>: || 0000000000000000 <main>:
0: fd010113 addi sp,sp,-48 || 0: fd010113 addi sp,sp,-48
4: 02113423 sd ra,40(sp) || 4: 02113423 sd ra,40(sp)
8: 02813023 sd s0,32(sp) || 8: 02813023 sd s0,32(sp)
c: 03010413 addi s0,sp,48 || c: 03010413 addi s0,sp,48
10: 00000013 nop || 10: 00000013 nop
14: fea43423 sd a0,-24(s0) || 14: fea43423 sd a0,-24(s0)
18: feb43023 sd a1,-32(s0) || 18: feb43023 sd a1,-32(s0)
1c: 0130051b addiw a0,zero,19 || 1c: 0130051b addiw a0,zero,19
20: fca42e23 sw a0,-36(s0) || 20: fca42e23 sw a0,-36(s0)
24: 05a0051b addiw a0,zero,90 || 24: 05a0051b addiw a0,zero,90
28: fca42c23 sw a0,-40(s0) || 28: fca42c23 sw a0,-40(s0)
2c: fdc42503 lw a0,-36(s0) || 2c: fdc42503 lw a0,-36(s0)
30: 00051463 bnez a0,38 <main+0x38> || 30: 00051463 bnez a0,38 <main+0x38>
34: 0180006f j 4c <main+0x4c> || 34: 01c0006f j 50 <main+0x50>
38: fd842503 lw a0,-40(s0) || 38: fd842503 lw a0,-40(s0)
3c: 00051463 bnez a0,44 <main+0x44> || 3c: 00051463 bnez a0,44 <main+0x44>
40: 00c0006f j 4c <main+0x4c> || 40: 0100006f j 50 <main+0x50>
44: 0010051b addiw a0,zero,1 || 44: 0010051b addiw a0,zero,1
48: 0100006f j 58 <main+0x58> || 48: 0140006f j 5c <main+0x5c>
4c: 00008537 lui a0,0x8 || 4c: 0100006f j 5c <main+0x5c>
50: 7005051b addiw a0,a0,1792 || 50: 00008537 lui a0,0x8
54: 00000033 add zero,zero,zero || 54: 7005051b addiw a0,a0,1792
58: 02813083 ld ra,40(sp) || 58: 00000033 add zero,zero,zero
5c: 02013403 ld s0,32(sp) || 5c: 02813083 ld ra,40(sp)
60: 03010113 addi sp,sp,48 || 60: 02013403 ld s0,32(sp)
64: 00008067 ret || 64: 03010113 addi sp,sp,48
|| 68: 00008067 ret
</code></pre>
<p>On the right you can see some duplicated <code>j</code> instructions, but that’s
not a problem: the rest of the addresses are calculated properly, and the
duplicated jumps are never reached.</p>
<h4>Last step</h4>
<p>So the code is added to the fork and it seems to work. That’s what I promised
to do, but I wanted to go a little bit further and test if Mes was able to
handle the code I added to the TinyCC fork.</p>
<p>To do that, I made another branch in the project where I changed the
package and some configuration in order to compile the forked TinyCC using Mes.</p>
<p>You can see what I did here:</p>
<p><a href="https://github.com/ekaitz-zarraga/tcc/commits/mes-package">https://github.com/ekaitz-zarraga/tcc/commits/mes-package</a></p>
<p>It turns out I managed to build the thing, using Mes on my x86_64 machine and
choosing <span class="caps">RISC</span>-V as the backend, but the result doesn’t work at all.</p>
<p>The resulting compiler generates empty files that have no permissions and fails instantly.</p>
<p>At least we tested that <code>mescc</code> is ok with the C constructs we used in the
backport of the <span class="caps">RISC</span>-V support. But there are still many things to test and
this isn’t easy at all.</p>
<p>Let me give you some examples of how tricky this process is.</p>
<p>This line in the <code>guix.scm</code> file<sup id="fnref:line"><a class="footnote-ref" href="#fn:line">3</a></sup>:</p>
<pre><code class="language-clike"> "--extra-cflags=-Dinline= -DONE_SOURCE=1"
</code></pre>
<p>does two crazy preprocessor tricks, inserted as C flags. It’s equivalent to
adding these macros at the top level of the sources:</p>
<pre><code class="language-clike">#define inline
#define ONE_SOURCE 1
</code></pre>
<p>The first one removes the word <code>inline</code> from the source code, because <code>mescc</code>
does not support it. The second defines <code>ONE_SOURCE</code> to an actual value: if
it’s only defined, without a value, as the makefile does by default, value
checks like <code>#if ONE_SOURCE</code> don’t match properly. Finding this is not obvious.</p>
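<p>To see these tricks in isolation, here’s a small standalone sketch (mine, not
TinyCC code) of what the two flags amount to. The <code>twice</code> function is made up
for illustration; only the two <code>#define</code>s correspond to the flags above:</p>
<pre><code class="language-clike">#include <stdio.h>

/* Equivalent of "-Dinline=": every "inline" keyword is erased by the
 * preprocessor, so a compiler that doesn't know the keyword never sees it. */
#define inline

/* Equivalent of "-DONE_SOURCE=1": the macro gets a real value. With a bare
 * "-DONE_SOURCE" it would expand to nothing and a value check like
 * "#if ONE_SOURCE" would turn into an empty, invalid "#if". */
#define ONE_SOURCE 1

static inline int twice (int x) { return 2 * x; } /* "inline" vanishes */

int main (void)
{
#if ONE_SOURCE
  printf ("%d\n", twice (21));
#endif
  return 0;
}
</code></pre>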
<p>That’s of course not the only thing; we found many others. I spent a couple
of weeks making the build process work for <code>mescc</code>, and when I thought it was
working, the result was a broken binary. Pretty fun.</p>
<p>And why all this trouble, you might ask?</p>
<p>Jan’s fork is not compiled using the <code>configure</code> script and the <code>Makefile</code> the
project comes with; he wrote some shell scripts to build everything. I wanted to
try to build the project directly as it came for several reasons: the scripts are
prepared for native compilers and not for the cross-compiler I was building;
they use Mes from source, but I just needed to use the upstream one; and I
thought integrating all this into the normal build process would be an extra win.</p>
<p>I lost this time though.</p>
<p>The compilation process might be missing some libraries, or some stubs might be
in use instead of the real code… Maybe the problem is I’m using the x86_64
version of Mes, which is not thoroughly tested… But using the i386 version is
not possible because I’m building for 64-bit <span class="caps">RISC</span>-V and the i386 version doesn’t
know how to deal with 64-bit words… Honestly, I don’t know what to do.</p>
<h3>Something cool to say</h3>
<p>Mes does not compile following the classic process. Mes is integrated with some
tools from the stage-0 project, so it uses the M1 macro system, hex0 and all
those kinds of tools to build the programs.</p>
<p>During the process I found that some of the M1 instructions Mes was generating
were not available in the macro definitions, so I had to add a few extra
instructions to the M1 macro definitions for Mes. Here’s the (slightly simplified) diff I had to make:</p>
<pre><code class="language-diff">diff --git a/lib/x86_64-mes/x86_64.M1 b/lib/x86_64-mes/x86_64.M1
index 9ffbbf15..64997c55 100644
--- a/lib/x86_64-mes/x86_64.M1
+++ b/lib/x86_64-mes/x86_64.M1
@@ -147,6 +148,10 @@ DEFINE mov____0x8(%rbp),%rsp 488b65
DEFINE mov____0x8(%rdi),%rax 488b47
DEFINE mov____0x8(%rdi),%rbp 488b6f
DEFINE mov____0x8(%rdi),%rsp 488b67
+DEFINE mov____(%rax),%si 668b30
+DEFINE mov____(%rax),%sil 408a30
+DEFINE mov____%si,(%rdi) 668937
+DEFINE mov____%sil,(%rdi) 448837
DEFINE movl___%eax,0x32 890425
DEFINE movl___%edi,0x32 893c25
DEFINE movl___%esi,(%rdi) 8937
base-commit: aa5f1533e1736a89e60d2c34c2a0ab3b01f8d037
</code></pre>
<p>Now, with those instructions added, my package got a little bit more complex:
I had to extend the Mes package with my patch until that change is accepted
upstream. But this is great! Using software and improving it while you use it
is the best feeling in life!<sup id="fnref:choco"><a class="footnote-ref" href="#fn:choco">4</a></sup></p>
<p>Let me use this point to show you a little bit of how this macro system works.
You can see this <code>x86_64.M1</code> file has three columns: <code>DEFINE</code>, some text, and a
number in hex. This is a kind of assembler description. The M1 program
receives a file written with instructions that look like the text in the
second column of the <code>.M1</code> file and converts them one by one to the numbers in
the third. In short, the <code>.M1</code> file is a reference that tells the M1 program
how to do the conversion.</p>
<p>M1 is just a text replacement tool that makes the conversion based on the
definitions it gets from the <code>.M1</code> file. It lets us write instructions in a way
that looks like they have meaning (that’s what an assembler is, after all).</p>
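<p>As a toy illustration (my own sketch, not real M1 code), the core of that
conversion is just a lookup table, here built from the four defines added in
the diff above:</p>
<pre><code class="language-clike">#include <stdio.h>
#include <string.h>

/* Sketch of M1's text replacement: map each mnemonic text to the hex
 * digits it stands for. The four entries are the ones added in the diff
 * above; the lookup "assembles" a single token. */
struct m1_define { const char *text; const char *hex; };

static const struct m1_define table[] = {
  { "mov____(%rax),%si",  "668b30" },
  { "mov____(%rax),%sil", "408a30" },
  { "mov____%si,(%rdi)",  "668937" },
  { "mov____%sil,(%rdi)", "448837" },
};

static const char *
m1_lookup (const char *token)
{
  size_t i;
  for (i = 0; i < sizeof table / sizeof table[0]; i++)
    if (strcmp (table[i].text, token) == 0)
      return table[i].hex;
  return NULL; /* real M1 would report an unknown macro here */
}

int main (void)
{
  puts (m1_lookup ("mov____%sil,(%rdi)"));
  return 0;
}
</code></pre>
<p>Feed it a string from the second column and you get the third-column hex out;
an assembler run is just this substitution applied to a whole file.</p>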
<p>Later, those numbers are converted to binary using Hex0 or another, slightly
more sophisticated, tool.</p>
<p>All these tools are written in a way that can be audited (Hex0 is written in
Hex0…) and they are executed from source at their very beginning.</p>
<p>This is how we make yogurt directly from milk. Cool huh?
Props to <a href="http://bootstrappable.org/">http://bootstrappable.org/</a></p>
<h3>Conclusions</h3>
<p>Back to the project, considering the fact that I didn’t manage to build a fully
working TinyCC with a <span class="caps">RISC</span>-V backend using Mes, is this a failure?</p>
<p>I wouldn’t say so.</p>
<p>The new <span class="caps">RISC</span>-V backend is added and tested in the forked TinyCC, using <span class="caps">GCC</span> as a
compiler. That’s a big chunk of the work.</p>
<p>On the other hand, since I can compile the forked TinyCC with <code>mescc</code>, even if
the result didn’t work, I can say the code I added was processed, so it was
technically acceptable to <code>mescc</code>. Not bad, but we’ll still need to see how
true this is.</p>
<p>In the end, these kinds of small steps make progress, and having everything
documented here and in the commits on the git repositories helps others continue
what I just did.</p>
<p>Now, I’m going to leave this as finished, as the code is supposed to work. All
the dots are more or less drawn. Now it’s time for another project, one that
connects all the dots of the <span class="caps">RISC</span>-V full-source bootstrap: from <code>mescc</code>
(which already has some <span class="caps">RISC</span>-V support) to the forked TinyCC (I added the <span class="caps">RISC</span>-V
support), then to mainline TinyCC (which has <span class="caps">RISC</span>-V support) and/or <span class="caps">GCC</span> 4.6.4 (I
added <span class="caps">RISC</span>-V support), from one of those to <span class="caps">GCC</span> 7.5 (the first release with
<span class="caps">RISC</span>-V support), and then to the world.</p>
<p>My work in this project left all the breadcrumbs in the forest, ready for
anyone to follow<sup id="fnref:breadcrumbs"><a class="footnote-ref" href="#fn:breadcrumbs">5</a></sup>.</p>
<p>That person can be me, anyone else or even a group of people. All I can say is
I won’t forget this project, I’ll always be reachable for advice and I’ll try to
help as much as I can. As I always do.</p>
<p>These days I’ll keep giving this a few more tries and I may reach
something else, but I won’t be as busy with it as I’ve been. I think I gave
everything I could to this project. There’s still a lot to do, but what’s
left is not something I can do alone.</p>
<p>Until next time.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:misunderstandings">
<p>I’ve encountered many misunderstandings about my project too.
Some people have told me all this work is worthless because you can always
bootstrap on an x86_64 machine and then continue the bootstrapping effort on
your <span class="caps">RISC</span>-V one. And so on. That’s why this blog doesn’t have a comment section.
People insist on believing that other people’s work is worthless or that they
could do it more simply with no effort. I won’t claim that my explanations are the
best, but I can claim to be the laziest person I know, and I’d never spend time
on something that isn’t worth the effort. <a class="footnote-backref" href="#fnref:misunderstandings" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:kefir">
<p>With kefir you are fucked. We don’t know where it comes from. Luckily
we harvested a lot and it’s easy to grow. <a class="footnote-backref" href="#fnref:kefir" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:line">
<p><a href="https://github.com/ekaitz-zarraga/tcc/blob/mes-package/guix.scm#L196">https://github.com/ekaitz-zarraga/tcc/blob/mes-package/guix.scm#L196</a> <a class="footnote-backref" href="#fnref:line" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:choco">
<p>Chocolate and hot coffee too. <a class="footnote-backref" href="#fnref:choco" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
<li id="fn:breadcrumbs">
<p>I hope someone follows them before the birds eat them. <a class="footnote-backref" href="#fnref:breadcrumbs" title="Jump back to footnote 5 in the text">↩</a></p>
</li>
</ol>
</div>Adding TinyCC to the mix2022-08-01T00:00:00+03:002022-08-01T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2022-08-01:/bootstrapGcc5.html<p>Discussing what changes need to be done to make <span class="caps">GCC</span> compilable from a
simpler C compiler, TinyCC.</p><p>In the <a href="https://ekaitz.elenq.tech/tag/bootstrapping-gcc-in-risc-v.html">series</a> we already introduced <span class="caps">GCC</span>,
made it able to compile C programs and so on, but we didn’t solve how to build
that <span class="caps">GCC</span> with a simpler compiler. In this post I’ll try to explain which
changes must be applied across the ecosystem to make this possible.</p>
<h3>The current status</h3>
<p>I already talked about this in the past, but it’s always a good moment to
recall the bootstrapping process we are immersed in. There are steps before
these, but I’m going to start at <span class="caps">GNU</span> Mes, which is the core of all this.</p>
<p>For the part that interests us, <span class="caps">GNU</span> Mes has a C compiler called MesCC. This C
compiler is the one we use to compile TinyCC, and we use that TinyCC to compile
a really old version of <span class="caps">GCC</span>, 2.95, and from that we compile more recent
versions until we reach the current one. From the current one we compile the world.</p>
<p>That’s the theory, and it’s what we currently have in the most widely supported
architectures (<code>i386</code> and maybe some <span class="caps">ARM</span> flavour). Problems arise when you deal
with a new architecture, like the one we have to deal with: <span class="caps">RISC</span>-V.</p>
<p><span class="caps">RISC</span>-V was invented recently, and compilers did not add support for it
until a few years ago. <span class="caps">GCC</span> added support for <span class="caps">RISC</span>-V in version 7.5 which, as we
have been discussing throughout this series, needed a C++ compiler in order
to be built. That’s a problem we almost solved in the previous steps,
backporting the <span class="caps">RISC</span>-V support to a <span class="caps">GCC</span> that only needed a C compiler to be built.</p>
<p>Now, extra problems appear. Which C compiler are we going to use to build that
<span class="caps">GCC</span> 4.6.4 that has the <span class="caps">RISC</span>-V support we backported?</p>
<p>According to the process we described, we should use <span class="caps">GCC</span> 2.95, but it doesn’t
support <span class="caps">RISC</span>-V so we would need to backport the <span class="caps">RISC</span>-V support to that one too.
That’s not cool.</p>
<p>Another option would be to remove <span class="caps">GCC</span> 2.95 from the equation and compile
<span class="caps">GCC</span> 4.6.4 directly with TinyCC, if that’s possible, making the whole
process faster by removing some dependencies. But this means TinyCC has to be able
to compile <span class="caps">GCC</span> 4.6.4. We are going to try this route, but it requires
some work, which we will describe today.</p>
<p>On the other hand, in order to be able to build all this for <span class="caps">RISC</span>-V, TinyCC and
MesCC have to be able to target <span class="caps">RISC</span>-V…</p>
<p>Too many conditions have to be true for all this to work. But hey! Let’s go step
by step.</p>
<h3><span class="caps">RISC</span>-V support in TinyCC</h3>
<p>First, we have to make sure that TinyCC has <span class="caps">RISC</span>-V support, and it does: for
a while now, TinyCC has been able to compile, assemble and link for <span class="caps">RISC</span>-V,
though only for 64 bits.</p>
<p>I tested this support using a TinyCC cross-compiler and it works. If you want
to try it, I have a simple <a href="https://github.com/ekaitz-zarraga/tcc/blob/guix_package/guix.scm">Guix package</a> for the cross compiler,
and I also fixed the official Guix package for the native TinyCC, which had
been broken for a long time.</p>
<p>Still, I didn’t test the <span class="caps">RISC</span>-V support natively, but if the cross-compiler
works, chances are the native one will also work, so I’m not really worried
about this point.</p>
<h3><span class="caps">GNU</span> Mes compiling TinyCC</h3>
<p><span class="caps">GNU</span> Mes supports an old C standard that is simpler than the one TinyCC uses, so
it uses a fork of TinyCC with some C features removed. This fork was done way
before the <span class="caps">RISC</span>-V support was added to TinyCC and many things have changed
since then.</p>
<p><a href="https://www.youtube.com/watch?v=-1qju6V1jLM">We need to backport the TinyCC <span class="caps">RISC</span>-V support to Mes’s own TinyCC fork,
then.</a> Or at least do something
about it.</p>
<p>When I first took a look into this issue, I thought it would be an easy fix; I
had already backported <span class="caps">GCC</span>, which is orders of magnitude larger than TinyCC… But
it’s not that easy. TinyCC’s internal <span class="caps">API</span> changed quite a bit since the fork
was made, and I need to review all of it in order to make this work. Also, this
process requires converting all the modern C that is not supported by
MesCC into the older C constructs that are available in it.</p>
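<p>As a hypothetical example of that conversion (the exact feature set MesCC
accepts may differ, so take this as a sketch): C99-style declarations have to
be hoisted to the top of the block, roughly like this:</p>
<pre><code class="language-clike">#include <stdio.h>

/* Modern style, common in current TinyCC code: */
int sum_modern (const int *v, int n)
{
  int total = 0;
  for (int i = 0; i < n; i++) /* C99: declaration inside "for" */
    total += v[i];
  return total;
}

/* Roughly C90-style equivalent, with declarations hoisted: */
int sum_c90 (const int *v, int n)
{
  int total;
  int i;
  total = 0;
  for (i = 0; i < n; i++)
    total += v[i];
  return total;
}

int main (void)
{
  int v[4] = { 1, 2, 3, 4 };
  printf ("%d %d\n", sum_modern (v, 4), sum_c90 (v, 4));
  return 0;
}
</code></pre>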
<p>It’s a lot of work, but it’s doable to a certain degree, and it might represent
a big step for the full-source bootstrap process. Like what I did in <span class="caps">GCC</span>, it’s
not going to solve everything, but it’s a huge step in the right direction.</p>
<h3><span class="caps">GNU</span> Mes supporting <span class="caps">RISC</span>-V</h3>
<p>On the lower-level part of the story, if we want to make this whole process work
for <span class="caps">RISC</span>-V, <span class="caps">GNU</span> Mes itself should be runnable on it, and able to generate
binaries for it.</p>
<p><a href="https://lists.gnu.org/archive/html/bug-mes/2021-04/msg00031.html">There have been efforts</a> to make all this possible, and I
don’t expect this support to take long to appear finally in <span class="caps">GNU</span> Mes. It’s just
a matter of time and funding. I am aware that Jan is also interested on
spending time on this, so I think we are covered on this area.</p>
<h3><span class="caps">GCC</span> compilation with TinyCC</h3>
<p>The only point we are missing then is to be able to build the backported <span class="caps">GCC</span>
from TinyCC, without the intermediate <span class="caps">GCC</span> 2.95. This a tough one to test and
achieve, because the <span class="caps">GCC</span> compilation process is extremely complex, and we need
to make quite complex packages for this process to work.</p>
<p>On the other hand, the work I already did, packaging my backported <span class="caps">GCC</span> for Guix,
is not enough for several reasons: it was designed to work with a modern <span class="caps">GCC</span>
toolchain, and not with TinyCC; and a cross-compiler is not the same thing as a
native one.</p>
<p><span class="caps">GCC</span> is normally compiled in stages, which are called <em>bootstrap</em> by the <span class="caps">GCC</span>
build system. I described a little bit of that process <a href="https://ekaitz.elenq.tech/bootstrapGcc3.html#fn:staged">in a footnote in the
past</a>. That process is not activated in a cross-compilation
environment, which is what I used when the backend I backported was
<del>back</del>tested. If the <em>bootstrap</em> process doesn’t work, it means the
compilation process fails, so this introduces possible errors in the build
system which we were avoiding thanks to the cross-compilation trick.</p>
<p>I did this on purpose, of course. I just wanted a simple working environment
that let me test the backported <span class="caps">RISC</span>-V backend of the compiler, but
now we need to make a proper package for <span class="caps">GCC</span> 4.6.4, and make it work for TinyCC.</p>
<p>I wouldn’t mention this if I hadn’t tried and failed to make this package. It’s
not especially difficult to make a package, or so it seems, until you
get errors like:</p>
<pre><code class="language-weird-error-lol">configure: error: C compiler cannot create executables
`¯\_(ツ)_/¯`
</code></pre>
<p>That being said, this is not only a packaging issue. As we already mentioned,
we are removing <span class="caps">GCC</span> 2.95 from the pipeline, so TinyCC has to be able to deal
with the <span class="caps">GCC</span> 4.6.4 codebase directly, including the backport I did.</p>
<p>The easiest way to test this is to compile <span class="caps">GCC</span> 4.6.4 for x86_64 on my machine,
with no emulation in between, so we can find the things TinyCC can’t deal with.
Later we would be able to test this further in an emulated environment or
directly on a <span class="caps">RISC</span>-V machine to make sure TinyCC can deal with the <span class="caps">RISC</span>-V
backend, but for a first review of the <span class="caps">GCC</span> core, using x86_64 can be enough.
It requires no weird setup beyond a working package… Ouch!</p>
<p>I’m not really good at this part and I’m not sure if anyone else is, but I
don’t feel like spending time trying to make this package cascade work. I feel
my time is better spent on fixing stuff, or, once the package cascade is
done, on fixing the compatibility.</p>
<p>During the whole project, making Guix packages and figuring out build systems
is the part where the most time was spent, and it’s the one with the lowest success
rate. It feels like I wasted hours trying to make the build process work for nothing.</p>
<p>The funny part of this is that Guix is partially to blame here: not
conforming to the <span class="caps">FHS</span> and handling inputs in this weird way is what makes the
whole process really complex. Code has to be patched to find the libraries,
scripts must be patched too, binaries are hard to find… On the good side,
it’s Guix that makes this work worth the effort, and it’s also what makes this
process reproducible, once it’s done, so everyone can enjoy it.</p>
<h4>Wait, but didn’t Mes use a TinyCC fork?</h4>
<p>Oh yeah, of course. What I forgot to mention is that the step we just described,
making TinyCC able to compile the backported <span class="caps">GCC</span> 4.6.4, is not as simple
as I made it sound. If we use upstream TinyCC to compile <span class="caps">GCC</span>, who is going to
compile that TinyCC? We already said MesCC is not able to do that directly.</p>
<p>We could build that TinyCC with the TinyCC fork Mes has, or make the TinyCC fork
go directly for <span class="caps">GCC</span> 4.6.4, but in any case there’s an obvious task to
tackle: the <span class="caps">RISC</span>-V support must arrive in the TinyCC fork before we can do
anything else. And that’s where I want to focus.</p>
<h3>This is not only about <span class="caps">RISC</span>-V</h3>
<p>I have to be clear with you: I mixed two problems together and I did that on purpose.</p>
<p>On the one hand we have the <span class="caps">RISC</span>-V support related changes. And on the other
hand we have the changes on the compilation pipeline: the removal of <span class="caps">GCC</span> 2.95.</p>
<p>The second part is just a consequence of the first, but it’s not only related
to the <span class="caps">RISC</span>-V world. Once we have our compilers ready, we are going to apply
the change to the whole thing. Removing a step is a really important task for
many reasons, but one is obvious at this point: having a really old compiler
like <span class="caps">GCC</span> 2.95 forces us to stay with the architectures it was able to target,
or makes us add them and maintain them ourselves. It’s a huge flexibility
issue for the little gain it gives: <span class="caps">GCC</span> 4.6.4 is already compilable from a C90 compiler.</p>
<p>So, this is an important milestone, not only for my part of the job but also
for the whole <span class="caps">GNU</span> Mes and bootstrapping effort. Skipping <span class="caps">GCC</span> 2.95 has to be
done on every architecture, and the packaging effort that requires is unavoidable.</p>
<h3>What I already did</h3>
<p>While I was reviewing what needed to be done, I started doing things here
and there, preparing the work and making sure I understood the context better.</p>
<p>First, I realized I had introduced some non-C90 constructs in the backport of <span class="caps">GCC</span>,
because I directly copied some code from 7.5, and I removed those. This is
important, because we need to be able to compile all this with TinyCC, and I
don’t expect TinyCC to support modern constructs.</p>
<p>I packaged a TinyCC <span class="caps">RISC</span>-V cross compiler <a href="https://github.com/ekaitz-zarraga/tcc/blob/guix_package/guix.scm">for the upstream
project</a>, and also for <a href="https://github.com/ekaitz-zarraga/tcc/blob/riscv-mes/guix.scm">the Mes fork</a>, even
though the latter cannot be compiled yet: we need to backport
the backend in order to make it work. Still, it’s important work, because it
lets me start the backport easily. I’ll need to apply more changes on top of
it, for sure, but at the moment I have all I need to start coding the new backend.</p>
<p>I spent countless hours trying to make a proper <span class="caps">GCC</span> package and trying to use
TinyCC as the C compiler for it, with no success. This is why I decided to move
on and work on a more interesting and usable part: adding the <span class="caps">RISC</span>-V backend to
the Mes fork of TinyCC.</p>
<p>Of course, I already started working on the <span class="caps">RISC</span>-V support of the TinyCC fork
from Mes, and started encountering <span class="caps">API</span> mismatches here and there. Most of them
are related to optimizations introduced after the fork, which I need to
review in more detail in the upcoming weeks. I also spent some time trying to
understand how TinyCC works, and it’s a very interesting approach, I have to
say<sup id="fnref:maybe"><a class="footnote-ref" href="#fn:maybe">1</a></sup>.</p>
<h3>Conclusions</h3>
<p>I’d love to tackle all these problems together and fix the whole system, but
I’m just one guy coding from his couch. It’s not realistic to think I can fix
everything, and trying to do so is detrimental to my mental health.</p>
<p>So I decided to go for the <span class="caps">RISC</span>-V support for the TinyCC fork we have at Mes.
This would leave all the ingredients ready for someone more experienced than me
to make the final recipe.</p>
<p>The same thing happened with the <span class="caps">GCC</span> backport. I didn’t really finish the job:
there’s no C++ compiler working yet, but that’s not what matters. Anyone can
take what I did, package it properly (which happened to be an impossible
task for me) and get it ready. We already made a huge step.</p>
<p>Fighting against a wall is bad for everyone; it’s better to pick a task where
you can provide something. You feel better, and the overall state of the
project improves. Achieving things is the best gasoline you can get for
achieving new things.</p>
<p>Regarding the task I chose, I’ve already spent some hours working on it. It’s
not an easy task. The internal TinyCC <span class="caps">API</span> has changed a lot since the moment the
fork was made, and there have been many commits related to <span class="caps">RISC</span>-V since then. One
of the most recent ones fixes the <span class="caps">RISC</span>-V assembler, after I reported a few
weeks ago that it wasn’t working. All these changes must be reviewed carefully,
undoing the <span class="caps">API</span> changes and, most importantly, keeping the code compatible with
<span class="caps">GNU</span> Mes’s C compiler.</p>
<p>Not an easy task.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:maybe">
<p>Maybe I’ll have the time to explain it in a future blog post, maybe
not. <a class="footnote-backref" href="#fnref:maybe" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
</ol>
</div>Milestone — Source to Binary RISC-V support in GCC 4.6.42022-06-20T00:00:00+03:002022-06-20T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2022-06-20:/bootstrapGcc4.html<p>Description of the changes applied from a minimal compiler that runs and
generates assembly to something that is actually able to compile,
interacting with binutils and having a working libgcc.</p><p>In the <a href="https://ekaitz.elenq.tech/tag/bootstrapping-gcc-in-risc-v.html">series</a> we already introduced <span class="caps">GCC</span>,
and we already shared how I backported the <span class="caps">RISC</span>-V support from the <span class="caps">GCC</span> core to
<span class="caps">GCC</span>-4.6.4. Now it’s time to finish what we left half-done and actually
introduce a <em>full</em> <span class="caps">RISC</span>-V compiler.</p>
<h3>Where we left last time</h3>
<p>On Tuesday, the 7th of April, I marked a commit with the <code>minimal-compiler</code> tag.
That commit contains all the work we had done until that point. In that tag we
describe how to build a compiler that is only able to compile files to
<span class="caps">RISC</span>-V assembly.</p>
<p>As we already explained around here, <span class="caps">GCC</span> is a driver program that calls other
programs to do its work. The <span class="caps">GCC</span> core compiles the code to assembly language
and then calls binutils to do the rest of the work: assembling and linking.</p>
<p>At that point, we had to call binutils by hand.</p>
<h3>The changes</h3>
<p>The changes applied at the time of writing are available in the
<a href="https://github.com/ekaitz-zarraga/gcc/releases/tag/working-compiler"><code>working-compiler</code></a> tag. As the tag message describes, they
were split into two different branches: the <code>guix_package</code> branch and the <code>riscv</code>
branch.</p>
<p>The <code>guix_package</code> branch is merged into the <code>riscv</code> branch, but this split lets
us differentiate which changes are related to the compiler itself and which
are related to the tooling around it. That way we’ll be able to
choose what to do with the commits easily in the future. We’ll probably need to
rearrange some stuff.</p>
<h3>The context is everything: Guix package part</h3>
<p>The <code>guix_package</code> branch contains all the commits that make the Guix tooling
around the project work. This includes the compilation process definition in a
reproducible way, the environment setup and all that.</p>
<p>As the <code>working-compiler</code> tag message describes, this is the way you can
currently make this compiler work and play with it:</p>
<pre><code class="language-bash">$ guix shell -m manifest.scm
$ source PREPARE_FOR_COMPILATION.sh riscv64-linux-gnu
# This second command will prepare the PATH and other environment
# variables to make GCC find libraries and executables
</code></pre>
<blockquote>
<p>If you use this in the future and it fails, it might be because Guix changed
some of the core packages between the time this blog post was written and the
time you read it. You can always use the <code>time-machine</code> utility to
make sure you use everything exactly as it was when this post was written:<br>
<code>guix time-machine --channels=channels.scm -- shell -m manifest.scm</code></p>
</blockquote>
<p>From this point you can run the compiler directly. It will need the <code>--sysroot</code>
option to be able to find the <code>crt*</code> files, but that’s not something I’m
worried about at this point: we’ll fix it when we integrate this in the
bootstrapping process.</p>
<p>Run the compiler like this now:</p>
<pre><code class="language-bash">$ riscv64-linux-gnu-gcc --sysroot=$GUIX_ENVIRONMENT [-static] ...
</code></pre>
<h4>Notable changes in the Guix side</h4>
<p>The most notable change in the Guix side is the addition of the <code>manifest.scm</code>
file and also the <code>PREPARE_FOR_COMPILATION.sh</code> file. With the help of my man
Janneke, I realized the problems I had came from the fact that I was calling
the compiler with the wrong environment, so it was unable to find the linker
and the assembler. Yes, these kinds of things happen a lot in Guix if you are not
careful (and I am <em>not</em> careful at all). Adding these tools let me prepare a
working environment where the assembler and the linker are found and called properly.</p>
<p>This change also includes some interesting extras: the GLibC added to the
manifest also contains the static version, so we can generate static binaries
that are easier to test in an emulated environment without having to deal with
the dynamic linker. Important stuff.</p>
<p>Also, the compilation process now relies on a newer Guix version, which removed
the <code>-unknown</code> part from the triplets (actually <em>quadruplets</em>), like
<code>riscv64-unknown-linux-gnu</code>. That was a little bit of a pain: one day I just
tried to compile everything and it failed, and in the end it was just that
small change. I decided to update the required Guix version to keep it up to date
with current Guix, so I don’t need to run <code>guix time-machine</code> each time.
It’s better like this.</p>
<p>If you want to read more about the change and see how quickly the Guix
people helped me understand what was going on, <a href="https://lists.gnu.org/archive/html/bug-guix/2022-06/msg00092.html">see this mailing list
thread</a><sup id="fnref:guix"><a class="footnote-ref" href="#fn:guix">1</a></sup>. I also have to mention that I needed to add a small
change to my <span class="caps">GCC</span> to make it work when the <code>-unknown</code> part is not
present: adding <code>riscv</code> to <code>config.sub</code> was enough for that.</p>
<p>I also fixed a couple of extra things, but they are not really relevant here.
Having a working environment preparation is a nice milestone by itself,
but we did some more things on the <span class="caps">GCC</span> side!</p>
<h3>Road to a working compiler: The <span class="caps">GCC</span> part</h3>
<p>The changes in the <code>riscv</code> branch contain some commits; most of them are small,
but they are really important. I have to say this is full of details I don’t
really understand, so I’ll focus on those I actually do. The rest of
them are simply things that happened to work in the end. You know, this is
pretty old software and the project is too complex to understand in full…</p>
<h4>Memory models and fences</h4>
<p>First, before doing anything else, we mentioned in the previous post that the
memory models were something we needed to review. We knew this because the code
related to memory models was used in a couple of parts of the <span class="caps">RISC</span>-V code we
copied from the <span class="caps">GCC</span> 7.5 codebase, but it was not available in <span class="caps">GCC</span> 4.6.4: that
<span class="caps">API</span> simply did not exist back then.</p>
<p>The commit <a href="https://github.com/ekaitz-zarraga/gcc/commit/71dc25d08354dead26180bd552c0c3e299b012cb"><code>71dc25d</code></a> removes the memory models from the code
(which were already commented out but not solved), taking the most
conservative approach: always add the <code>.aq</code> flag and the <code>fence</code> instruction.
This is not optimal, but the performance penalty is negligible and it doesn’t
affect the functionality.</p>
<p>I did not come up with this myself. As I mentioned in the previous post, I
asked the maintainer of the <span class="caps">RISC</span>-V support in <span class="caps">GCC</span> (who is also one of the big
names of <span class="caps">RISC</span>-V) about this, and he gave me this solution.</p>
<p>I also had to change the optabs a little bit, using <code>memory_barrier</code> instead of
one of the more recent optabs. For this I just compared the code from the <span class="caps">MIPS</span>
architecture and checked how it changed from the 4.6.4 to the 7.5, as I did for
many other parts of this work. Easy-peasy.</p>
<h4>Wrong arguments in the assembler call</h4>
<p>As I mentioned in the Guix part, we were unable to call the assembler. This
means we didn’t discover that the assembler call was broken until we actually
put the assembler in the <code>PATH</code> and tried to call it.</p>
<p>The commit <a href="https://github.com/ekaitz-zarraga/gcc/commit/7030067e6aa54b44a2f2447d4e706e76bc88f696"><code>7030067</code></a> shows how I needed to make small changes in the
way the assembler is called by <span class="caps">GCC</span> to ensure that it was called correctly.</p>
<p>This issue was easy to fix, but not that easy to catch. First I found the
assembler was complaining because it didn’t understand the <code>-k-march</code> option. I
spent some time realizing the problem was that those were two options that had
been merged together due to a missing space. Yes, the space at the end of the line
<strong>is relevant</strong>.</p>
<p>I directly removed the <code>-k</code> option from the <code>ASM_SPEC</code> because my assembler
considered it ambiguous. I don’t remember where I copied it from, but it
works and I don’t want to think about it ever again.</p>
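<p>The pitfall is easy to reproduce outside of <span class="caps">GCC</span>: spec strings are built by
gluing string pieces together, and C concatenates adjacent string literals, so
a missing trailing space merges two options into one. A minimal standalone
illustration (generic strings, not <span class="caps">GCC</span>’s actual spec contents):</p>
<pre><code class="language-clike">#include <stdio.h>

int
main (void)
{
  /* Adjacent string literals are concatenated by the compiler.  */
  const char *glued = "-k" "-march=rv64g";   /* one bogus option */
  const char *fine  = "-k " "-march=rv64g";  /* two separate options */
  printf ("%s\n%s\n", glued, fine);
  return 0;
}
</code></pre>
<p>The first line prints <code>-k-march=rv64g</code>, which is exactly the option the
assembler was complaining about.</p>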
<h4>Libgcc: the core of this change</h4>
<p>The biggest thing in this set of changes was the addition of <code>libgcc</code>, which
is mandatory if you want to link programs compiled with <span class="caps">GCC</span>. <code>libgcc</code> is a
library <span class="caps">GCC</span> uses for complex operations: instead of generating the assembly
code directly, it generates calls to <code>libgcc</code>, where those complex operations
are defined. You can read further about those operations, but they are not
really relevant for this post; the relevant part is that we need to add <code>libgcc</code> in
order to have a working compiler.</p>
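<p>To make the idea concrete: on a target without a 64-bit division instruction,
<span class="caps">GCC</span> compiles <code>a / b</code> on <code>long long</code> operands into a call to the <code>libgcc</code>
routine <code>__udivdi3</code> (or <code>__divdi3</code> for the signed case). The sketch below is
a simplified shift-and-subtract stand-in for illustration, <em>not</em> the real
<code>libgcc</code> code:</p>
<pre><code class="language-clike">#include <stdio.h>
#include <stdint.h>

/* Toy stand-in for libgcc's __udivdi3: restoring division, bit by bit.
   Assumes d != 0, like the real helper.  */
static uint64_t
toy_udivdi3 (uint64_t n, uint64_t d)
{
  uint64_t q = 0, r = 0;
  int i;
  for (i = 63; i >= 0; i--)
    {
      r = (r << 1) | ((n >> i) & 1);
      if (r >= d)
        {
          r -= d;
          q |= (uint64_t) 1 << i;
        }
    }
  return q;
}

int
main (void)
{
  printf ("%llu\n", (unsigned long long) toy_udivdi3 (1000000007ULL, 13ULL));
  return 0;
}
</code></pre>
<p>Without <code>libgcc</code> the linker has nowhere to resolve symbols like these, which
is exactly the class of undefined-reference errors shown later in this post.</p>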
<p>The <span class="caps">GCC</span> codebase has different folders for its different blocks, so it’s
not surprising to see there’s a folder called <code>gcc</code> for the core and a folder
called <code>libgcc</code> for <code>libgcc</code>. Anyone would expect that just cherry-picking the
commit that added the <code>libgcc</code> support to <span class="caps">GCC</span> 7.5 would be enough to have the
backport ready.</p>
<p>Sadly, life is a little bit harder than that.</p>
<h5>Cherry picking the libgcc support</h5>
<p>The first and easiest thing to do is to cherry-pick the commit
<a href="https://github.com/ekaitz-zarraga/gcc/commit/72add2fa4c354af4bf8db0b8dcb50c5b076b3ae5"><code>72add2f</code></a> and pray. It looked plausible to work
because, if you look at the changes it makes, it’s pretty well contained in the
<code>libgcc/config/riscv/</code> folder and adds just a couple of lines to
<code>libgcc/config.host</code> to make it find the <code>riscv</code> folder.</p>
<p>The contents of the commit are pretty clear:</p>
<ol>
<li>Some assembly files that implement some operations</li>
<li>Some header files and C code that implement other things</li>
<li>Some weird files called <code>t-something</code></li>
</ol>
<p>The first two kinds of files we can understand as the body of the <code>libgcc</code>
support: the juice. The <code>t-something</code> files are what are called Makefile Fragments.</p>
<p>The Makefile Fragments are the basis of the <span class="caps">GCC</span> build system. Files like
<code>config.host</code>, also part of the commit, set a variable, <code>tmake_file</code>, where
all the <code>t-something</code>s are added so the compiler generator framework knows how
to build things according to the rules described in them.</p>
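<p>As a rough sketch of the mechanism (illustrative, not the literal lines from
the commit), <code>config.host</code> accumulates the fragments for each matching target
into <code>tmake_file</code>, and the build machinery later includes every fragment
listed there:</p>
<pre><code class="language-unknown"># Sketch of the config.host mechanism; target pattern and fragment
# names are illustrative, not the exact contents of the commit.
riscv*-*-linux*)
	tmake_file="${tmake_file} riscv/t-riscv"
	;;
</code></pre>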
<p>That’s how the <span class="caps">GCC</span> build system works. Now let’s talk about the problems.</p>
<h5><span class="caps">LIB2ADD</span> iteration is broken</h5>
<p>The first thing I realized when I cherry-picked the <code>libgcc</code> support was that
the whole thing did not build anymore. There was a crazy issue here.</p>
<p>We are not going to talk about the <code>LIB2ADD</code> variable yet, but we can see this
small change, <a href="https://github.com/ekaitz-zarraga/gcc/commit/b9c7f394b33a60c1e64191b0e31f0cf98d6a5f93"><code>b9c7f39</code></a>, affects it. The main issue here was that the
whole makefile system (<code>*.mk</code> files in <code>libgcc</code>) was iterating over the values
of the variable incorrectly, because the <code>libgcc</code> support commit was appending values to
<code>LIB2ADD</code> instead of setting it. The <code>LIB2ADD</code> variable was set empty by the
main makefiles, and appending to it left an empty leading entry, so the
iteration process was trying to compile an empty value.</p>
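<p>The failure mode is the classic empty-leading-entry problem you get when a
list is built by appending with a separator. A small standalone analogy in
plain C (the file names are made up; this is not the actual <code>*.mk</code>
iteration):</p>
<pre><code class="language-clike">#include <stdio.h>
#include <string.h>

int
main (void)
{
  /* "" += " some-file.c" leaves a separator in front, and a splitter
     that does not skip empty fields then yields an empty first entry,
     just like the makefile iteration tried to compile an empty value.  */
  char list[] = " riscv-atomic.c riscv-mul.c";  /* note the leading space */
  char *p = list;
  char *tok;
  int i = 0;
  while ((tok = strsep (&p, " ")) != NULL)
    printf ("entry %d: '%s'\n", i++, tok);
  return 0;
}
</code></pre>
<p>The first entry printed is the empty string: that is the phantom “file” the
build system was trying to compile.</p>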
<p>This was super hard to debug, but this small change made the whole thing
compile, and now I was able to test it further.</p>
<h5>Still broken</h5>
<p>But it was still broken. <span class="caps">GCC</span> didn’t want to compile. Some weird errors
appeared, mentioning something like the <code>extra_parts</code> were not coherent between
<code>gcc</code> and <code>libgcc</code>. Weird.</p>
<p>Reading <code>gcc/config.gcc</code> and <code>libgcc/config.host</code> I realized the use of the
<code>extra_parts</code> variable and how it was certainly incoherent between the two
files. But why?</p>
<p>This led me to analyze the whole build system, comparing the <span class="caps">RISC</span>-V support
with others. I realized here that the build system is mixed between the <code>gcc</code> and
<code>libgcc</code> folders, and it’s extremely difficult to know where the line that
separates one from the other lies.</p>
<p>Apart from that, the build system was unable to compile the <code>crt*</code> files,
because it didn’t know how to do it… The recipes were missing.</p>
<p>This made me go for the most aggressive change possible,
<a href="https://github.com/ekaitz-zarraga/gcc/commit/9c0f7364b89acb38ea3af1cbe1884059671b3c04"><code>9c0f736</code></a>: just copy everything from the
<code>libgcc/config/riscv/</code> to the <code>gcc/config/riscv</code>, add the rules for the <code>crt*</code>
files and make the <code>extra_parts</code> coherent.</p>
<p>Of course, this is not a good change, but it let us test whether the generated
compiler was able to compile anything. <em>“I’ll have time to clean this up later,”</em>
I thought.</p>
<h5>The buildsystem is just a pain in the butt</h5>
<p>Now I was able to compile <span class="caps">GCC</span>, so I could try it on some things.</p>
<p>I built a <span class="caps">RISC</span>-V cross compiler and tried to statically compile a small Hello
World program. Errors appeared:</p>
<pre><code class="language-unknown">/gnu/store/gbvg2msjz488d4s08p57mb1ajg48nxlj-profile/bin/riscv64-linux-gnu-ld: /gnu/store/gbvg2msjz488d4s08p57mb1ajg48nxlj-profile/lib/libc.a(printf_fp.o): in function `_nl_lookup':
/tmp/guix-build-glibc-cross-riscv64-linux-gnu-2.33.drv-0/glibc-2.33/stdio-common/../include/../locale/localeinfo.h:315: undefined reference to `__unordtf2'
/gnu/store/gbvg2msjz488d4s08p57mb1ajg48nxlj-profile/bin/riscv64-linux-gnu-ld: /gnu/store/gbvg2msjz488d4s08p57mb1ajg48nxlj-profile/lib/libc.a(printf_fp.o): in function `__printf_fp_l':
/tmp/guix-build-glibc-cross-riscv64-linux-gnu-2.33.drv-0/glibc-2.33/stdio-common/printf_fp.c:394: undefined reference to `__unordtf2'
/gnu/store/gbvg2msjz488d4s08p57mb1ajg48nxlj-profile/bin/riscv64-linux-gnu-ld: /tmp/guix-build-glibc-cross-riscv64-linux-gnu-2.33.drv-0/glibc-2.33/stdio-common/printf_fp.c:394: undefined reference to `__letf2'
/gnu/store/gbvg2msjz488d4s08p57mb1ajg48nxlj-profile/bin/riscv64-linux-gnu-ld: /gnu/store/gbvg2msjz488d4s08p57mb1ajg48nxlj-profile/lib/libc.a(printf_fphex.o): in function `__printf_fphex':
/tmp/guix-build-glibc-cross-riscv64-linux-gnu-2.33.drv-0/glibc-2.33/stdio-common/../stdio-common/printf_fphex.c:212: undefined reference to `__unordtf2'
/gnu/store/gbvg2msjz488d4s08p57mb1ajg48nxlj-profile/bin/riscv64-linux-gnu-ld: /tmp/guix-build-glibc-cross-riscv64-linux-gnu-2.33.drv-0/glibc-2.33/stdio-common/../stdio-common/printf_fphex.c:212: undefined reference to `__unordtf2'
/gnu/store/gbvg2msjz488d4s08p57mb1ajg48nxlj-profile/bin/riscv64-linux-gnu-ld: /tmp/guix-build-glibc-cross-riscv64-linux-gnu-2.33.drv-0/glibc-2.33/stdio-common/../stdio-common/printf_fphex.c:212: undefined reference to `__letf2'
collect2: ld returned 1 exit status
</code></pre>
<p>The most logical thing to do was to build a <span class="caps">MIPS</span> cross compiler and check if
the same issue appeared. Of course, it didn’t.</p>
<p>Researching a little bit in the old <span class="caps">GCC</span> internals documentation, I found a
couple of interesting things:</p>
<p><a href="https://gcc.gnu.org/onlinedocs/gcc-4.6.4/gccint/Target-Fragment.html#Target-Fragment">https://gcc.gnu.org/onlinedocs/gcc-4.6.4/gccint/Target-Fragment.html#Target-Fragment</a></p>
<ul>
<li>The <code>LIB2FUNCS_EXTRA</code> variable is the one that lists what should be
compiled and added to <code>libgcc</code>.</li>
<li><strong>Floating Point Emulation</strong> support is added by generating a couple of files
with some macros on top: <code>fp-bit.c</code> and <code>dp-bit.c</code>.</li>
</ul>
<p>Neither of those was used in the <code>libgcc</code> support we backported, because the
<span class="caps">GCC</span> build system has changed a lot since 4.6.4. In fact, there is a commit<sup id="fnref:commit"><a class="footnote-ref" href="#fn:commit">2</a></sup>,
much later than the 4.6.4 release, that removes the need to generate those
<code>fp-bit.c</code> thingies.</p>
<p>The <code>LIB2FUNCS_EXTRA</code> variable was not used either, but somewhere in the
makefiles I found <code>LIB2ADD</code> was set from it. It looks like the whole
build system moved from <code>LIB2FUNCS_EXTRA</code> to <code>LIB2ADD</code>, which was an internal
variable in the past. I don’t know.</p>
<p>I just moved the <code>LIB2ADD</code> contents to <code>LIB2FUNCS_EXTRA</code>, set up the floating point
emulation in the <code>t-riscv</code> makefile fragment, and hoped my work there was done.</p>
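<p>For reference, the 4.6-era mechanism documented in the Target Fragment page
looks roughly like this (a sketch following the internals documentation; the
file name <code>riscv-extra.c</code> is made up, not taken from the actual fragment):</p>
<pre><code class="language-unknown"># Extra C sources compiled into libgcc, the 4.6.4 way:
LIB2FUNCS_EXTRA = $(srcdir)/config/riscv/riscv-extra.c

# Floating point emulation: fp-bit.c and dp-bit.c are generated from
# config/fp-bit.c, with a macro on top selecting single precision.
fp-bit.c: $(srcdir)/config/fp-bit.c
	echo '#define FLOAT' > fp-bit.c
	cat $(srcdir)/config/fp-bit.c >> fp-bit.c

dp-bit.c: $(srcdir)/config/fp-bit.c
	cat $(srcdir)/config/fp-bit.c > dp-bit.c
</code></pre>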
<h5>A huge pain in the butt</h5>
<p>It still failed, but at least now the <code>__letf2</code> symbol was found. The only one
left to fix was <code>__unordtf2</code>.</p>
<p>I was disheartened.</p>
<p>The <code>__unordtf2</code> name did not appear anywhere in the code, but the <code>libgcc</code>
built for <span class="caps">MIPS</span> had the symbol inside (I checked it with <code>nm</code>!). I had no
idea what was going on.</p>
<p>I asked all my peers about this, and I was sent a program that actually
compiled and ran (Janneke is a genius, someone has to say it!):</p>
<pre><code class="language-clike">#include <stdio.h>
int
main ()
{
return printf ("Hello, world!\n");
}
int
__unordtf2 ()
{
return 0;
}
</code></pre>
<p>Hah! Still no solution, but it was a little bit of hope.</p>
<p>This gave me the energy I needed to research further. This <code>__unordtf2</code>
function comes from the software floating point support, but the makefile fragments
in the <code>libgcc</code> folder seemed to be correctly set…</p>
<h5>Moxie for the rescue</h5>
<p>The <span class="caps">MIPS</span> architecture was too complex to be understandable for this humble human
being, so I decided to go for Moxie this time.</p>
<p><a href="http://moxielogic.org/blog/pages/architecture.html">Moxie</a> is a really
interesting thing, but we are not going to spend time on it; what matters is its support
in <span class="caps">GCC</span> 4.6.4. Take a look at the files on both sides of the Moxie support, the
<code>libgcc</code> part and the <code>gcc</code> part:</p>
<pre><code class="language-unknown">gcc/config/moxie
├── constraints.md
├── crti.asm
├── crtn.asm
├── moxie.c
├── moxie.h
├── moxie.md
├── moxie-protos.h
├── predicates.md
├── rtems.h
├── sfp-machine.h
├── t-moxie
├── t-moxie-softfp
└── uclinux.h
libgcc/config/moxie
├── crti.asm
├── crtn.asm
├── sfp-machine.h
├── t-moxie
└── t-moxie-softfp
</code></pre>
<p>As you can see, some things are repeated, and most of the files are located in
the <code>gcc</code> part, which was not the case in the backported commit. I used this as
a reference for a massive cleanup of the previous aggressive duplication and
ended up with this commit: <a href="https://github.com/ekaitz-zarraga/gcc/commit/703efe3e86e68fe05380e996943c831e7ad9a541"><code>703efe3</code></a></p>
<p>But that wasn’t enough.</p>
<p>I also found that the <code>soft-fp</code> support did not come from the <code>libgcc</code>
directory but from the <code>gcc</code> one, so I needed to fix some makefile fragments.
The reference on how to do that was located in <code>gcc/config/soft-fp/t-softfp</code>.
This file describes all the variables I needed to set up to make the whole
process find the software floating point functions to add (see how the function
names are built with the <code>$(m)</code> variable? That’s why I couldn’t find where
<code>__unordtf2</code> came from…).</p>
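<p>This naming scheme is why grepping for the full symbol is hopeless: the
function names are assembled from an operation and a mode suffix. A hedged
sketch of the idea (variable names simplified from memory, they may differ
from the real <code>t-softfp</code>):</p>
<pre><code class="language-unknown"># Operations and modes are listed separately...
softfp_ops   := add sub mul div eq le unord
softfp_modes := sf df tf

# ...and the build system expands them into names like __$(op)$(m)$(arity):
# "unord" + "tf" + "2" gives __unordtf2, the symbol the linker missed.
</code></pre>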
<p>Those variables were set in the <code>libgcc/config/riscv/t-softfp*</code> files. I replicated
them in <code>gcc/config/riscv</code> as in the Moxie target and added references to them
in the <code>gcc/config.gcc</code> file, copying the lines I had in <code>libgcc/config.host</code>. The
process was still failing, as the variables were not found by the main
makefile. I decided to hardcode them and give it another go; this time it built,
I was able to compile files, and the weird errors did not appear anymore.</p>
<p>In the end I realized that the reason the main makefile wasn’t finding the
variables was that I was referring to the <code>t-softfp*</code> files through the
variable <code>host_address</code>, as was done in <code>libgcc/config.host</code>. The
problem was that this variable was not available in the main <code>gcc/config.gcc</code> file,
so I had to make a beautiful <code>switch-case</code> to deduce the word size.</p>
<p>With all this knowledge, and with the help of the Moxie support, I finally
arranged a new commit where I duplicated the files that needed duplicating,
added the correct references to the makefile fragments and even fixed some of
the variables in the makefiles: <a href="https://github.com/ekaitz-zarraga/gcc/commit/f42a21427361fb2d6d8481d143258af3237fd232"><code>f42a214</code></a></p>
<p>Yeah, all this was hard to deduce, because this build system is really complex
and makefiles are really hard to debug<sup id="fnref:debug-makefile"><a class="footnote-ref" href="#fn:debug-makefile">3</a></sup>. Also, the fact that I
don’t understand why I need to replicate the <code>t-softfp*</code> files in both places
drives me mad, but I have to learn to deal with the fact that I can’t
understand everything.</p>
<p>In these commits you can see I also deleted references to <code>extra_parts</code> and some
other things. The reason is simple: if other architectures don’t need
to set those variables, neither do I. In the end, the <code>crt*</code> files were generated anyway.</p>
<h4>Other changes</h4>
<p>I also removed <code>-latomic</code> from the calls to the linker because it looks like it
didn’t exist back then (we’ll see how this explodes in my face in the future),
and I fixed a couple more things, but that’s not really interesting in my
opinion<sup id="fnref:interesting"><a class="footnote-ref" href="#fn:interesting">4</a></sup>.</p>
<h3>Missing things</h3>
<p>There are many things still missing, but some of them I won’t even try, because
they are out of the scope of the project. Remember: <strong>we just need to be able
to compile a more recent <span class="caps">GCC</span></strong>, not the rest of the world.</p>
<p>Some of the things I left out might become mandatory in the near future as we do
proper testing of all this. My goal here was to provide something that can run,
and then I’ll collaborate with the different agents in this bootstrapping
effort to fix anything we need to reach full bootstrapping support.</p>
<p>There are a few obvious things missing:</p>
<ul>
<li><strong>Big Endian support</strong>: <code>riscv64be-linux-gnu</code> support, basically (note the
<code>be</code> in the target name). I won’t add this until we are sure we need it. It
shouldn’t be difficult: I already found some commits in mainline <span class="caps">GCC</span> where
this was added, and they were simple.</li>
<li><strong>Specific device support</strong>: we didn’t add support for any specific device
yet; that’s something we’ll need to think about in the future, but we
probably won’t do, because it would make us maintain more code, and I don’t
think generic <span class="caps">RISC</span>-V code is going to have issues on the majority of devices.</li>
<li>There are also <strong>many commits that came after</strong> the main port that fix some
relocations and other things. Many of them are not really relevant:
most are related to bugs that were introduced later, fix
things that won’t change anything in the only program we need to build (<span class="caps">GCC</span>),
and so on. In order to know which ones are relevant we need…</li>
<li><strong>Proper testing!</strong> I didn’t do this yet, and I’ll probably need help with
it. Compile your <span class="caps">RISC</span>-V software with this and give it a try! Send me the
errors you get!</li>
<li><strong>Libatomic</strong>: it was directly removed from the calls to the linker, as I
mentioned before, and we have to make sure it didn’t exist back then and so
on. Boring things…</li>
<li>I didn’t even bother to add the <strong>testsuite support</strong>; our only test has to
be whether we are able to compile <span class="caps">GCC</span> with this, which I haven’t really tried yet
anyway (because it needs some extra things).</li>
</ul>
<h3>Conclusion</h3>
<p>This part of the project came at the worst moment. I wasn’t really motivated
and I had some personal things going on. It was difficult for me to do this.</p>
<p>In contrast with what I did in the previous steps of the project, this part is
really uninteresting because it doesn’t give you a lot of chances for learning,
which is the only thing that keeps me alive at this point.</p>
<p>It’s also pretty boring and exasperating to feel you’ll never understand
something, and trying and trying in an almost <em>trial and error</em> way is really
tedious for someone like me.</p>
<p>Sometimes, working like this makes you feel really alone. You have almost no
people to help you, and the project needs a huge amount of context to be
understood, so you can’t ask just <em>anyone</em> for help, and those who are supposed to
know are really hard to reach. Or, what might be worse: maybe there’s nobody
who understands this thing well, because it’s old, it changed a lot, and
probably just a handful of people really took part in the development of the
<del>fucking</del> buildsystem.</p>
<p>In conclusion, this is a boring and uninteresting job, but someone has to do
it, and… it was my turn this time.</p>
<p>You go next.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:guix">
<p>Some people also spent time with me in the <span class="caps">IRC</span>. Thanks to all that
helped! <a class="footnote-backref" href="#fnref:guix" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:commit">
<p><code>569dc494616700a3cf078da0cc631c36a4f15821</code> <a class="footnote-backref" href="#fnref:commit" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:debug-makefile">
<p>Try to run <code>make --debug</code> in a project of the size of <span class="caps">GCC</span>
and laugh with me. <a class="footnote-backref" href="#fnref:debug-makefile" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:interesting">
<p>The rest of the post is not really interesting either, but I
need to report what I did. It’s just me fighting against myself and a very
complex buildsystem that could’ve been simpler and/or better documented. <a class="footnote-backref" href="#fnref:interesting" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
</ol>
</div>Milestone — Minimal RISC-V support in GCC 4.6.42022-04-08T00:00:00+03:002022-04-08T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2022-04-08:/bootstrapGcc3.html<p>Description of the changes for a minimal <span class="caps">RISC</span>-V support in <span class="caps">GCC</span>-4.6.4 and
how I reached this point.</p><p>In the <a href="https://ekaitz.elenq.tech/tag/bootstrapping-gcc-in-risc-v.html">series</a> we already introduced <span class="caps">GCC</span>,
its internals, and the work I’m doing to make it able to bootstrap on <span class="caps">RISC</span>-V.
In this post we are going to tackle the backporting effort and see how I
managed to make <span class="caps">GCC</span>-4.6.4 compile a simple program to <span class="caps">RISC</span>-V.</p>
<h3>How to follow this post</h3>
<p>As this is going to be deeply connected to the changes I introduced in the
codebase, I suggest you follow it directly in <a href="https://github.com/ekaitz-zarraga/gcc">the
repository</a>. The branch where I made the
changes is <code>riscv</code>, which starts from <code>releases/gcc-4.6.4</code>. As I will continue
adding changes on top of this, I left <a href="https://github.com/ekaitz-zarraga/gcc/releases/tag/minimal-compiler">a tag called <code>minimal-compiler</code></a>
that points to the contents of the repository at the time this blog post was written.</p>
<p>In any case, I’ll share small pieces of the code in the post, but of course I
can’t share everything here, so I recommend you go to the sources. I won’t
link the sources directly but mention where you can find the changes, so you are
not forced to follow all the links in the browser and can use your favorite
editor instead.</p>
<h3>Overview of the commits</h3>
<p>The <code>riscv</code> branch where I did all the work is split into several commits on top of
<code>releases/gcc-4.6.4</code>, where it started.</p>
<p>First comes a series of 4 commits that make <span class="caps">GCC</span>-4.6.4 compilable with more
recent toolchains. These should later be separated into independent patches and
applied by the distribution tool, Guix in this case.</p>
<p>Next a couple of commits describe a precarious <code>guix.scm</code> file that should
compile the project properly. At the moment it’s not fully ready for
distribution but that’s not really our job in the project, so I don’t want to
spend a lot of time on that yet. At the moment it’s just working so you can run
<code>guix build -f guix.scm</code> from the project directory and it should build a
minimal compiler, as we’ll see later. There’s also a <code>channels.scm</code> file, so
you can use the exact packages I used thanks to the very powerful <code>guix
time-machine</code> command and replicate my exact build.</p>
<p>Even though I didn’t want to spend a long time on the Guix package, I’d be lying
if I told you I didn’t. Compiling legacy software is extremely difficult.
In this case, I had to patch the code to be compatible with more modern <span class="caps">GCC</span>
Toolchains, package an old <code>flex</code>, choose lots of configure time options…
Still, there are tons of things missing: there’s no C++ support, the package
doesn’t find the system’s libraries, such as glibc, and it’s not integrated with
the system’s binutils. I don’t know how I’m going to fix that, to be honest, but I
don’t want to think about it right now.</p>
<p>The next commits are what interests us the most: changes on top of <span class="caps">GCC</span>.</p>
<p>The first of them<sup id="fnref:port"><a class="footnote-ref" href="#fn:port">1</a></sup> is just the <span class="caps">RISC</span>-V port commit from upstream <span class="caps">GCC</span>
applied on top of the project, being a little bit careful about
conflicts<sup id="fnref:conflicts"><a class="footnote-ref" href="#fn:conflicts">2</a></sup>. Obviously, this change doesn’t really work; it doesn’t
even compile, but it lets us see which changes were needed on top of it.</p>
<p>In the next commit<sup id="fnref:md-files"><a class="footnote-ref" href="#fn:md-files">3</a></sup> I made a high-level fix on the Machine
Description files. If you remember from the <a href="https://ekaitz.elenq.tech/bootstrapGcc1.html">post about <span class="caps">GCC</span>
internals</a>, the machine description files are some
kind of Lisp-like files that describe both the translations between <span class="caps">GIMPLE</span> and
<span class="caps">RTL</span> and also between <span class="caps">RTL</span> and assembly, among other things. In this commit I
just removed some of the RTXs that were not available back in the 4.6.4 days
but were in use in the port. I’m talking, more specifically, about
<code>define_int_iterator</code> and <code>define_int_attr</code>. Thankfully they were just a
couple of loops that were easy to unroll by hand. Not a big deal.</p>
<p>Then, I made a larger commit that tries to fix the rest of the
<code>gcc/config/riscv</code> folder<sup id="fnref:large-commit"><a class="footnote-ref" href="#fn:large-commit">4</a></sup>. In this one I had two goals: make the
port compatible with the old C-based <span class="caps">API</span> and remove parts that weren’t strictly
necessary but were complex to keep. This means I removed all the builtins support so
I didn’t need to port them (nice trick, huh?) and I kept the code related to
memory models out of the equation. I may need to fix that in the future, but I
was looking for minimal support and I didn’t need that for my goal.</p>
<p>After that I tried to compile the project and run it, but I realized there was
a problem with the argument handling of the compiler. It was unable to find
arguments like <code>-march</code> and it was always failing to compile anything.</p>
<p>I realized there was a weird file at <code>gcc/common/config/riscv/riscv-common.c</code>
that looked like it was handling input arguments, so I focused on porting that
one too. It turns out that the old <span class="caps">GCC</span> didn’t have that code structure:
everything was done in <code>gcc/config/</code> back then, so I moved the support and
made the argument handling follow the old <span class="caps">API</span>. That’s the last commit of the
series<sup id="fnref:last-commit"><a class="footnote-ref" href="#fn:last-commit">5</a></sup>.</p>
<h3>Deep diving</h3>
<p>Now I’ll try to explain the changes I made in the code here and there, but
first I have to explain the method I followed to make this.</p>
<p>It might be surprising, but for the first time I didn’t try to understand
everything; I just worked my way through it. This means I have absolutely no clue
what the code does in most places<sup id="fnref:guilt"><a class="footnote-ref" href="#fn:guilt">6</a></sup>. I just looked at the
overall shape of it and tried to match that shape with the code found in other
architectures, mostly <span class="caps">MIPS</span>, which the <span class="caps">RISC</span>-V support was based on. If I found
anything that I didn’t know how to convert, I would read how that thing was
implemented in <span class="caps">MIPS</span> when the <span class="caps">RISC</span>-V support was added and then compare that
implementation with the one at 4.6.4. That would give me an idea about how to
convert it to the old way of doing things.</p>
<p>So, yeah, most of the coding was a mental exercise of pattern matching and
conversion. There are very few things I coded myself with a deep understanding
of what I was doing.</p>
<p>This doesn’t really mean you don’t need any knowledge to do this. Of course you
do. You need to understand what the code does at a very high level, and know
how targets are described in <span class="caps">GCC</span><sup id="fnref:gcc-course"><a class="footnote-ref" href="#fn:gcc-course">7</a></sup>, but you don’t really need to
know each function in detail.</p>
<p>Sadly, in some cases I had to read functions carefully and understand them, so
there’s some knowledge needed, still.</p>
<h4>First patch set</h4>
<p>The first patch set is not really relevant. I just made it while I was trying
to compile the project without changes. The compilation ended with errors; I
reviewed them, went to the <span class="caps">GCC</span> issue tracker and searched. In some cases I was
lucky and found a patch that fixed them; in others I only found suggestions
and had to fix the thing myself. Not really interesting, honestly.</p>
<h4>The Guix package</h4>
<p>The Guix part in <code>guix.scm</code> is not really interesting either, at least for the
moment. The most interesting part might be the addition of <code>flex-2.5</code> to the
inputs and the use of <code>local-file</code> as a source for the <span class="caps">GCC</span> package<sup id="fnref:efraim"><a class="footnote-ref" href="#fn:efraim">8</a></sup>.</p>
<p>All the rest is playing around with the configure flags and trying to read
Guix’s <span class="caps">GCC</span> packages and <a href="https://gitlab.com/janneke/guix/-/blob/wip-full-source-bootstrap/gnu/packages/commencement.scm">Janneke’s work with the full-source
bootstrap</a>.</p>
<p>Even with all that, there are some things missing, so I have to come back to
this in the future.</p>
<p>There is, though, a really interesting point to take into account. We already
said in the <a href="https://ekaitz.elenq.tech/bootstrapGcc1.html">post about <span class="caps">GCC</span> internals</a> that <span class="caps">GCC</span> is
a driver that calls other programs, such as <code>as</code> and <code>ld</code> from <span class="caps">GNU</span> Binutils, so
we know we only need the very basics in order to test that our compiler can
output <span class="caps">RISC</span>-V assembly. We can ignore everything else and focus on one
thing: I’m talking, of course, about <code>cc1</code>, the C compiler.</p>
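<p>You can see this driver behaviour with any installed <span class="caps">GCC</span>: the <code>-print-prog-name</code> flag asks the driver where it would find one of its subprograms, and <code>-###</code> prints the commands it would run without executing them (a quick sketch; the paths will differ on your system):</p>
<pre><code class="language-bash"># Ask the driver for the real C compiler it would run:
gcc -print-prog-name=cc1

# Dry run: print the cc1/as/collect2 invocations without running them:
echo 'int main(void){return 0;}' > /tmp/driver-demo.c
gcc -### -c /tmp/driver-demo.c -o /tmp/driver-demo.o
</code></pre>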
<p>That’s why I only set the target to <code>all-gcc</code> and focused on that. Later we’ll
need to dig deeper.</p>
<p>One of the issues I’ll have to tackle is that the <span class="caps">GCC</span> I’m building is a
cross-compiler, but this whole project is being developed for a <span class="caps">RISC</span>-V target.
This doesn’t let the compiler check itself using the staged approach<sup id="fnref:staged"><a class="footnote-ref" href="#fn:staged">9</a></sup>,
which is something I’m interested in watching.</p>
<p>Once the proper <code>guix.scm</code> file is generated, I’ll prepare a package for the
<span class="caps">RISC</span>-V bootstrapping process. In that package I’ll define the first 4 commits
as separate patches to apply on top of the source, but I’ll remove them from
the original source. That way the codebase will continue to be compatible with
old toolchains and we’ll only apply those patches where needed, that is, when
we try to build with more recent environments.</p>
<h4>Machine Description files</h4>
<p>The machine description files did not change that much over the years. Some
extra constructs were added, but the idea, the goal and the shape of the files
didn’t really change.</p>
<p>As we introduced already, the <span class="caps">RISC</span>-V port used <code>define_int_iterator</code> constructs
in order to simplify some of the work, repeating pieces of the machine
description file according to the integer iterator. Back in <span class="caps">GCC</span> 4.6.4 that
construct was not available so I unrolled the loop by hand following the
example at the <span class="caps">GCC</span> documentation:</p>
<p><a href="https://gcc.gnu.org/onlinedocs/gccint/Int-Iterators.html">https://gcc.gnu.org/onlinedocs/gccint/Int-Iterators.html</a></p>
<p>You simply repeat the structures (unroll them) using the values of the iterators and
use the <code>define_int_attr</code> entries to set some of the fields too. The example in the
docs gives a good description of how to do it.</p>
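<p>As an illustration of the mechanics (a made-up sketch, not the actual port code, with <code>define_int_attr</code> naming left out), an iterator-based definition expands to one copy per listed value, which is exactly what the hand-unrolled version spells out:</p>
<pre><code>;; Newer GCC: one definition, expanded for each value of the iterator
(define_int_iterator ANY_OP [UNSPEC_OP_A UNSPEC_OP_B])
(define_insn "any_op_si2" ...)   ; pattern using ANY_OP in its RTL template

;; GCC 4.6.4: the same thing, unrolled by hand
(define_insn "op_a_si2" ...)     ; copy using UNSPEC_OP_A
(define_insn "op_b_si2" ...)     ; copy using UNSPEC_OP_B
</code></pre>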
<p>On the other hand, I also found that the <span class="caps">RTL</span> in the <span class="caps">RISC</span>-V port was using
<code>simple_return</code> in some places, and I realized that didn’t exist in the past. I
replaced it with <code>return</code>, hoping that it was the same, but I don’t remember
if I reasoned further<sup id="fnref:see"><a class="footnote-ref" href="#fn:see">10</a></sup>. In any case, you can take a look at
<code>gcc/rtl.def</code><sup id="fnref:def"><a class="footnote-ref" href="#fn:def">11</a></sup> and see how <code>SIMPLE_RETURN</code> was added later.</p>
<h4>Matching the <span class="caps">API</span></h4>
<p>There are other, more meaningful changes. The large commit<sup id="fnref2:large-commit"><a class="footnote-ref" href="#fn:large-commit">4</a></sup> is
full of changes related to the conversion back to the C <span class="caps">API</span>.</p>
<p>The most obvious ones are converting from <code>rtx_insn *</code> to <code>rtx</code>, and
adding/removing machine modes where needed. It was just a matter of searching
for the functions being used in the <span class="caps">MIPS</span> target and trying to match them. Boring,
and probably wrong in a couple of places, but it looks like it’s working, I don’t
know. Examples:</p>
<pre><code class="language-diff">- emit_insn (gen_rtx_SET (target, src));
+ emit_insn (gen_rtx_SET (VOIDmode, target, src));
</code></pre>
<pre><code class="language-diff">- op = plus_constant (Pmode, UNSPEC_ADDRESS (base), INTVAL (offset));
+ op = plus_constant (UNSPEC_ADDRESS (base), INTVAL (offset));
</code></pre>
<p>There were a couple of functions using a small class called <code>cumulative_args_t</code>
that was easy to convert to <code>CUMULATIVE_ARGS *</code> by just removing calls to
<code>get_cumulative_args</code> and <code>pack_cumulative_args</code>. In C everything is rougher
and lower level. Thankfully, in this case the low-level <span class="caps">API</span> was still present, so
we could just use that instead of the new C++ one, and removing the abstraction
layer was trivial. See <code>riscv_setup_incoming_varargs</code> in
<code>gcc/config/riscv/riscv.c</code> as an example. There might be some things wrong, but
it looks reasonable.</p>
<p>There were also a couple of <code>std::swap</code> calls here and there that I needed to get
rid of. I introduced a temporary variable and did the swap by hand in the classic way.</p>
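<p>For reference, this is the whole trick: assuming two variables of the same type, the C++ call turns into three assignments through a temporary:</p>
<pre><code class="language-c">#include <stdio.h>

int main (void)
{
  int a = 1, b = 2;

  /* What std::swap (a, b) becomes in plain C: */
  int tmp = a;
  a = b;
  b = tmp;

  printf ("%d %d\n", a, b);
  return 0;
}
</code></pre>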
<p>Some other changes were harder to spot. Like these:</p>
<pre><code class="language-diff"> || !TYPE_MIN_VALUE (index)
- || !tree_fits_uhwi_p (TYPE_MIN_VALUE (index))
- || !tree_fits_uhwi_p (elt_size))
+ || !host_integerp(TYPE_MIN_VALUE (index),0)
+ || !host_integerp(elt_size,0))
return -1;
- n_elts = 1 + tree_to_uhwi (TYPE_MAX_VALUE (index))
- - tree_to_uhwi (TYPE_MIN_VALUE (index));
+ n_elts = 1 + TREE_INT_CST_LOW(TYPE_MAX_VALUE (index))
+ - TREE_INT_CST_LOW (TYPE_MIN_VALUE (index));
</code></pre>
<p>All those functions and macros are pretty different, but they happen to be more
or less the same. What I did here was read the newer <span class="caps">MIPS</span> implementation, try
to find those, and then go back in time to the old <span class="caps">MIPS</span> implementation and see
what it was using instead. It wasn’t obvious at the beginning, so I read the
definitions of all of those things (ctags for the win!) and I even had to
define some, like <code>sext_hwi</code>, which I added to <code>gcc/hwint.h</code> as best I could.</p>
<h4>The include dance</h4>
<p>If you check the changes at the top of <code>gcc/config/riscv/riscv.c</code>, you’ll see
there are a lot of <code>#include</code>s removed and some new ones added. This is
normal, as the older C <span class="caps">API</span> was very different from the newer C++ one, but also
because many of these includes were not really used inside the code. First I
reviewed which files existed, but later I just copied from <span class="caps">MIPS</span> and rearranged
until the thing compiled.</p>
<h4>Crazy changes and inventions</h4>
<p>Some other changes were crazier. I had to add <code>riscv_cpu_cpp_builtins</code>,
which was defined in <code>gcc/config/riscv/riscv-c.c</code>, but I had no way to make it
work, so I copied what was done in other places, made it a huge macro, added
it to <code>gcc/config/riscv/riscv.h</code> and prayed. The compiler was happy with that
change, and I was too. That let me remove the <code>riscv-c.c</code> file from the
compilation process, even if it’s still included in the repository (yeah, I know…).</p>
<p>The <code>riscv.h</code> file has some other magic tricks too. The <code>ASM_SPEC</code> is a lot of
fun now: basically a copy from somewhere else, because defining the craziest
macro I’ve seen in my life was too much for me:</p>
<pre><code class="language-diff">#define ASM_SPEC "\
%(subtarget_asm_debugging_spec) \
-%{" FPIE_OR_FPIC_SPEC ":-fpic} \
+%{fpic|fPIC|fpie|fPIE:-k}\
%{march=*} \
%{mabi=*} \
%(subtarget_asm_spec)"
</code></pre>
<p>Wanna see the macro? Well, you asked for it (this is just half of it):</p>
<pre><code class="language-c">#ifdef ENABLE_DEFAULT_PIE
#define NO_PIE_SPEC "no-pie|static"
#define PIE_SPEC NO_PIE_SPEC "|r|shared:;"
#define NO_FPIE1_SPEC "fno-pie"
#define FPIE1_SPEC NO_FPIE1_SPEC ":;"
#define NO_FPIE2_SPEC "fno-PIE"
#define FPIE2_SPEC NO_FPIE2_SPEC ":;"
#define NO_FPIE_SPEC NO_FPIE1_SPEC "|" NO_FPIE2_SPEC
#define FPIE_SPEC NO_FPIE_SPEC ":;"
#define NO_FPIC1_SPEC "fno-pic"
#define FPIC1_SPEC NO_FPIC1_SPEC ":;"
#define NO_FPIC2_SPEC "fno-PIC"
#define FPIC2_SPEC NO_FPIC2_SPEC ":;"
#define NO_FPIC_SPEC NO_FPIC1_SPEC "|" NO_FPIC2_SPEC
#define FPIC_SPEC NO_FPIC_SPEC ":;"
#define NO_FPIE1_AND_FPIC1_SPEC NO_FPIE1_SPEC "|" NO_FPIC1_SPEC
#define FPIE1_OR_FPIC1_SPEC NO_FPIE1_AND_FPIC1_SPEC ":;"
#define NO_FPIE2_AND_FPIC2_SPEC NO_FPIE2_SPEC "|" NO_FPIC2_SPEC
#define FPIE2_OR_FPIC2_SPEC NO_FPIE2_AND_FPIC2_SPEC ":;"
#define NO_FPIE_AND_FPIC_SPEC NO_FPIE_SPEC "|" NO_FPIC_SPEC
#define FPIE_OR_FPIC_SPEC NO_FPIE_AND_FPIC_SPEC ":;"
</code></pre>
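<p>These all rely on C’s adjacent string literal concatenation: the preprocessor pastes the pieces into one spec string at compile time. A standalone program using just the relevant subset of the macros above (nothing <span class="caps">GCC</span>-specific) shows what <code>FPIE_OR_FPIC_SPEC</code> ends up being:</p>
<pre><code class="language-c">#include <stdio.h>

/* The relevant subset of the macros above.  */
#define NO_FPIE1_SPEC "fno-pie"
#define NO_FPIE2_SPEC "fno-PIE"
#define NO_FPIE_SPEC NO_FPIE1_SPEC "|" NO_FPIE2_SPEC
#define NO_FPIC1_SPEC "fno-pic"
#define NO_FPIC2_SPEC "fno-PIC"
#define NO_FPIC_SPEC NO_FPIC1_SPEC "|" NO_FPIC2_SPEC
#define NO_FPIE_AND_FPIC_SPEC NO_FPIE_SPEC "|" NO_FPIC_SPEC
#define FPIE_OR_FPIC_SPEC NO_FPIE_AND_FPIC_SPEC ":;"

int main (void)
{
  /* Adjacent string literals are concatenated at compile time,
     so this prints the fully assembled spec fragment.  */
  puts (FPIE_OR_FPIC_SPEC);  /* fno-pie|fno-PIE|fno-pic|fno-PIC:; */
  return 0;
}
</code></pre>
<p>Roughly, wrapping that result in <code>%{…:;}</code> in the spec language means “if any of these options was passed, substitute nothing”, so the whole pyramid exists just to detect the presence or absence of the <span class="caps">PIC</span>/<span class="caps">PIE</span> family of flags.</p>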
<p>Anyway, more things were basically made up like that, such as these lines in
<code>gcc/config/riscv/linux.h</code>:</p>
<pre><code class="language-diff">-#define TARGET_OS_CPP_BUILTINS() \
- do { \
- GNU_USER_TARGET_OS_CPP_BUILTINS(); \
- } while (0)
+#define TARGET_OS_CPP_BUILTINS() LINUX_TARGET_OS_CPP_BUILTINS()
</code></pre>
<pre><code class="language-diff"> %{!shared: \
%{!static: \
%{rdynamic:-export-dynamic} \
- -dynamic-linker " GNU_USER_DYNAMIC_LINKER "} \
+ -dynamic-linker " LINUX_DYNAMIC_LINKER "} \
%{static:-static}}"
</code></pre>
<p>I just copied from other places because there were absolutely no references to
those macros, so… I thought the best way to handle this was to follow what other
targets did.</p>
<p>Of course, this whole thing is not really tested right now, because it affects
how the linker is called, but that was broken anyway because of my distribution
of choice (Guix, I love you, but…) so what could I do? Making them up and
fixing them later sounded like a good plan.</p>
<p>As I already mentioned, I left builtins and memory models out of the equation.
I just commented them out and hoped everything would work properly for small
programs. I will try larger programs later.</p>
<h4>Argument handling</h4>
<p>The last commit<sup id="fnref2:last-commit"><a class="footnote-ref" href="#fn:last-commit">5</a></sup> was a little bit hard to do too. The changes
related to it involved a file that was completely out of place, as we
said earlier, so I reviewed other architectures and found how they
dealt with this. The <span class="caps">API</span> was pretty different, so the first
thing I did was make the function’s formal arguments fit those of the <span class="caps">API</span>,
and then I started making changes.</p>
<p>It was really hard to work out how the <code>MASK_*</code> macros worked just by looking at
the code, because they were defined nowhere!</p>
<p>The problem was I wasn’t looking in the correct place. More code generation
magic! The <code>gcc/config/riscv/riscv.opt</code> file is what generates all those masks
and <code>TARGET_*</code> macros, like <code>TARGET_MUL</code>, used to check if the target has the
multiplication extension. All of those were defined there, even if the definition was
obscure and hard to match with anything else in the code<sup id="fnref:hard-to-match"><a class="footnote-ref" href="#fn:hard-to-match">12</a></sup>.</p>
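<p>The generated code follows a simple pattern: each mask declared in the <code>.opt</code> file becomes a bit in a <code>target_flags</code> variable plus a <code>TARGET_*</code> macro to test it. A hand-written imitation (not the file <span class="caps">GCC</span> actually generates, and with made-up mask names) looks like this:</p>
<pre><code class="language-c">#include <stdio.h>

/* What the options machinery generates, roughly: one bit per mask...  */
#define MASK_MUL    (1 << 0)
#define MASK_ATOMIC (1 << 1)

/* ...and a convenience macro to test each bit.  */
#define TARGET_MUL    ((target_flags & MASK_MUL) != 0)
#define TARGET_ATOMIC ((target_flags & MASK_ATOMIC) != 0)

static int target_flags;

int main (void)
{
  target_flags |= MASK_MUL;   /* e.g. -march enabled the M extension */
  printf ("mul=%d atomic=%d\n", TARGET_MUL, TARGET_ATOMIC);
  return 0;
}
</code></pre>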
<p>Once that was understood, everything else was easier to do. “Just follow <span class="caps">MIPS</span>
and you’ll be fine,” I told myself, and it worked. I moved everything to <code>riscv.c</code>,
where all the other target description macros and functions are defined, and…
Boom! Working compiler.</p>
<h3>Result</h3>
<p>With all these changes it is now possible to generate a minimal compiler and
compile a file. As we said, we are only interested in the C-to-assembly
conversion at the moment, and that’s what we have and nothing else.</p>
<p>Taking the project as it is right now you can run:</p>
<pre><code class="language-bash">$ guix build -f guix.scm
...
/gnu/store/gsq72r3xnv7b2f1l4z5idpy3j900hizk-gcc-4.6.4-HEAD-debug
/gnu/store/qglp0cx0nq2nblcg9ya4gmc5gfk2amjg-gcc-4.6.4-HEAD-lib
/gnu/store/l612a4h9a6l4hs7kq49rph4clwf6l2k5-gcc-4.6.4-HEAD
</code></pre>
<p>So you’ll get something like this:</p>
<style>
code {
line-height: 1;
}
</style>
<pre><code class="language-bash">$ tree /gnu/store/l612a4h9a6l4hs7kq49rph4clwf6l2k5-gcc-4.6.4-HEAD
/gnu/store/l612a4h9a6l4hs7kq49rph4clwf6l2k5-gcc-4.6.4-HEAD
├── bin
│ ├── riscv64-unknown-linux-gnu-cpp
│ ├── riscv64-unknown-linux-gnu-gcc
│ ├── riscv64-unknown-linux-gnu-gcc-4.6.4
│ └── riscv64-unknown-linux-gnu-gcov
├── etc
│ └── ld.so.cache
├── libexec
│ └── gcc
│ └── riscv64-unknown-linux-gnu
│ └── 4.6.4
│ ├── cc1
│ ├── collect2
│ ├── install-tools
│ │ ├── fixincl
│ │ ├── fixinc.sh
│ │ ├── mkheaders
│ │ └── mkinstalldirs
│ └── lto-wrapper
├── riscv64-unknown-linux-gnu
│ └── lib
└── share
...
16 directories, 28 files
</code></pre>
<p>If you want to try it, you can generate an extremely simple C file and give it
a go:</p>
<pre><code class="language-bash">$ cat <<END > hello.c
int main (int argc, char * argv[]){
return 19;
}
END
$ /gnu/store/...-gcc-4.6.4-HEAD/bin/riscv64-unknown-linux-gnu-gcc -S hello.c
$ cat hello.s
.file "hello.c"
.option nopic
.text
.align 1
.globl main
.type main, @function
main:
add sp,sp,-32
sd s0,24(sp)
add s0,sp,32
mv a5,a0
sd a1,-32(s0)
sw a5,-20(s0)
li a5,19
mv a0,a5
ld s0,24(sp)
add sp,sp,32
jr ra
.size main, .-main
.ident "GCC: (GNU) 4.6.4"
</code></pre>
<p>This can later be assembled and linked using binutils without much
trouble, as we may have mentioned in the past.</p>
<h3>Conclusion</h3>
<p>The process, as you can see, is pretty much a pattern matching exercise, as I
already mentioned at the beginning. Of course, there were some places where I
needed to review the different APIs and their implementation, but those were
just a few. Not bad. We made this “work” in a short period of time and it looks
pretty good.</p>
<p>Now I need to test this further, make more complex programs and try it, but
it’s actually very difficult to do with the current compilation process because
the standard C library is not found correctly and the assembler and the linker
have to be dealt with independently. This means I need to fix the context
first and then review the compiler itself.</p>
<p>On the other hand, the memory model related code, the builtins and the code I
basically made up are a worrying part of the project, because they might be a
point of failure in the future. If they only matter for optimizations and
multithreading, that might not be an issue, but I don’t know how much of
that is used in the <span class="caps">GCC</span> version we are going to compile with this compiler.
Remember, our backport’s only goal is to compile a more recent <span class="caps">GCC</span> with it, so
we don’t really need to care about other programs.</p>
<p>I already asked some people<sup id="fnref:people"><a class="footnote-ref" href="#fn:people">13</a></sup> about the memory model parts and I got a
very simple solution from them (basically forget about the memory models and
always make a <code>fence</code> before and after synchronization code), so that’s going
to be solved for the next post, and I can always review the builtins later if I
need them.</p>
<p>The rest of the code looks like it would work in more complex cases, but still
this needs proper testing and I need to be able to include the standard C
library for that.</p>
<h3>Reviewing the code</h3>
<p>Of course, we are going to find bugs, and I did find some during this
process. Code review here is really hard to do, so it’s better to use
tricks and magic.</p>
<p>First of all, we need some debug symbols for <code>gdb</code> to find where the errors are
and be able to debug them properly. The Guix package we defined has a
binary-stripping step that moves all the debug symbols to a separate folder:</p>
<pre><code class="language-bash">$ guix build -f guix.scm
...
/gnu/store/gsq72r3xnv7b2f1l4z5idpy3j900hizk-gcc-4.6.4-HEAD-debug
/gnu/store/qglp0cx0nq2nblcg9ya4gmc5gfk2amjg-gcc-4.6.4-HEAD-lib
/gnu/store/l612a4h9a6l4hs7kq49rph4clwf6l2k5-gcc-4.6.4-HEAD
</code></pre>
<p>The <code>debug</code> directory there contains the debug symbols of the binaries so we
can just call <code>gdb</code> and then use the <code>symbol-file</code> command to load the debug
symbols associated with the program itself.</p>
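<p>You can reproduce the same separation by hand with stock binutils to see how <code>symbol-file</code> fits in (a generic sketch with made-up file names; the Guix store paths above are the real thing):</p>
<pre><code class="language-bash">printf 'int main(void){return 0;}\n' > demo.c
gcc -g -o demo demo.c
objcopy --only-keep-debug demo demo.debug   # keep the symbols aside
objcopy --strip-debug demo                  # strip them from the binary
# Then, inside gdb:
#   (gdb) file demo
#   (gdb) symbol-file demo.debug
</code></pre>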
<p>It is important to note that loading the <code>gcc</code> binary is a problem because it
is a driver that <code>exec</code>s other binaries, so the errors can’t really be followed
properly. It’s better to choose the specific program we want to debug, normally
<code>cc1</code>.</p>
<p>This happened to be extremely important because I forgot to convert one
function to the old <span class="caps">API</span> and it was giving a segmentation fault. Using the <span class="caps">GNU</span>
Debugger I found the source of the error and just replaced the formal arguments
with the proper ones.</p>
<h3>Last words</h3>
<p>So, all that being said, we covered the changes, the possible problems, how to
debug and what’s coming next. That was basically it.</p>
<p>If you have any question, suggestion, comment, or anything you want to share
about this, contact me<sup id="fnref:contact"><a class="footnote-ref" href="#fn:contact">14</a></sup>. I’d be very happy to discuss.</p>
<p>From here, the plan is to review what I already did, test more complex software
and share the results with you and also try to make the compilation process
more reasonable. I hope it’s easier to do than it looks.</p>
<p>Wish me luck.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:port">
<p><code>06166d9e5ff121fd3dfd6c0995621e557a023ef0</code> <a class="footnote-backref" href="#fnref:port" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:conflicts">
<p>I screwed the ChangeLog files anyway <span class="caps">LOL</span>. <a class="footnote-backref" href="#fnref:conflicts" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:md-files">
<p><code>af295d607786f96b4e8f2e35f41ca34820a9aacb</code> <a class="footnote-backref" href="#fnref:md-files" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:large-commit">
<p><code>14577a05e3d64c9e2a05e8f0ff1f8965ddb27b68</code> <a class="footnote-backref" href="#fnref:large-commit" title="Jump back to footnote 4 in the text">↩</a><a class="footnote-backref" href="#fnref2:large-commit" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
<li id="fn:last-commit">
<p><code>2b97a03a443fe8e408d7129bce9658032d0d9cd2</code> <a class="footnote-backref" href="#fnref:last-commit" title="Jump back to footnote 5 in the text">↩</a><a class="footnote-backref" href="#fnref2:last-commit" title="Jump back to footnote 5 in the text">↩</a></p>
</li>
<li id="fn:guilt">
<p>And I’m trying not to feel guilty for it. <a class="footnote-backref" href="#fnref:guilt" title="Jump back to footnote 6 in the text">↩</a></p>
</li>
<li id="fn:gcc-course">
<p>There’s a great set of <a href="https://www.cse.iitb.ac.in/grc/index.php?page=videos">videos about <span class="caps">GCC</span> at the <span class="caps">GCC</span> Resource
Center</a>. They
specifically talk about <span class="caps">GCC</span> 4.6! I watched them before going for the code and
they helped me a lot to understand how was the code organized and how did <span class="caps">GCC</span>
work. I recommend them a lot. <a class="footnote-backref" href="#fnref:gcc-course" title="Jump back to footnote 7 in the text">↩</a></p>
</li>
<li id="fn:efraim">
<p>This <code>local-file</code> thing I learned from Efraim Flashner, currently a
Guix maintainer, who gave a talk called “Compile it with Guix” where he
introduces this method. Sadly, I can’t find the talk in the web to link you
to it. <a class="footnote-backref" href="#fnref:efraim" title="Jump back to footnote 8 in the text">↩</a></p>
</li>
<li id="fn:staged">
<p>In this process you compile <span class="caps">GCC</span> with the compiler you had
(stage-1), then the resulting <span class="caps">GCC</span> compiles itself (stage-2), and the
resulting <span class="caps">GCC</span> compiles itself again (stage-3). One way to make sure
everything is correct is to compare the binaries of stage-2 and
stage-3. If they are the same, chances are that our code is correct.
If they are different, our code is wrong. <span class="caps">GCC</span>’s compilation framework does
this automatically (if <code>--disable-bootstrap</code> is not set) but you can’t do it
when cross-compiling, because there’s no way to run the stage-1 compiler. I
would like to see the result of this process, but I can’t at the moment. <a class="footnote-backref" href="#fnref:staged" title="Jump back to footnote 9 in the text">↩</a></p>
</li>
<li id="fn:see">
<p>See? That’s why I try to write blog posts about the things I do, that
way I don’t forget things. It was too late for this. <a class="footnote-backref" href="#fnref:see" title="Jump back to footnote 10 in the text">↩</a></p>
</li>
<li id="fn:def">
<p>These <code>.def</code> files are a lot of fun in <span class="caps">GCC</span>’s codebase. They appear
really often. They are files that look like a bunch of similar function calls,
but what they actually are is macro calls. These files are then <code>#include</code>d
into another file right after the macro is defined, so they generate code.
Later, you can redefine the macro to create some other output and <code>#include</code>
them again, so they’ll always generate coherent code. This is used a lot for
enums and switch-case statements: if you want them both to be coherent, you
can move them to a <code>.def</code> file, define all the possible values of the enum
there, and generate first the enum with one <code>#include</code> and later the
switch-case with a new <code>#include</code>. Take a look at <code>gcc/rtl.c</code> and
you’ll see what I mean. (Yes, I know this is like hardcore magic and it’s hard
to understand; I didn’t choose to do this.) <a class="footnote-backref" href="#fnref:def" title="Jump back to footnote 11 in the text">↩</a></p>
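<p>A tiny self-contained imitation of the trick (the list lives in a macro here instead of a separate <code>.def</code> file, but the mechanism is the same: expand the list once per output you want to keep coherent):</p>
<pre><code class="language-c">#include <stdio.h>

/* This list plays the role of the .def file.  */
#define RTX_CODES(X) X(PLUS) X(MINUS) X(MULT)

/* First expansion: generate the enum...  */
#define AS_ENUM(name) name,
enum rtx_code { RTX_CODES (AS_ENUM) NUM_CODES };

/* ...second expansion: generate a name table guaranteed to match it.  */
#define AS_STRING(name) #name,
static const char *code_names[] = { RTX_CODES (AS_STRING) };

int main (void)
{
  printf ("%s\n", code_names[MINUS]);  /* prints "MINUS" */
  return 0;
}
</code></pre>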
</li>
<li id="fn:hard-to-match">
<p>I say “hard to match” because searching for <code>TARGET_MUL</code> or
<code>MASK_MUL</code> gave <strong><span class="caps">NO</span></strong> results, and searching for <code>MUL</code> gave too many. <a class="footnote-backref" href="#fnref:hard-to-match" title="Jump back to footnote 12 in the text">↩</a></p>
</li>
<li id="fn:people">
<p>I asked Andrew Waterman himself (one of the authors of <span class="caps">RISC</span>-V, and
the current maintainer of the <span class="caps">RISC</span>-V <span class="caps">GCC</span> target). Yep, and he actually
answered. <a class="footnote-backref" href="#fnref:people" title="Jump back to footnote 13 in the text">↩</a></p>
</li>
<li id="fn:contact">
<p>You can find my contact info in the <a href="/pages/about.html">About
page</a>. <a class="footnote-backref" href="#fnref:contact" title="Jump back to footnote 14 in the text">↩</a></p>
</li>
</ol>
</div>ELF format — why not?2022-03-14T00:00:00+02:002022-03-14T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2022-03-14:/bootstrapGcc2.html<p>Some introduction to <span class="caps">ELF</span> as we’ll need to deal with this in the future.</p><p>In the <a href="https://ekaitz.elenq.tech/bootstrapGcc1.html">previous post</a> of the
<a href="https://ekaitz.elenq.tech/tag/bootstrapping-gcc-in-risc-v.html">series</a> we introduced <span class="caps">GCC</span> and how it
generates assembly code and we left a question unanswered: <em>“Why is learning
about <span class="caps">ELF</span> interesting if <span class="caps">GCC</span> generates assembly?”</em>. In this post we are going
to answer that question (not interesting) and maybe understand the very basics
of the <span class="caps">ELF</span> file format (more interesting).</p>
<h3>What’s <span class="caps">ELF</span></h3>
<p><span class="caps">ELF</span> is a file format with two main goals:</p>
<ul>
<li>Represent an executable file</li>
<li>Represent a linkable file</li>
</ul>
<p>Apart from that, <span class="caps">ELF</span> can also represent core dumps, but if you think about it,
all of the possible options have something in common: they represent contents
in memory. We can simply say <span class="caps">ELF</span> is a file format that acts as a picture of
the state of the memory. In the case of executables, the state will be
loaded from the file, while in the case of core dumps the state is obtained
from the memory and dumped to a file.</p>
<p>Linkable files are those files that can be combined with others to generate
executables or shared objects, so they can also fit that definition because
they are going to end up in the memory anyway.</p>
<p>For efficiency reasons, the <span class="caps">ELF</span> format has two separate views of the same contents:</p>
<ul>
<li>The <strong>Linking</strong> view is based on sections and needs a <em>section header</em>.</li>
<li>The <strong>Executable</strong> view is based on segments and needs a <em>program header</em>.</li>
</ul>
<h4><span class="caps">ELF</span> header</h4>
<p>The <span class="caps">ELF</span> header is the only thing that has a fixed position in the file: the
beginning. The <span class="caps">ELF</span> header has information that identifies the
file, the machine, the endianness and that sort of thing, but it also says
where the headers are located and gives the size of their entries and
their entry count.</p>
<p>It’s not that interesting, honestly. The most important thing is that it points to
the descriptions of both views (the headers) so we can check them.</p>
<h4>Linking view</h4>
<p>Based on sections, the linking view is the most detailed view of the file and
it defines how the file should be linked with others in order to create an
executable file.</p>
<p>Sections, the basic unit of the linking view, are consecutive sequences of
bytes that do not overlap.</p>
<p>There are <a href="https://refspecs.linuxfoundation.org/LSB_3.0.0/LSB-PDA/LSB-PDA.junk/sections.html">different types of sections according to their possible contents and
meaning</a>,
the most interesting are:</p>
<ul>
<li><code>SYMTAB</code> and <code>DYNSYM</code> hold a symbol table. The <code>DYNSYM</code> is for dynamic
linking symbols, while <code>SYMTAB</code> is normally used for static linking but may
contain both.</li>
<li><code>STRTAB</code> holds a string table.</li>
<li><code>RELA</code> contains relocation entries with addends and <code>REL</code> contains
relocations without addends.</li>
<li><code>NOTE</code> sections contain some extra information about the file.</li>
<li><code>HASH</code> contains a symbol hash table, necessary for dynamic linking.</li>
<li><code>DYNAMIC</code> for dynamic linking information.</li>
</ul>
<p>Each section also has a <code>name</code>, an <code>address</code> if it is supposed to appear in the
memory of a running process, an <code>offset</code> that defines where in the file the
section’s contents appear, a <code>size</code>, and some extra data fields that all
together form a section header entry.</p>
<p>The section header entries are all located where the <span class="caps">ELF</span> header says, one after
the other (like a C array of structures), so the programs just need to access
that position in the file and read all the headers in a row. The contents of
the sections are located throughout the file, where the section headers point.</p>
<h5>String section</h5>
<p>The string section (<code>STRTAB</code>) is one of the simplest. It contains all the
strings of the file: the section and symbol names. It’s simply a set of null
terminated strings, written one after the other (it also starts with a null
character but whatever).</p>
<p>Anywhere in the file where we are supposed to get a string, what we actually get
is an index that points to the position in this section to start reading from.
We read from there until we reach a null character. For example, in the following
string section:</p>
<pre><code> \0 h e l l o \0 n a m e \0
</code></pre>
<p>If the name of a section says <code>1</code>, the actual name of the section is <code>hello</code>,
and if it says <code>7</code> it would be <code>name</code>. Also, if it says <code>9</code> it would be <code>me</code>;
this trick can be used too.</p>
<h5>Symbol table</h5>
<p>The symbol table contains information needed to locate and relocate a program’s
symbolic definitions and references. The symbol table is formed as an array of
symbol elements that are defined with a <code>name</code>, obviously a <code>value</code>, their
<code>size</code>, some extra <code>info</code>, the index of the section header they relate to
(<code>shndx</code>) and some <code>other</code> stuff.</p>
<p>The <code>info</code> field encodes the symbol’s type (<code>OBJECT</code> for data, <code>FUNC</code> for
functions…) and its binding attributes, which define the linking visibility and
behavior of the symbol (local vs global…).</p>
<p>The <code>value</code> can be interpreted in several ways too, depending on the type of
the symbol you are dealing with. But that’s not really relevant for us at the moment.</p>
<h5>Relocation</h5>
<p>According to the <span class="caps">ELF</span> documentation I got from somewhere I don’t really remember:</p>
<blockquote>
<p>The relocation is the process of connecting symbolic references with symbolic definitions. </p>
</blockquote>
<p>I hope it’s more explanatory for you than it is for me, because I don’t have a
clue what it is supposed to mean. The
<a href="https://en.wikipedia.org/wiki/Relocation_(computing)">Wikipedia</a> does a <strong>much
better</strong> job with the specifics right here:</p>
<blockquote>
<p>Relocation is the process of assigning load addresses for position-dependent
code and data of a program and adjusting the code and data to reflect the
assigned addresses.</p>
</blockquote>
<p>If this doesn’t really help, you have a really good example later, but we can
basically say that it’s a way to adjust the code to point to the correct
addresses at linking, loading, or even execution time.</p>
<p><span class="caps">ELF</span> files have, as we said, sections that let us define relocations. These
point to some parts of the file and tell the linker or the loader that those
positions of the file must be reprocessed.</p>
<p>There are two types of relocation sections, and in both of them the relocation
section is an array of entries where each entry represents one relocation.
In the simple one (<code>REL</code>) each relocation only contains an <code>offset</code> and an
<code>info</code> word, which also encodes the type of relocation to apply. The more
complex one (<code>RELA</code>) is mostly the same but adds an <code>addend</code>, a
constant value to use in the calculation of the relocation.</p>
<p>The calculation of the final addresses is specific to the <span class="caps">ISA</span> and the relocation
type, because processors have different instruction formats and different ways
to pack addresses in instructions. <span class="caps">RISC</span>-V has no way to pack a full address
inside an instruction, while x86 does, so they have to patch the
instructions in different ways.</p>
<h5>Special sections</h5>
<p>Some sections get special treatment according to their name, normally the
ones that start with a dot. You might have found these in the past in assembly
files, defined like <code>.data</code> (for data), <code>.rodata</code> (for read-only data) or
<code>.text</code> (for code).</p>
<p>These are interesting to have in mind because they appear the same way they do
in assembly, and we are going to disassemble some of them and play around with them.</p>
<p>Other special sections like <code>.got</code> or <code>.dynamic</code> don’t appear in assembly but
they have a strong meaning in the resulting file. We are not going to deal with
those today because we want to finish this post someday. If you need to deal
with them I recommend you read <span class="caps">ELF</span>’s documentation on special sections and
the loading process.</p>
<h4>Executable view</h4>
<p>The executable view is another way to access the same contents, but with a
different perspective. It’s based on <em>segments</em> rather than <em>sections</em>.
Segments are also pieces of the file, as sections are, but segments can contain
one or more sections.</p>
<p>Like in the linking view, the base unit (sections for the linking view,
segments for the executable view) is described in a header. The header of the
executable view is called the program header and it is, like the section header, a
bunch of structures piled together, each describing one of the segments.</p>
<p>The program header describes the position and size in the file of each of the
segments but also some important information about them: how they are supposed
to be loaded in the memory and where (virtual address and physical address),
the type of the segment, and some more info.</p>
<p>The most interesting segment types are the following:</p>
<ul>
<li><code>LOAD</code> is used for loadable segments; the other fields of the segment
describe the position and the size this segment will have in memory.</li>
<li><code>DYNAMIC</code> are segments that have some dynamic linking information. It has to
contain the <code>.dynamic</code> section.</li>
<li><code>INTERP</code> gives the location and size of a null-terminated path name to invoke
as an <em>interpreter</em>. Interpreter in this context usually means a dynamic
linker, which is invoked instead of loading this file into memory directly; the
dynamic linker will then load the parts of the file it considers necessary.</li>
</ul>
<p>You can see how segments are interesting for loading the file in the memory,
that is, they are mostly interesting for executable files or shared objects.</p>
<h4>Segments vs Sections</h4>
<p>If you want to have a clear idea about the difference between segments and
sections, you can consider a file with multiple sections: <code>.text</code>, <code>.rodata</code>
and <code>.data</code>.</p>
<p>A file that contains those sections can be understood from a linking
perspective as a file that has some code (<code>.text</code>), read-only data (<code>.rodata</code>)
and read-write data (<code>.data</code>). Each of those parts must be managed in a
different way by the linker, but the reality is that the program loader doesn’t
really care about some of the differences between them.</p>
<p>The code and the read-only data are loaded in the memory in the same way, with
read and execute permission but no write permission, so the executable view can
put both sections in the same segment, and make the loader’s life easier.</p>
<p>Also, the linker doesn’t really care about how the memory is loaded, so the
section header does not hold that information. It does care about the sections’
goals though, as it will need to put them together in order during the linking.
On the other hand, the loader is not really interested in the goal of
the contents of the file but only in what to do with those contents, so it only
has that information.</p>
<h3>So, why do we need to learn it?</h3>
<p>We don’t really need to learn it very deeply, just learn how it works at a
high level and make sure we are able to read it with the tools we have
available. The good news for you is that if the reasons I give you are not good
enough it doesn’t really matter, because you already learned<sup id="fnref:gotcha"><a class="footnote-ref" href="#fn:gotcha">1</a></sup>. Continue
reading and you’ll realize how much you understand now.</p>
<p>First, let me tell you a personal story. I have previous experience working
with assembly, but only in small devices that have two memories, one for data
and another for code (Harvard architecture). In those small devices you often
don’t really need to think about how the code and the data are mapped to memory
because your programs are small and the separation is clear. Computers are a
different thing, and I have had issues understanding this whole assembly thing.</p>
<p>Computers store both code and data in the same memory, the main memory (Von
Neumann architecture), and they normally have memory segmentation, paging,
memory management units and all that kind of stuff, because there are many
processes running and they need to be separated from each other. That forces us
to think about how the code and the data are mapped to the memory. Modern
operating systems also use dynamic linkers, which are not available in small
devices, and we need to be able to deal with that amount of complexity.</p>
<p><span class="caps">ELF</span> allows us to do all that, because it was born for it. <span class="caps">ELF</span> is a
distillation of many of the ideas from System V Unix, which include exactly what
I mentioned. It’s a great way to understand how memory, linking and processes
work in a <em>modern</em> operating system. This is why you need to learn it, at least
a little. It makes you a cultivated person, which is always good<sup id="fnref:system-v"><a class="footnote-ref" href="#fn:system-v">2</a></sup>.</p>
<h4>The specifics</h4>
<p>As I’m sure you are not totally satisfied with the answer of becoming a cultivated
person<sup id="fnref:some-of-you"><a class="footnote-ref" href="#fn:some-of-you">3</a></sup>, let me go for some specifics.</p>
<p>In this project <span class="caps">GCC</span> is not the only software we are dealing with: <span class="caps">GNU</span>
Binutils and TinyCC are part of the party too, and I need to make them fit
together in the best way possible. In all of them I need to make sure the
relocations, formats and other things work properly, following the <span class="caps">RISC</span>-V <span class="caps">ABI</span>
specification for <span class="caps">ELF</span>. That might be a point of failure, so being prepared, at
least at a high level, is interesting.</p>
<p>Of course, we need to analyze <span class="caps">GCC</span>’s output too, and in order to do that we need
to make sure we know what it means. We already saw that some <span class="caps">ELF</span> sections are
directly mentioned in the assembly, so <span class="caps">ELF</span> is a
good way to learn their meanings. They are really an <span class="caps">OS</span>-related thing that <span class="caps">ELF</span> only
reflects, but learning them from the <span class="caps">ELF</span> perspective probably makes the path easier.</p>
<p>Relocations are a huge point in all this mess, because they are machine
specific (instructions are too, but those I expect us to know already), and
they are something I didn’t need to research in all the <span class="caps">RISC</span>-V adventures I had
last year. I have to do it sometime.</p>
<p>In general, there are many sharp edges where we can get hurt, so it’s better if
we wear gloves.</p>
<h3>Tools</h3>
<p>For all this process there are a couple of tools that were designed to help.
<span class="caps">GNU</span> Binutils has many of them but we are going to focus on two, as they are
more than enough for many use cases: <code>objdump</code> and <code>readelf</code>.</p>
<p>The example below uses both of them to analyze a piece of code and its
compilation result. As you’ll see, the main problem they have is their output:
it’s not always clear, the formatting is a little bit chaotic, it’s not
at all obvious to get right, and it’s really hard to use procedurally.</p>
<p>There is a really cool tool you should investigate though, called <span class="caps">GNU</span> Poke,
that is designed specifically to fight those issues. I recommend you
<a href="https://www.gnu.org/software/poke/">take a look at it</a>.</p>
<h3>Example</h3>
<p>Starting from a very simple C file we can follow a really interesting process
and understand some of the <span class="caps">ELF</span> internals:</p>
<pre><code class="language-c">long global_symbol;
int main() {
return global_symbol != 0;
}
</code></pre>
<p>We compile it to assembly with:</p>
<pre><code class="language-asdf">$ riscv64-linux-gnu-gcc -S b.c -O0
</code></pre>
<p>These are the contents of the assembly file:</p>
<pre><code class="language-asm"> .file "b.c"
.option pic
.text
.globl global_symbol
.bss
.align 3
.type global_symbol, @object
.size global_symbol, 8
global_symbol:
.zero 8
.text
.align 1
.globl main
.type main, @function
main:
addi sp,sp,-16
sd s0,8(sp)
addi s0,sp,16
lla a5,global_symbol
ld a5,0(a5)
snez a5,a5
andi a5,a5,0xff
sext.w a5,a5
mv a0,a5
ld s0,8(sp)
addi sp,sp,16
jr ra
.size main, .-main
.ident "GCC: (Debian 10.2.1-6) 10.2.1 20210110"
.section .note.GNU-stack,"",@progbits
</code></pre>
<p>Assemble the file with <code>as</code>:</p>
<pre><code class="language-asdf">$ riscv64-linux-gnu-as b.s -o b.o
</code></pre>
<p>And this is what we get in <code>b.o</code>. The <code>.text</code> section contains the following:</p>
<pre><code class="language-asm">$ riscv64-linux-gnu-objdump --disassemble b.o
b.o: file format elf64-littleriscv
Disassembly of section .text:
0000000000000000 <main>:
0: ff010113 addi sp,sp,-16
4: 00813423 sd s0,8(sp)
8: 01010413 addi s0,sp,16
c: 00000797 auipc a5,0x0
10: 00078793 mv a5,a5
14: 0007b783 ld a5,0(a5) # c <main+0xc>
18: 00f037b3 snez a5,a5
1c: 0ff7f793 andi a5,a5,255
20: 0007879b sext.w a5,a5
24: 00078513 mv a0,a5
28: 00813403 ld s0,8(sp)
2c: 01010113 addi sp,sp,16
30: 00008067 ret
</code></pre>
<h3>Relocations</h3>
<p>There are some relocations!</p>
<pre><code class="language-asdf">$ riscv64-linux-gnu-objdump b.o -r
b.o: file format elf64-littleriscv
RELOCATION RECORDS FOR [.text]:
OFFSET TYPE VALUE
000000000000000c R_RISCV_PCREL_HI20 global_symbol
000000000000000c R_RISCV_RELAX *ABS*
0000000000000010 R_RISCV_PCREL_LO12_I .L0
0000000000000010 R_RISCV_RELAX *ABS*
</code></pre>
<p>But in order to understand those relocations properly we need to check the
value of the symbols too:</p>
<pre><code class="language-asdf">$ riscv64-linux-gnu-objdump -t b.o
b.o: file format elf64-littleriscv
SYMBOL TABLE:
0000000000000000 l df *ABS* 0000000000000000 b.c
0000000000000000 l d .text 0000000000000000 .text
0000000000000000 l d .data 0000000000000000 .data
0000000000000000 l d .bss 0000000000000000 .bss
0000000000000000 l d .note.GNU-stack 0000000000000000 .note.GNU-stack
000000000000000c l .text 0000000000000000 .L0
0000000000000000 l d .comment 0000000000000000 .comment
0000000000000000 g O .bss 0000000000000008 global_symbol
0000000000000000 g F .text 0000000000000034 main
</code></pre>
<p>If you pay attention to the offsets of those relocations (<code>0x0c</code> and <code>0x10</code>),
they exactly match the instructions <code>auipc a5, 0x0</code> and <code>mv a5, a5</code>, and those
are expanded from the <code>lla a5, global_symbol</code> (load local address)
pseudoinstruction in the assembly.</p>
<p>The <code>mv</code> is not really a <code>mv</code>. <code>mv</code> is a pseudoinstruction too, which should be
expanded to an <code>addi a5, a5, 0</code>. <code>objdump</code> is playing with us: it makes the
opposite conversion so we can read better, but in fact it is tricking us.</p>
<p>The <code>auipc</code> + <code>addi</code> couple appears pretty often in <span class="caps">RISC</span>-V, because it’s the
method the architecture has to load addresses. The first instruction, <code>auipc</code>, adds
the high part of an immediate to the program counter and stores the result in a
register; the <code>addi</code> then adds another immediate, in this case the low part, to the
register, i.e. together they make a <code>x[reg] = pc + immediate</code> operation in two steps:
<code>x[reg] = pc + hi20(immediate)</code> followed by <code>x[reg] = x[reg] + lo12(immediate)</code>.</p>
<p>As we have relocations in both <code>auipc</code> and <code>addi</code>, this means their <code>0</code> values
(the immediates) are going to be overwritten with something else at linking
time, and that’s where <span class="caps">RISC</span>-V has something to say. All the relocations we can
see are <span class="caps">RISC</span>-V specific, and you can read about them in the <a href="https://github.com/riscv-non-isa/riscv-elf-psabi-doc"><span class="caps">RISC</span>-V <span class="caps">ABI</span>
Specification</a>.</p>
<p>In our case we have some really simple ones, the easiest to understand (what a
coincidence, huh?):</p>
<blockquote>
<p><code>R_RISCV_PCREL_HI20</code>: High 20 bits of 32-bit <span class="caps">PC</span>-relative reference,
<code>%pcrel_hi(symbol)</code>. The formula is: <code>S+A-P</code> [but only obtains the highest 20 bits].</p>
<p><code>R_RISCV_PCREL_LO12_I</code>: Low 12 bits of a 32-bit <span class="caps">PC</span>-relative,
<code>%pcrel_lo(address of %pcrel_hi)</code>, the addend must be 0. The formula is:
<code>S-P</code> [but it only obtains the lowest 12 bits].</p>
</blockquote>
<p>Both the <code>HI20</code> and the <code>LO12</code> have a similar formula, this is the meaning of
the elements on the formula:</p>
<ul>
<li><code>S</code>: Address of the symbol</li>
<li><code>A</code>: Addend of the relocation</li>
<li><code>P</code>: Position of the relocation</li>
</ul>
<p>If you match their formulas with what we just said about how
<code>auipc</code> + <code>addi</code> couples work, you can easily understand the formulas and
their meaning. We are not going to do it here; do something yourself!</p>
<p>The other relocation:</p>
<blockquote>
<p><code>R_RISCV_RELAX</code>: Instruction can be relaxed, paired with a normal relocation
at the same address.</p>
</blockquote>
<p>This one is an addition our example doesn’t end up using, but it could. The <code>R_RISCV_RELAX</code>
basically means that if the relocation it is paired with is not needed, it can be
discarded. And when does that happen? Easy: when we can get <code>global_symbol</code>’s
address with only one of the instructions, we can remove the other one from the program.</p>
<h4>Relocation resolution</h4>
<p>If we link the file and generate an executable, we can see the final value
those zeroes get.</p>
<pre><code class="language-asdf">$ riscv64-linux-gnu-gcc b.o -o b.out
</code></pre>
<p>We link it like this because calling <code>ld</code> directly needs a lot of input arguments
and we don’t want to set them all by hand, but you can do it with <code>ld</code> if you feel like it.</p>
<pre><code class="language-asdf">$ riscv64-linux-gnu-objdump --disassemble b.out
...
00000000000005e4 <main>:
5e4: ff010113 addi sp,sp,-16
5e8: 00813423 sd s0,8(sp)
5ec: 01010413 addi s0,sp,16
5f0: 00002797 auipc a5,0x2
5f4: a6878793 addi a5,a5,-1432 # 2058 <global_symbol>
5f8: 0007b783 ld a5,0(a5)
5fc: 00f037b3 snez a5,a5
600: 0ff7f793 andi a5,a5,255
604: 0007879b sext.w a5,a5
608: 00078513 mv a0,a5
60c: 00813403 ld s0,8(sp)
610: 01010113 addi sp,sp,16
614: 00008067 ret
...
</code></pre>
<p>There you can see the relocation was resolved (<code>0x5f0</code> and <code>0x5f4</code>) by the linker
and the final values have been filled in. <code>objdump</code> is intelligent enough to tell
us where those instructions are pointing (it says <code>2058 <global_symbol></code>). Just to
make sure, we can search the symbol table for <code>global_symbol</code>:</p>
<pre><code class="language-asdf">$ riscv64-linux-gnu-objdump -t b.out | grep global_symbol
0000000000002058 g O .bss 0000000000000008 global_symbol
</code></pre>
<blockquote>
<p><span class="caps">NOTE</span>: We could try to calculate the address of the <code>global_symbol</code> as the
linker did, but it’s a little bit complicated because we also linked the file
with the standard library and the startup files, which add the <code>crt</code> files
on top of ours. The result is that we get more code than what we had in the
assembly file. If you want to see that, you can check the rest of the output of
the command, or even try <code>--disassemble-all</code> and calculate the symbol
address by hand. Good luck.</p>
</blockquote>
<h4>More sections</h4>
<p>If you want to review some simple things, like a string section, you can use
<code>readelf</code> for that. The <code>-p</code> flag (equivalent to <code>--string-dump=</code>) displays the
contents of a section as strings. You can read the <code>.comment</code> section that way:</p>
<pre><code class="language-asdf">$ riscv64-linux-gnu-readelf -p .comment b.o
String dump of section '.comment':
[ 1] GCC: (Debian 10.2.1-6) 10.2.1 20210110
</code></pre>
<p>This is what the compiler inserted with <code>.ident</code> in the assembly file. We have
it in the binary too.</p>
<p>In other distros the output is a little bit different. Look at the output we have
in Guix:</p>
<pre><code class="language-asdf">String dump of section '.comment':
[ 1] GCC: (GNU) 11.2.0
</code></pre>
<h3>Conclusion</h3>
<p>So this whole thing just to explain that <span class="caps">ELF</span> files are some kind of dual files
that have two different goals at the same time. The executable one is kind of a
picture of the memory state that can be used for loading that state into
memory, while the linking one just describes how different parts of the
contents relate to each other and has tons of funny tricks to make the files
relocatable, position independent and that kind of thing. Cool.</p>
<p>There are still many parts of <span class="caps">ELF</span> we didn’t talk about, but I consider this
introduction more than enough. Having a simple understanding of how the
file is organized and what kind of information it holds is probably enough for the
things we are going to need.</p>
<p>The proposed example shows that with the knowledge obtained from this short
introduction we can dig a little bit into the files that result from a
compilation and analyze their internals. That’s mostly the work I’ll need to do
when I start combining compilers in a pipeline of death and destruction.</p>
<p>If I ever need to dig deeper into something, I will.</p>
<p>Anyway, I’m still unsure if I answered the question we left in the previous
post<sup id="fnref:cliff"><a class="footnote-ref" href="#fn:cliff">4</a></sup>:</p>
<blockquote>
<p>Why is learning about <span class="caps">ELF</span> interesting if <span class="caps">GCC</span> generates assembly?</p>
</blockquote>
<p>Did I?</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:gotcha">
<p>Ha! Gotcha! <a class="footnote-backref" href="#fnref:gotcha" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:system-v">
<p>It also makes you understand the complexities of the system so you
can criticize it. Changing the world requires learning about it first. <a class="footnote-backref" href="#fnref:system-v" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:some-of-you">
<p>For those that really are. That’s the good attitude in life.
High five. You can read the whole section still, it has interesting points I
think. <a class="footnote-backref" href="#fnref:some-of-you" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:cliff">
<p>It was a good cliffhanger, though. <a class="footnote-backref" href="#fnref:cliff" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
</ol>
</div>GCC internals — From a porting perspective2022-03-08T00:00:00+02:002022-03-08T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2022-03-08:/bootstrapGcc1.html<p>Deep diving into <span class="caps">GCC</span>’s internals from the perspective of someone who
wants to port <span class="caps">GCC</span> for a new architecture.</p><p>In the <a href="https://ekaitz.elenq.tech/bootstrapGcc0.html">previous post</a> of the
<a href="https://ekaitz.elenq.tech/tag/bootstrapping-gcc-in-risc-v.html">series</a> the problem of the <span class="caps">GCC</span> bootstrapping
was introduced. In this post we’ll describe how <span class="caps">GCC</span> works, from the
perspective of someone who wants to port it, so we understand what job we
have to do.</p>
<ol>
<li><a href="#disclaimer">Disclaimer</a></li>
<li><a href="#intro">Overview</a><ol>
<li><a href="#cfg">The compiler generation framework</a></li>
<li><a href="#gcc-coordinator"><span class="caps">GCC</span> as a coordinator</a></li>
</ol>
</li>
<li><a href="#parsing">Source code parsing</a><ol>
<li><a href="#generic"><span class="caps">GENERIC</span></a></li>
</ol>
</li>
<li><a href="#gimple"><span class="caps">GIMPLE</span></a></li>
<li><a href="#rtl">Register Transfer Language</a><ol>
<li><a href="#target-dependent">Target-dependent code</a></li>
<li><a href="#md">Machine description files</a><ol>
<li><a href="#mm">Machine modes</a></li>
<li><a href="#rtl-templates"><span class="caps">RTL</span> Templates</a></li>
</ol>
</li>
<li><a href="#target-desc">Target description macros and functions</a></li>
</ol>
</li>
<li><a href="#assembly">Assembly code generation</a></li>
<li><a href="#summary">Summary</a></li>
<li><a href="#job">My job in the backport</a></li>
<li><a href="#last">Last words</a></li>
<li><a href="#more">Learn more</a></li>
</ol>
<h3 id="disclaimer">Disclaimer</h3>
<ul>
<li>This post may be only valid for old <span class="caps">GCC</span> versions, like 4.something, because
that’s the one I’m interested in. More recent versions may have different
details, but I don’t expect them to be very different to what is described
here. More specifically: I’m working on <span class="caps">GCC</span> 4.6.4, and the first <span class="caps">GCC</span> with
<span class="caps">RISC</span>-V support is <span class="caps">GCC</span> 7.0.0.</li>
<li>This post will focus on how <span class="caps">GCC</span> compiles C programs because that’s the part
we care about. Some other languages have differences on how they are treated
but that’s not very relevant for us, as it has no implications on the
<em>back-end</em>.</li>
</ul>
<p>Both of these points will get clearer later.</p>
<h3 id="intro">Overview</h3>
<p><span class="caps">GCC</span> is structured as a pipeline of several steps that run one after the other.</p>
<ol>
<li>Source code parsing</li>
<li><span class="caps">GIMPLE</span> <span class="caps">IR</span> generation (target-independent)</li>
<li>Some <span class="caps">GIMPLE</span> tree optimizations</li>
<li><span class="caps">RTL</span> <span class="caps">IR</span> generation (target-dependent)</li>
<li><span class="caps">RTL</span> optimizer</li>
<li>Assembly code generator</li>
</ol>
<p>Before starting to analyze each of the steps independently there are a couple
of things to clarify.</p>
<h4 id="cfg">The Compiler Generation Framework</h4>
<p>An important point to note is that <span class="caps">GCC</span> is a <strong>compiler collection</strong>, meaning that it
is able to compile code from many high-level languages (<span class="caps">HLL</span>) and for many
different targets. This has implications on how some steps map to <span class="caps">GCC</span>’s
source code.</p>
<p>The most important thing of all this is to differentiate between <span class="caps">GCC</span>’s code and
an actual <code>gcc</code> executable. The key point here is that <span class="caps">GCC</span>’s codebase includes
what is called <span class="caps">CGF</span> (Compiler Generator Framework) that can generate <code>gcc</code>
executables from <span class="caps">GCC</span>’s code. The <span class="caps">CGF</span> generates <code>gcc</code> executables according to
the input (target machine, host machine…) we give it, but the generated <code>gcc</code>
executables may differ one from another even if they were generated from the
same codebase.</p>
<p>Any <code>gcc</code> executable is able to compile any input <span class="caps">HLL</span><sup id="fnref:language"><a class="footnote-ref" href="#fn:language">1</a></sup> (C, C++,
Objective-C, Ada, Fortran and Go<sup id="fnref:java"><a class="footnote-ref" href="#fn:java">2</a></sup>), so <span class="caps">GCC</span>’s code must include parsers
for each of these languages.</p>
<p>On the other hand, <code>gcc</code> executables are only able to generate code for one
target (x86, <span class="caps">MIPS</span>, <span class="caps">ARM</span>, <em><span class="caps">RISC</span>-V</em>…), that must be chosen when <span class="caps">GCC</span> is compiled.
In order to make the porting efforts easier, <span class="caps">GCC</span> has a set of tools that
generate the target-dependent code from some configuration files called Machine
Descriptions (<span class="caps">MD</span>).</p>
<p>Putting all this together, source code parsing and <span class="caps">AST</span> generation depend on the
input <span class="caps">HLL</span>, and the code that runs for each <span class="caps">HLL</span> is <strong>selected</strong> when <code>gcc</code> runs
(<em>step 1</em>). The intermediate representation, <span class="caps">GIMPLE</span>, is target-independent so
everything related with that is <strong>copied</strong> inside the final <code>gcc</code> executable
(<em>steps 2 and 3</em>). The <span class="caps">RTL</span> (Register Transfer Language) representation and
assembly code generation are target-dependent and the code related to that is
<strong>generated</strong> from <span class="caps">MD</span> files when <span class="caps">GCC</span> is compiled (<em>steps 4, 5 and
6</em>)<sup id="fnref:rtl-opt"><a class="footnote-ref" href="#fn:rtl-opt">3</a></sup>.</p>
<p>This all means that if we want to read the source code of <span class="caps">GCC</span> we have to
keep clearly in mind how the source code maps to the actual executable: if
we generate a <code>gcc</code> executable for <strong>x86</strong> it won’t contain the code for other
architectures and <strong>it won’t even check whether that code is correctly programmed</strong>,
because it’s not going to compile it.</p>
<h4 id="gcc-coordinator"><span class="caps">GCC</span> as a coordinator</h4>
<p>Many <span class="caps">GCC</span> users or C programmers (or me, not that long ago) might think there is
something missing from the list of steps we just reviewed. The normal use case
of calling <code>gcc</code> like</p>
<pre><code class="language-bash">$ gcc -o helloworld helloworld.c
</code></pre>
<p>does several steps internally that we need to separate:</p>
<ol>
<li>Preprocessing: the resolution of preprocessor macros like <code>#define</code> and
stuff like that.</li>
<li>Compiling to assembly: the generation of assembly code files per compilation
unit (a file that is the output of the preprocessor).</li>
<li>Assembly: the conversion from an assembly file to an <span class="caps">ELF</span> object file.</li>
<li>Linking: the executable or library generation from the <span class="caps">ELF</span> object files
created in the previous step.</li>
</ol>
<p>The reality is there’s more than one program involved here and <code>gcc</code> is just a
coordinator that makes other programs run if needed.</p>
<p>The preprocessor is called <code>cpp</code> and it is generated from the <span class="caps">GCC</span> codebase. The
compiler is <code>gcc</code> itself but the assembler and the linker are generally
obtained from <span class="caps">GNU</span> Binutils’ <code>as</code> and <code>ld</code> respectively.</p>
<p>So, one of the most important things to understand is <span class="caps">GCC</span> only generates
assembly, but it looks like it doesn’t<sup id="fnref:tinycc-asm"><a class="footnote-ref" href="#fn:tinycc-asm">4</a></sup>.</p>
<p>This means we need proper support for our architecture in the assembler and
the linker too. But we’ll keep that story for another day<sup id="fnref:post-long"><a class="footnote-ref" href="#fn:post-long">5</a></sup>.</p>
<hr>
<h3 id="parsing">Source code parsing</h3>
<blockquote>
<p><span class="caps">HLL</span> dependent. Generates appropriate <span class="caps">IR</span></p>
</blockquote>
<p>So the first step of the compiler is to process the input text and convert it
to the appropriate Intermediate Representation. The most used intermediate
representation is <span class="caps">GENERIC</span>, which was designed for C but fits other procedural
languages pretty well<sup id="fnref:fortran"><a class="footnote-ref" href="#fn:fortran">6</a></sup>.</p>
<p>This parsing process is not really relevant for us, as we want to add a new
target, but it’s interesting to note because it gives shape to the codebase:
<span class="caps">GCC</span> splits the code for the different input languages into folders named like
<code>gcc/$LANGUAGE</code>.</p>
<h4 id="generic"><span class="caps">GENERIC</span></h4>
<p><span class="caps">GENERIC</span> is just a representation; we don’t need to care that much about it, but
a few words are not going to hurt anyone. <span class="caps">GENERIC</span> is a tree representation: a set of
nodes with some extra common information. Those node types can be read in
<code>gcc/tree.def</code>.</p>
<p>A simple example of this could be a function declaration, which would be a
node of type <code>FUNCTION_DECL</code> that has some sub-nodes: one for the return
type, another for the body of the function and another for its arguments.</p>
<p>It’s a simple <span class="caps">AST</span> you could come up with yourselves, except for the fact that it is
pretty complex. 😅</p>
<h3 id="gimple"><span class="caps">GIMPLE</span></h3>
<blockquote>
<p><span class="caps">HLL</span>- and Target- independent representation</p>
</blockquote>
<p>The next step is called <em>Gimplification</em> (see <code>gimplify.c</code>), the process of
converting to <span class="caps">GIMPLE</span>. Normally, representing the <span class="caps">AST</span> as <span class="caps">GIMPLE</span> is too complex
to be done in one step, so <span class="caps">GENERIC</span> (plus some extensions) is used as an
intermediate step that is easier to create.</p>
<p><span class="caps">GIMPLE</span> is the central internal representation of <span class="caps">GCC</span>. It’s target-independent
and High-Level-Language-independent. At this point some optimizations can be
applied: those related to the structure of the source code, like loop
unrolling or dead code elimination.</p>
<p>From the porting perspective, this representation is important, as it’s the
border line between the front-end and the back-end, and we are interested in
the latter. A really interesting part to understand is how this is converted
to the next representation, <span class="caps">RTL</span>.</p>
<h3 id="rtl">Register Transfer Language (<span class="caps">RTL</span>)</h3>
<blockquote>
<p>Target-dependent low level representation</p>
</blockquote>
<p>The next part of the compiler work is done using the <span class="caps">RTL</span> intermediate
representation. The <span class="caps">RTL</span> representation is based on <span class="caps">LISP</span>, so we have a reason to
love it, and it serves two purposes:</p>
<ol>
<li>Specify target properties via the Machine Descriptor files. These Machine
Descriptor files are text files that look like <span class="caps">LISP</span> and are processed at
compilation time.</li>
<li>Represent a compilation. Meaning that the <span class="caps">RTL</span> is also an intermediate
representation, a low-level one, that represents sets of instructions.</li>
</ol>
<p><span class="caps">GCC</span> does not make any distinction between the first and the second purpose,
calling both <span class="caps">RTL</span>, but there are some differences in the purpose and the shape of the
<span class="caps">RTL</span>. <span class="caps">RTL</span> has both an internal form represented by structures (case 2) and an
external form represented as a text file (case 1).</p>
<p><span class="caps">RTL</span> is formed by a set of objects: expressions, integers, wide integers,
strings and vectors. In the textual form they are represented as in <span class="caps">LISP</span>:
double quotes for strings, brackets for vectors… and a lot of
parentheses. The internal representation is what you can imagine: structures for
expressions, integer types for integers, <code>char*</code> for strings, etc.</p>
<p>The most interesting <span class="caps">RTL</span> objects are expressions, aka <span class="caps">RTX</span>, which are just a name
(an expression code) plus a number of arguments.</p>
<p>This is how a piece of <span class="caps">RTL</span> may look; it represents an instruction that sets
register 0 to the result of adding register 1 and the constant
integer 10 (see <code>rtl.def</code> for more information):</p>
<pre><code class="language-lisp">(set (reg 0)
(plus (reg 1)
(const_int 10)))
</code></pre>
<p>In the example the only things that are not expressions are the numbers (0, 1
and 10); all the rest you can find in <code>rtl.def</code> and see what they mean.</p>
<p>From <span class="caps">GIMPLE</span>, there are two steps left to reach our target, assembly code, and
both involve <span class="caps">RTL</span>. The first maps the <span class="caps">GIMPLE</span> nodes to pattern names in a
target-independent way, generating a list of <span class="caps">RTL</span> <code>insn</code>s. The second matches
those <code>insn</code> lists to <span class="caps">RTL</span> templates described in Machine Description files and
uses those matches to generate the final assembly code.</p>
<p>Those <code>insn</code>s are objects that represent code in <span class="caps">RTL</span>. Each function is
described with a doubly-linked list of <code>insn</code>s. You can think about them as
<em>instructions</em> in the <span class="caps">RTL</span> world.</p>
<p>In the first step, the <span class="caps">RTL</span> <code>insn</code> generation step, only the names matter (and
they are hardcoded in the compiler), while in the second the structure of the
<code>insn</code> is going to be analyzed as we’ll see later.</p>
<h4 id="target-dependent">Target-dependent code</h4>
<p>As we previously said, the code for the target-dependent steps is generated at
compile time and then inserted into the final <code>gcc</code> executable. All this code is
located in one folder per target, under <code>gcc/config/$TARGET</code>, so the <span class="caps">CGF</span> is able
to load the target we choose at compile time (using <code>--target=</code>) and insert it
in the final executable.</p>
<p>That is done in different ways depending on the type of file we are working
with: Machine Description files are processed by programs (<code>gencodes</code>,
<code>genrecog</code>…) that generate C code files from them, while target description
macros and functions, which are C files, are inserted into the building process
like any other C file.</p>
<p>I’d like to insist here on the fact that the chosen <code>--target</code> is the only one
that gets processed and loaded; all the other possible targets are ignored.
The build process is not going to complain if a target is broken, as long as
it isn’t the target we chose. It just doesn’t care.</p>
<h4 id="md">Machine Description files</h4>
<p>Machine Description files (<code>.md</code> extension) let us define <code>insn</code> patterns,
which are incomplete <span class="caps">RTL</span> expressions that can be matched against the <code>insn</code>
list generated from the <span class="caps">GIMPLE</span>, plus <code>attributes</code> and other interesting things we
won’t try to decipher here.</p>
<p><code>define_insn</code> is an <span class="caps">RTX</span> we can use to define new <code>insn</code> patterns. It receives
four or five operands:</p>
<ol>
<li>An optional name. It’s going to be used to match against <span class="caps">GIMPLE</span>.</li>
<li>An <span class="caps">RTL</span> template. A vector of <em>incomplete</em> <span class="caps">RTL</span> expressions which describe what
the instruction should look like. <em>Incomplete</em> in this context means it uses
expressions like <code>match_operand</code> or <code>match_operator</code>, which are designed to
match against the <span class="caps">RTL</span> <code>insn</code> list and see if they are compatible or not.</li>
<li>A condition. A final condition to say if the <code>insn</code> matches this pattern or not.</li>
<li>An output template. A string that contains the output assembly code for this
<code>insn</code>. The string can contain special characters like <code>%</code> to define where
the arguments should be inserted. If the output is very complex we can write
C code in this field too.</li>
<li>An optional list of attributes.</li>
</ol>
<p>This is an actual example from the <span class="caps">RISC</span>-V code we are backporting:</p>
<pre><code class="language-lisp">(define_insn "adddi3"
[(set (match_operand:DI 0 "register_operand" "=r,r")
(plus:DI (match_operand:DI 1 "register_operand" "r,r")
(match_operand:DI 2 "arith_operand" "r,I")))]
"TARGET_64BIT"
"add\t%0,%1,%2"
[(set_attr "type" "arith")
(set_attr "mode" "DI")])
</code></pre>
<p>You can see the name <code>adddi3</code> is something like: <code>add</code> + <code>di</code> + <code>3</code>. This means
it’s the <code>add</code> instruction with the <code>di</code> mode and <code>3</code> input arguments. That’s
the way things are named.</p>
<p>The next block is a vector with the <span class="caps">RTL</span> template. If you ignore the
<code>match_operand</code> expressions you can see the template is not very different from
the <span class="caps">RTL</span> example we gave before. In this case it’s something like:</p>
<pre><code class="language-lisp">(set (reg 0)
(plus (reg 1)
(reg 2)))
</code></pre>
<p>It’s basically storing in the first register the result of the addition of the
other two.</p>
<p>The next field is the condition. In this case it needs to have <code>TARGET_64BIT</code>
defined in order to work because the machine mode is <code>DI</code> (we’ll explain that soon).</p>
<p>The output code is simple, just a <span class="caps">RISC</span>-V <code>add</code> instruction:</p>
<pre><code class="language-asm">add %0,%1,%2
</code></pre>
<p>Where <code>%N</code> is going to be replaced by the register numbers used as arguments
for this instruction.</p>
<p>The last field is the list of attributes, which can be used to define the instruction
size and other kinds of things. We are not going to focus on them today.</p>
<h5 id="mm">Machine modes</h5>
<p>Machine modes are a way to describe the size of a data object and its representation.</p>
<ul>
<li><span class="caps">QI</span>: quarter integer (one byte)</li>
<li><span class="caps">HI</span>: half integer (two bytes)</li>
<li><span class="caps">SI</span>: single integer (four bytes)</li>
<li><span class="caps">DI</span>: double integer (eight bytes)</li>
<li><span class="caps">SF</span>: single precision floating point (four bytes)</li>
<li><span class="caps">DF</span>: double precision floating point (eight bytes)</li>
</ul>
<p>And so on.</p>
<p>The standard <code>insn</code> names include machine modes to describe what kind of
instruction they are. The example above is <code>adddi3</code>, meaning it uses the <code>di</code>
machine mode: double integer. That’s why it needs the target to be a 64 bit
<span class="caps">RISC</span>-V machine.</p>
<p>Machine modes also appear in some <span class="caps">RTL</span> expressions like <code>plus</code> or
<code>match_operand</code> meaning that they operate in that machine mode, that is, with
that data size and representation. For example <code>(plus:SI ...)</code>.</p>
<h5 id="rtl-templates"><span class="caps">RTL</span> Templates</h5>
<p><code>match_*</code> expressions are what make <span class="caps">RTL</span> expressions <em>incomplete</em>, because they
are designed to be compared against the <code>insn</code> list that comes from the
previous step.</p>
<p>In the example above we had:</p>
<pre><code class="language-lisp">(set (match_operand:DI 0 "register_operand" "=r,r")
(plus:DI (match_operand:DI 1 "register_operand" "r,r")
(match_operand:DI 2 "arith_operand" "r,I")))
</code></pre>
<p><code>(match_operand N predicate constraint)</code> is a placeholder for an operand number
<code>N</code> of the <code>insn</code>. When the <code>insn</code> is constructed, the <code>match_operand</code> will be
replaced by the corresponding operand of the <code>insn</code>. When the template is
trying to match an <code>insn</code> the <code>match_operand</code> forces the operand number <code>N</code> to
match the <code>predicate</code> in order to make the <code>insn</code> match the template.
The <code>match_*</code> expressions are what define what <code>insn</code>s should look like.</p>
<p>The <code>predicate</code> is the name of a function to be called. The function receives two
input arguments: an expression and a machine mode. If the function returns <code>0</code>
the operand does not match.</p>
<p><code>predicate</code>s can also be combined in Machine Description files like this:</p>
<pre><code class="language-lisp">(define_predicate "arith_operand"
(ior (match_operand 0 "const_arith_operand")
(match_operand 0 "register_operand")))
</code></pre>
<p>So the <code>arith_operand</code> shown in the example above can be a
<code>const_arith_operand</code> <em>or</em> (that’s what <code>ior</code> means) a <code>register_operand</code>.
They can be more complex but this is more than enough to understand how they
are built. In the end, they always check against C functions, but you can
combine them with the convenience of the Machine Description files.</p>
<p>The <code>constraint</code> allows us to fine-tune the matching. Constraints define whether
the argument is in a register or in memory and that kind of thing. <code>r</code>, for example,
means the operand comes from a register.</p>
<p>There are other matching expressions too, but <code>match_operand</code> is the most used
one and it’s the one that explains this concept of <em>incomplete</em> expressions the best.</p>
<h4 id="target-desc">Target description macros and functions</h4>
<p>Apart from the machine descriptor files, there are other files involved. For
example, the constraints defined above need to be defined in code somewhere.</p>
<p>The most important of these are the target description macros and functions,
normally defined in <code>gcc/config/$TARGET/$TARGET.h</code> and
<code>gcc/config/$TARGET/$TARGET.c</code>. The <code>.c</code> should
initialize the <code>targetm</code> variable, which contains all the machine information
relevant to the compiler. It is initialized like this:</p>
<pre><code class="language-c">struct gcc_target targetm = TARGET_INITIALIZER;
</code></pre>
<p>That <code>TARGET_INITIALIZER</code> is a huge macro, defined in <code>gcc/target-def.h</code>, that
initializes the <code>targetm</code> structure. This macro is split into smaller macros with
reasonable defaults that may be overridden piece by piece. Each target should have
a file that includes both <code>target.h</code> and <code>target-def.h</code>, overrides any
inappropriate default by redefining the relevant macros, and ends with the
initialization line we just introduced. This is normally done in
<code>gcc/config/$TARGET/$TARGET.c</code>, while the <code>.h</code> is normally used to define some
macros that are needed in the <code>.c</code> file.</p>
<p>As a reference, the <span class="caps">RISC</span>-V code we need to backport (see
<code>gcc/config/riscv/riscv.c</code>) uses that file to describe the number of registers,
their types and sizes, and many other things.</p>
<p>All the information contained in <code>targetm</code> is used by the compiler to decide
how registers have to be allocated, which ones take preference, their costs,
and many other things.</p>
<h3 id="assembly">Assembly code generation</h3>
<p>Having the previous step clear is enough to understand how assembly
generation works. Each of the <code>insn</code>s in the list obtained from <span class="caps">GIMPLE</span> is
compared against the <span class="caps">RTL</span> templates and the best match is
chosen. Once the match is chosen, the corresponding assembly is
generated from the output template field of the <code>define_insn</code> <span class="caps">RTL</span> expression.</p>
<p>As simple as that, but also that complex.</p>
<p>Why do I say it’s complex? Because many things have to be considered and <span class="caps">GCC</span>
does consider them. Each instruction has a size, which has to be considered to
calculate addresses, but each also has an associated execution time, and <span class="caps">GCC</span>
calculates the best matches to make the final assembly file as optimal as possible.</p>
<p>The <span class="caps">RTL</span> step has a lot of optimization passes, too. It’s a complex step but
it’s not really important for us because we just need to make a temporary
compiler that lets us compile a better one. It doesn’t really matter if it’s
not perfect, at least at this point.</p>
<h3 id="summary">Summary</h3>
<p>So, in summary, the process is the following:</p>
<ol>
<li>The <span class="caps">HLL</span> language is parsed to a tree, normally <span class="caps">GENERIC</span>.</li>
<li><span class="caps">GENERIC</span> is converted to <span class="caps">GIMPLE</span>.</li>
<li><span class="caps">GIMPLE</span> optimizations are applied.</li>
<li><span class="caps">GIMPLE</span> is matched to an <code>insn</code> list using pattern names.</li>
<li>The <code>insn</code> list is matched against the <span class="caps">RTL</span> templates defined in the Machine
Description files. </li>
<li><span class="caps">RTL</span> optimizations are applied.</li>
<li>The matches convert the <span class="caps">RTL</span> to assembly code, also taking into account the
information obtained from the target description macros and functions.</li>
</ol>
<p>From our perspective, the most important things to remember are these:</p>
<ul>
<li>The front-end is not very relevant for us; everything from parsing to <span class="caps">GIMPLE</span> we can
ignore for the moment.</li>
<li>The <span class="caps">RTL</span> step is pretty complex, and the <span class="caps">GIMPLE</span>-><span class="caps">RTL</span> conversion is too.</li>
<li><span class="caps">GCC</span> is a compiler collection with a very powerful compilation process,
the Compiler Generation Framework (<span class="caps">CGF</span>), which modularizes the code and
makes it easier to port.</li>
<li>The machine description files and the target definition macros and functions
are designed to make the porting process simpler. Those are the only files we
need to touch.</li>
</ul>
<hr>
<div style="
text-align: center;
font-size: smaller;
padding-left: 3em;
padding-right: 3em;
padding-top: 1em;
padding-bottom: 1em;
border-top: 1px solid var(--border-color);
border-bottom: 1px solid var(--border-color)">
If you like my job here, consider hiring <a href="https://elenq.tech">ElenQ
Technology</a>. <br> Even if I’m busy with this, I still have some time slots
available.
</div>
<hr>
<h3 id="job">My job in the backport</h3>
<p>With all the process more or less clear, we can be more specific about the job I
need to do. I share some specifics in this section, so if you like reading code
you are going to have some fun<sup id="fnref:examples"><a class="footnote-ref" href="#fn:examples">7</a></sup>.</p>
<p>First, I need to make sure all the used <span class="caps">RTL</span> expressions are compatible with the
old version of the compiler. If they are not, I have to translate them to the
old way of writing them. Some examples of this are iterators like
<a href="https://github.com/riscv-collab/riscv-gcc/blob/ca312387ab141060c20c388d83d6fc4b2099af1d/gcc/config/riscv/riscv.md?plain=1#L342"><code>(define_int_iterator ...)</code></a>, which is not available in old <span class="caps">GCC</span>
versions, so I need to unfold a couple of loops by hand and make them use only
the old constructs.</p>
<p>Second, I need to convert the target description macros and functions from the
more <a href="https://github.com/riscv-collab/riscv-gcc/blob/ca312387ab141060c20c388d83d6fc4b2099af1d/gcc/config/riscv/riscv.c">modern C++-based <span class="caps">API</span></a> the recent port uses to the old internal
C-based one. These changes involve many layers and I haven’t yet
analyzed them in detail. They can be simple, like converting from the <code>rtx_insn</code>
class to <code>rtx</code>, the older way to do this. But they can also be complex, like
removing around 40% of the <code>#include</code> directives from <code>riscv.c</code>, which has many
that were not available in the past. It’s going to be a lot of fun, I predict.</p>
<p>Third, as this whole compilation process is complex, I decided to make it as
accessible as possible, so other people can audit and replicate my work. For
that I’m using Guix, my package manager of choice. I added a <a href="https://github.com/ekaitz-zarraga/gcc/blob/guix_package/guix.scm"><code>guix.scm</code></a>
and <a href="https://github.com/ekaitz-zarraga/gcc/blob/guix_package/channels.scm"><code>channels.scm</code></a> file to the repository so my work can be
replicated precisely by myself in the future, or by others<sup id="fnref:github"><a class="footnote-ref" href="#fn:github">8</a></sup>.</p>
<p>The Guix package also provides a better interaction with the building process
of <span class="caps">GCC</span>, letting us replace inputs in a very simple way. I’m thinking about the
next steps of the project here, when we need to compile my backported compiler
with TinyCC, test if it works and then patch TinyCC until it does. Having the
<code>guix.scm</code> file makes it easy to replace the current compiler with a patched
TinyCC and ensures nothing is interfering in the compilation process because
the compilation is done in an isolated container.</p>
<p>That’s mostly the job I need to do in the backport.</p>
<p>Something to keep in mind is that we don’t need to make it perfect, we just
need it to work. The backported <span class="caps">GCC</span> is not going to be used as a production
compiler, but just as a bridge with the next <span class="caps">GCC</span> version, so <strong>there’s only one
program it needs to be able to compile correctly: <span class="caps">GCC</span> 7</strong>. Once we make the
bridge with the next version, we can use that to compile anything we want.</p>
<h3>Last words</h3>
<p>I know this post is long, and the lack of proper diagrams makes everything a little
bit hard to understand. That’s exactly how I felt reading about <span class="caps">GCC</span>, but the
difference was I had to read documentation that is… about 100 times
longer than this post (see <a href="#more">Learn more</a> below). Not that bad after all.</p>
<p>There are many things I decided to leave out, like peephole optimizations,
instruction attributes, and some other constructs that are not that important
from my perspective. You may want to do your own research on those.</p>
<p>In any case, if you have any questions you can always contact me<sup id="fnref:contact"><a class="footnote-ref" href="#fn:contact">9</a></sup> and
ask them, or send me some words of support.</p>
<p>In the next post I’ll describe a little bit about <span class="caps">ELF</span>, the executable and
linkable format, just the bare minimum to understand the format, as it will be
relevant for us in the future. And you might be thinking, why is it relevant if
<span class="caps">GCC</span> compiles to assembly? Well, that’s one of the questions that we will be
answering in the next post.</p>
<p>Now I leave you with a couple of interesting links in the next section.</p>
<p>Good luck with your ports!</p>
<h3 id="more">Learn more</h3>
<ul>
<li><a href="https://gcc.gnu.org/onlinedocs/gccint/">The <span class="caps">GCC</span> internals documentation</a>: if
you are interested in my work you should read an older version of the
documentation. See <a href="#disclaimer">Disclaimer</a>.</li>
<li><a href="https://gcc.gnu.org/git.html">The <span class="caps">GCC</span> source code</a>: of course, this has
everything you need to understand <span class="caps">GCC</span>, but the problem is that <span class="caps">GCC</span> is a huge
codebase; you probably need to spend months reading it to understand
everything. That’s why I think posts like this one are interesting: they help
you focus on the parts you are interested in.</li>
</ul>
<div class="footnote">
<hr>
<ol>
<li id="fn:language">
<p>When calling <code>gcc</code> you can choose which language you are compiling
using <code>-x language</code> option or you can let <code>gcc</code> guess from the extension. <a class="footnote-backref" href="#fnref:language" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:java">
<p>And also Java in the past! <a class="footnote-backref" href="#fnref:java" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:rtl-opt">
<p>The <span class="caps">RTL</span> optimizer contains many steps, most of them being target
independent. That doesn’t really matter here, but those are not generated but
copied from <span class="caps">GCC</span>’s source, as <span class="caps">GIMPLE</span> is. <a class="footnote-backref" href="#fnref:rtl-opt" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:tinycc-asm">
<p>Other compilers have different approaches for this. For example,
TinyCC generates machine code directly, without the intermediate assembly
file generation step, and is also able to link the files by itself. <a class="footnote-backref" href="#fnref:tinycc-asm" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
<li id="fn:post-long">
<p>This post is already long enough and we only made the
introduction. <a class="footnote-backref" href="#fnref:post-long" title="Jump back to footnote 5 in the text">↩</a></p>
</li>
<li id="fn:fortran">
<p>The case of <span class="caps">FORTRAN</span> is a little bit weird, as it generates its own
representation that is later converted to <span class="caps">GENERIC</span>; we don’t really care about
this at this point. <a class="footnote-backref" href="#fnref:fortran" title="Jump back to footnote 6 in the text">↩</a></p>
</li>
<li id="fn:examples">
<p>I’ll link to some examples of the code on <span class="caps">RISC</span>-V’s GitHub account.
This code is already merged in <span class="caps">GCC</span>. <a class="footnote-backref" href="#fnref:examples" title="Jump back to footnote 7 in the text">↩</a></p>
</li>
<li id="fn:github">
<p>I’m hosting this on GitHub at the moment because the repository is
huge. I’ll probably move all this to my server and edit the post after that. <a class="footnote-backref" href="#fnref:github" title="Jump back to footnote 8 in the text">↩</a></p>
</li>
<li id="fn:contact">
<p>You can find my contact info in the <a href="/pages/about.html">About
page</a>. <a class="footnote-backref" href="#fnref:contact" title="Jump back to footnote 9 in the text">↩</a></p>
</li>
</ol>
</div>
<h3>Intro to GCC bootstrap in RISC-V</h3>
<p><em>2022-02-14, by Ekaitz Zárraga</em></p>
<p>Introduction to my new adventure bootstrapping <span class="caps">GCC</span> for <span class="caps">RISC</span>-V. Why, how,
and who is going to pay for it.</p><p>You probably already know about how I spent more than a year having fun with
<span class="caps">RISC</span>-V and software bootstrapping from source.</p>
<p>As some may know from my <a href="https://fosdem.org/2022/schedule/event/riscvadventures/"><span class="caps">FOSDEM</span> talk</a>, <a href="https://nlnet.nl/project/GNUMes-RISCV/">NLNet / <span class="caps">NGI</span>-Assure put the
funds</a> to make me spend more time on this for this year and I decided
to work on <span class="caps">GCC</span>’s bootstrapping process for <span class="caps">RISC</span>-V.</p>
<h3>Why <span class="caps">GCC</span></h3>
<p><span class="caps">GCC</span> is probably the most used compiler collection, period. With <span class="caps">GCC</span> we can
compile the world and have a proper distribution directly from source, but who
compiles the compiler?<sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup></p>
<p>Well, someone has to.</p>
<h3>The bootstrap</h3>
<p>Bootstrapping a compiler with a long history like <span class="caps">GCC</span> for a new architecture
like <span class="caps">RISC</span>-V involves some complications, starting on the fact that the first
version of <span class="caps">GCC</span> that supports <span class="caps">RISC</span>-V needs a C++98 capable compiler in order to
build. C++98 is a really complex standard, so there’s no way we can bootstrap a
C++98 compiler at the moment for <span class="caps">RISC</span>-V. The easiest way we can think of at
this point is to use an older version of <span class="caps">GCC</span> for that, one of those that are
able to build C++98 programs but only require a C compiler to build. Older
versions of <span class="caps">GCC</span>, of course, don’t have <span class="caps">RISC</span>-V support so… We need a
<em>backport</em><sup id="fnref:2"><a class="footnote-ref" href="#fn:2">2</a></sup>.</p>
<p>So that’s what I’m doing right now. I’m taking an old version of <span class="caps">GCC</span> that only
depends on C89 and is able to compile C++98 code and I’m porting it to <span class="caps">RISC</span>-V
so we can build newer GCCs with it.</p>
<p>Only needing C to compile is a huge improvement because there are <em>Tiny C
Compilers</em> out there that can compile C to <span class="caps">RISC</span>-V, and those are written using
simple C that we can bootstrap with simpler tools of a more civilized world.</p>
<p>In summary:</p>
<ul>
<li>C++98 is too complex, but C89 is fine.</li>
<li><span class="caps">GCC</span> is the problem and also the solution.</li>
</ul>
<h3>What about <span class="caps">GNU</span> Mes?</h3>
<p>When <em>we</em><sup id="fnref:3"><a class="footnote-ref" href="#fn:3">3</a></sup> started with this effort we wanted to prepare <span class="caps">GNU</span> Mes, a small C
compiler that is able to compile a <em>Tiny C Compiler</em>, to work with <span class="caps">RISC</span>-V so we
could start to work in this bootstrap process from the bottom.</p>
<p>Some random events, like someone else working on that part, made us rethink our
strategy so we decided to start from the top and try to combine both efforts at
the end. We share the same goal: full source bootstrap for <span class="caps">RISC</span>-V.</p>
<h3>Tiny C Compilers?</h3>
<p>There are many small C compilers out there that are written in simple C and are
able to compile an old <span class="caps">GCC</span> that is written in C. Our favorite is TinyCC (Tiny C Compiler).</p>
<p><span class="caps">GNU</span> Mes is able to build a patched version of TinyCC, which already supports
<span class="caps">RISC</span>-V (<span class="caps">RV64</span> only), and we can use that TinyCC to compile the <span class="caps">GCC</span> version I’m backporting.</p>
<p>We’d probably need to patch some things in both projects to make everything
work smoothly but that’s also included in the project plan.</p>
<h3>Binutils</h3>
<p>Binutils is also a problem mostly because <span class="caps">GCC</span>, as we will talk about in the
future, does not compile to binary directly. <span class="caps">GCC</span> generates assembly code and
coordinates calls to <code>as</code> and <code>ld</code> (the <span class="caps">GNU</span> Assembler and Linker) to generate
the final binaries. Thankfully, TinyCC can act as an assembler and a linker,
and there’s also the chance to compile a modern binutils version because it is
written in C.</p>
<p>In any case, the binary file generation and support must be taken into account,
because <span class="caps">GCC</span> is not the only actor in this film and <span class="caps">RISC</span>-V has some weird things
in its assembly and binaries that have to be supported correctly.</p>
<h3>Conclusion</h3>
<p>This is a very interesting project, where I need to dig into <strong><span class="caps">BIG</span></strong> stuff, which
is cool, but also has a huge level of uncertainty, which scares the hell out of
me. I hope everything goes well…</p>
<p>In any case, I’ll share everything I learn here in the blog and keep you all posted
with the news we have.</p>
<p>That’s all for this time. If you have any questions or comments or want to share
your thoughts and feelings with me<sup id="fnref:5"><a class="footnote-ref" href="#fn:5">5</a></sup> you can find my
<a href="https://ekaitz.elenq.tech/pages/about.html">contact information here</a>.</p>
<hr>
<blockquote>
<p><span class="caps">PS</span>: Big up to NLNet / <span class="caps">NGI</span>-Assure for the money.</p>
</blockquote>
<style>
.container{
display: flex;
flex-flow: row wrap;
justify-content: center;
gap: 40px;
}
.no-side-margin{
margin: 0px;
}
</style>
<div class="container">
<img class="no-side-margin" src="https://ekaitz.elenq.tech/nlnet.svg" width=200px>
<img class="no-side-margin" src="https://ekaitz.elenq.tech/NGIAssure.svg" width=200px>
</div>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p><em>wHo wATcHes tHE wAtchMEN?</em> <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:2">
<p>Insert “Back to the Future” music here. <a class="footnote-backref" href="#fnref:2" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:3">
<p><span class="dquo">“</span><em>We</em>” means I shared my thoughts and plans with other people who have a
much better understanding of this than myself. <a class="footnote-backref" href="#fnref:3" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:4">
<p>But there are some others that are really interesting (see
<a href="https://sr.ht/~mcf/cproc/">cproc</a>, for example) <a class="footnote-backref" href="#fnref:4" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
<li id="fn:5">
<p>Or even hire me for some freelance <span class="caps">IT</span> stuff 🤓 <a class="footnote-backref" href="#fnref:5" title="Jump back to footnote 5 in the text">↩</a></p>
</li>
</ol>
</div>Lessons learned on machine code generation2021-06-16T00:00:00+03:002021-06-16T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2021-06-16:/machine-code-generation.html<p>A summary of the lessons I learned about machine code generation during my
work at Lightening, Hex0 and all my recent research on compilers.</p><p>Machine code generation sounded like weird magic to me half a year ago, I
swear, but now it doesn’t look so disturbingly complicated. <em>Nothing in
computer science is that complicated, after all</em>.</p>
<ol>
<li><a href="#basics">Basics</a><ol>
<li><a href="#what">Machine code is numbers</a><ol>
<li><a href="#demo">Demonstration</a></li>
</ol>
</li>
<li><a href="#talk">Calling convention</a></li>
<li><a href="#protections">Memory protections</a></li>
<li><a href="#jit">Just-in-Time Compilation</a><ol>
<li><a href="#lightening">Example: Lightening: Guile’s machine code generation library</a></li>
</ol>
</li>
</ol>
</li>
<li><a href="#problems">Lessons learned</a><ol>
<li><a href="#large-imm">Problem: Large immediates</a><ol>
<li><a href="#multi-inst">Solution: multiple instruction expansion</a></li>
<li><a href="#constants">Solution: constant insertion</a></li>
</ol>
</li>
<li><a href="#addr-off">Problem: Unknown addresses and offsets</a><ol>
<li><a href="#relocs">Solution: relocations</a><ol>
<li><a href="#relocs-c">Example: C compilers</a></li>
<li><a href="#relocs-lightening">Example: Lightening</a></li>
</ol>
</li>
</ol>
</li>
<li><a href="#jumps">Problem: Long jumps</a><ol>
<li><a href="#always-largest">Solution: always insert the largest jump possible</a><ol>
<li><a href="#relaxation">Optimization: pointer relaxation</a></li>
<li><a href="#relax-example">Example: relaxed global variable access in C compilers</a></li>
</ol>
</li>
<li><a href="#veneer">Solution: Veneers</a><ol>
<li><a href="#lightening-veneer">Example: Lightening’s veneer system</a></li>
</ol>
</li>
</ol>
</li>
<li><a href="#reg-access">Problem: Register Access</a><ol>
<li><a href="#stack">Solution: use the stack</a></li>
<li><a href="#controlled-regs">Solution: controlled register access</a></li>
</ol>
</li>
</ol>
</li>
<li><a href="#final">Final thoughts</a></li>
</ol>
<h3 id="basics">Basics</h3>
<p>There are many contexts where you may need to generate machine code: if you are
writing a compiler, an assembler, a <span class="caps">JIT</span> compiler… In the last months I’ve
been working on Lightening, a machine code generation library that powers
Guile’s <span class="caps">JIT</span> Compilation, a <span class="caps">RISC</span>-V assembler and interpreter and Hex0, which was
introduced in the <a href="https://ekaitz.elenq.tech/hex0.html">previous post</a>, where I
needed to assemble a file by hand.</p>
<p>All of those cases result in the same thing, even if they have different
conditions: we are generating machine code.</p>
<p>In this post I’ll try to talk about some issues that are generic and apply to
all the cases and others that are more specific to some of the projects I mention.</p>
<p>But first we need to clarify some stuff just in case.</p>
<h4 id="what">Machine code is numbers</h4>
<blockquote>
<p>Machine code is what people from the electronics world call “code”.</p>
</blockquote>
<p>I know you know it, but let’s refresh some things about computing we may have
forgotten thanks to all the efforts that <a href="https://ekaitz.elenq.tech/hiding-the-complexity.html">hide the
complexity</a> of our
everyday business.</p>
<p>Machine code instructions are basically blocks of bits your processor is
reading and interpreting. Those bit blocks encode all the information the
processor needs: the identifier of the instruction and its arguments.</p>
<p>The identifier is normally known as <em>opcode</em>. The arguments can have many
different meanings depending on the instruction, so we are not getting into
that. The instructions normally alter the values of registers, so they need to
have identifiers for the source and destination registers, or literal values
that are introduced literally inside of the instruction (they are called
<em>immediates</em>).</p>
<p>Let’s put a simple <span class="caps">RISC</span>-V example here. Consider this assembly instruction:</p>
<pre class="highlight"><code class="language-asm">addi a0, zero, 56
</code></pre>
<p>This thing you interpret as some assembly instruction that adds <code>56</code> to the
<code>zero</code> register and stores the result in the <code>a0</code> register, has to be encoded
in a way that the machine is able to understand. Better said, it is encoded in
a way that <strong>you</strong> can understand! The real instruction is a bunch of bits that
represent the same thing.</p>
<p><span class="caps">RISC</span>-V base <span class="caps">ISA</span> has various instruction formats, which depend on the goal of
the instruction. This one is from the <code>I</code> format, because it includes an
<em>immediate</em>. Read it and compare with the following:</p>
<ul>
<li>First the <em>opcode</em>, <code>addi</code> for you, has a binary counterpart: <code>0010011</code>. 7
bits for this instruction format.</li>
<li>Then the destination register, <code>a0</code>, has a binary representation: <code>01010</code>.
There are 32 registers in <span class="caps">RISC</span>-V so each of them is represented by a 5-bit value.</li>
<li>There’s some extra space for an opcode-like field called <code>funct3</code>: <code>000</code></li>
<li>Then there’s the source register, <code>zero</code>, which is: <code>00000</code>. Again 5 bits.</li>
<li>And the <em>immediate</em> you are adding, <code>56</code>, which is just the binary
representation of <code>56</code>: <code>000000111000</code>. It’s 12 bits wide for this
instruction format.</li>
</ul>
<p>Putting it all together:</p>
<pre class="highlight"><code class="language-asm">000000111000 | 00000 | 000 | 01010 | 0010011
</code></pre>
<p>So this forms the following binary value:</p>
<pre class="highlight"><code class="language-asm">00000011100000000000010100010011
</code></pre>
<p>Or in hex:</p>
<pre class="highlight"><code class="language-asm">0x03800513
</code></pre>
<blockquote>
<p>Just in case you didn’t realize: you just assembled an instruction by hand.</p>
</blockquote>
<p>That instruction we just created is going to be processed by the machine,
reading each of the fields and activating its circuits as it needs according to
the voltage levels those values represent.</p>
<p>In this case, it’s going to activate the <span class="caps">ALU</span> to add the numbers and all
that kind of thing, but in other cases it may just change the value of the
program counter or whatever. All this is executed by the circuitry of the
device, right after it loads the instruction.</p>
<p>That’s for the machine, but for us, from the perspective of a programmer,
instructions are just numbers, as we just saw.</p>
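<p>The field packing above can be sketched in C. This is just an illustration (the <code>encode_itype</code> helper is hypothetical, not part of any project mentioned here), but it reproduces the exact word we computed by hand:</p>
<pre class="highlight"><code class="language-clike">#include<stdint.h>
#include<stdio.h>

/* Pack a RISC-V I-type instruction from its fields. */
uint32_t encode_itype(uint32_t imm12, uint32_t rs1, uint32_t funct3,
                      uint32_t rd, uint32_t opcode) {
    return ((imm12 & 0xFFF) << 20) | ((rs1 & 0x1F) << 15)
         | ((funct3 & 0x7) << 12)  | ((rd & 0x1F) << 7)
         | (opcode & 0x7F);
}

int main(void) {
    // addi a0, zero, 56: rd = a0 (x10), rs1 = zero (x0), opcode = 0010011
    printf("0x%08x\n", encode_itype(56, 0, 0, 10, 0x13)); // 0x03800513
    return 0;
}
</code></pre>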
<h5 id="demo">Demonstration</h5>
<p>I purposely used <em>machine</em> to refer to the device that runs our instructions,
but we have to be more specific about it now.</p>
<p>I’m going to talk specifically about modern (and common) microprocessors,
because other devices may have peculiarities that can sidetrack us too
hard<sup id="fnref:harvard"><a class="footnote-ref" href="#fn:harvard">1</a></sup>.</p>
<p>In our modern and common microprocessor, <a href="https://en.wikipedia.org/wiki/Von_Neumann_architecture">instructions are located in the
memory</a>. But that’s
nothing we didn’t know! If we run a binary it’s loaded in the memory and
executed from there. We all know that!</p>
<p>But you might be surprised to a certain extent if we stretch that a little bit.</p>
<p>Well, we know from the previous part that instructions are just numbers, and we
know that they are loaded from memory, so let’s do some C black magic and see
what happens:</p>
<pre class="highlight"><code class="language-clike">#include<stdint.h>
#include<stdio.h>
typedef int f0(void);
int main(int argc, char* argv[]){
    uint32_t instructions[2];
    instructions[0] = 0x03800513; // addi a0, zero, 56
    instructions[1] = 0x00008067; // jalr zero, ra, 0
    f0 *load_56 = (f0*) instructions; // Reinterpret the array address
                                      // as a function
    int a = load_56();
    printf("%d\n", a);
}
</code></pre>
<p>In that example we build an array of two values. The first one corresponds to
the instruction we encoded by hand before and the second corresponds to <code>jalr
zero, ra, 0</code>, the return instruction, which you can encode yourself.</p>
<p>After that we convert the address of the array to a function that returns an
integer and… Boom! We execute the array of numbers.</p>
<p>The code only works on <span class="caps">RISC</span>-V, but don’t worry, I can tell you that it prints
<code>56</code>.</p>
<p>So it was true that the machine can execute stuff from the memory, but what we
may not know is that for the machine there’s no actual distinction between
instructions and data<sup id="fnref:lisp"><a class="footnote-ref" href="#fn:lisp">2</a></sup>. We just executed an array of numbers!</p>
<p>The machine doesn’t care. If it looks like instructions, it executes them.</p>
<p>You can try to put random values in the array and try to execute them, too.
An <code>Illegal instruction</code> error is going to happen, probably. If you are lucky
you may execute something by accident, who knows.</p>
<p>But how did this thing work that well? Why did it return the value correctly
and all that?</p>
<h4 id="talk">Calling convention</h4>
<p>The code worked because we are following the <span class="caps">RISC</span>-V <span class="caps">ABI</span>, the same that C is
following in the example. It tells us how we need to pass arguments to
functions, how to return, and all that. The part of the <span class="caps">ABI</span> that defines how to
call and return from functions is called <em>calling convention</em>.</p>
<p>I’m not going to extend a lot talking about this, but I will just say that
<span class="caps">RISC</span>-V has some registers to pass arguments on: <code>a0</code>, <code>a1</code>…<code>a7</code>. And those
registers are also used for return values.</p>
<p>In the example we don’t take any arguments so we don’t need to read any register,
but we return one value by writing it in <code>a0</code>.</p>
<p>With what you know, you can now create a function that gets an input argument
and adds an <em>immediate</em> to it. Why don’t you try?</p>
<p>On the other hand, the <span class="caps">RISC</span>-V <span class="caps">ABI</span> defines a register called <code>ra</code> that
contains the Return Address, so we need to jump to it if we want to finish our
function execution.</p>
<p>There are many more things you can read about, but this is enough to begin.</p>
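<p>If you try the exercise above, the instruction you need is an <code>addi</code> that reads and writes <code>a0</code>. Here is a sketch that only checks the encoding, since it can run on any machine (the resulting word, followed by the <code>jalr zero, ra, 0</code> return, would form an add-5 function on real <span class="caps">RISC</span>-V hardware):</p>
<pre class="highlight"><code class="language-clike">#include<stdint.h>
#include<stdio.h>

int main(void) {
    // addi a0, a0, 5: imm=5, rs1=a0 (x10), funct3=000, rd=a0 (x10)
    uint32_t insn = (5u << 20) | (10u << 15) | (0u << 12)
                  | (10u << 7) | 0x13u;
    printf("0x%08x\n", insn);
    return 0;
}
</code></pre>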
<h4 id="protections">Memory protections</h4>
<p>The C example where we executed an array is correct, it runs and all that, but
the reality is that different parts of memory have different permissions.</p>
<p>Code in memory is normally read-only and executable, and data can be read-only
or not, depending on the goal it has (constant or variable).</p>
<p>If you think about the example above, once the array is set, we can overwrite
it later, or even write to it from the instructions we inserted in it. This could
lead to security issues or unexpected results. That’s why code is normally
read-only and any attempt to write it will raise an exception to the kernel.</p>
<p>There are several ways to identify a memory block as code: the <span class="caps">RISC</span>-V assembly
(and many others) uses the <code>.text</code> directive which automatically sets the block
as a read-only block that can be executed; the <code>mmap</code> Linux system call needs
some flags to indicate the protections on the memory block (<code>PROT_EXEC</code>,
<code>PROT_READ</code>, <code>PROT_WRITE</code>…); etc.</p>
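<p>As a minimal sketch of the <code>mmap</code> approach (assuming a <span class="caps">POSIX</span> system; no code is executed here, we only flip the permissions of a page holding an instruction word as plain data):</p>
<pre class="highlight"><code class="language-clike">#include<stdint.h>
#include<stdio.h>
#include<sys/mman.h>

int main(void) {
    // Ask the kernel for one writable page
    size_t len = 4096;
    uint32_t *page = mmap(NULL, len, PROT_READ | PROT_WRITE,
                          MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (page == MAP_FAILED) return 1;

    page[0] = 0x03800513;   // fill it with instructions, as plain data

    // Seal it: readable (add PROT_EXEC to make it executable code);
    // any write from now on raises an exception to the kernel
    if (mprotect(page, len, PROT_READ) != 0) return 1;

    printf("0x%08x\n", page[0]);
    munmap(page, len);
    return 0;
}
</code></pre>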
<h4 id="jit">Just-in-Time Compilation</h4>
<p>Just-in-time (<span class="caps">JIT</span>) Compilation is a way to execute programs that involve a
compilation step at runtime. Typically this happens on interpreted programs,
where the interpreter consumes part of the execution time. An interpreter with
a <span class="caps">JIT</span> Compilation feature is able to compile parts of the code it’s going to
run to machine code and speed up the execution of those parts.</p>
<p>Clever interpreters are able to predict if the time they need to compile and
execute the <span class="caps">JIT</span> Compiled parts is less than the time they need to interpret
them, so they can decide if it’s worth the effort.</p>
<p>Normally, the <span class="caps">JIT</span> Compilation is more effective in pieces of code that are
executed many times because the code only needs to be compiled once and the
speed increase is going to be obtained in every execution. But many algorithms
may be defined, and parts of the code may be recompiled looking for different
optimizations while the interpreter collects data about the performance of the program.</p>
<p>Explained like this it looks like it’s a complex thing to do (and it is) but
with the previously mentioned points we can imagine a simple <span class="caps">JIT</span> machine code
generation library. We “just” need to:</p>
<ul>
<li>Know what code to generate (choose a function to compile, this step may need
some code analysis).</li>
<li>Reserve some space (<code>malloc</code>, <code>mmap</code>…)</li>
<li>Fill the space with numbers (the machine code instructions resulting from the
compilation of the function).</li>
<li>Next time the program wants to call the function we compiled, call the
numbers instead (as we did <a href="#demo">in the demonstration</a>).</li>
</ul>
<h5 id="lightening">Example: Lightening, Guile’s machine code generation library</h5>
<p>The just-in-time compilation process in Guile is simple, but effective<sup id="fnref:guile"><a class="footnote-ref" href="#fn:guile">3</a></sup>.
Guile uses a library called Lightening for it. Lightening is a template-like
library that defines a virtual instruction set. That instruction set is
translated by the library to the instruction set of the actual machine.</p>
<p>Implementing support for another architecture is as simple as implementing all
the translation code for the new architecture. That’s <a href="https://ekaitz.elenq.tech/lightening.html">what I’ve been doing
these days</a>.</p>
<p>Guile’s <span class="caps">JIT</span> compiler only needs to call the instructions of the library and
they will generate actual machine code by themselves, packaged in a function
the interpreter will be able to call later.</p>
<p>Lightening is simple because it doesn’t need to compile from source code, or
do code analysis to find which part of the code it needs to compile. It
just exposes an <span class="caps">API</span> that looks like an instruction set, and that’s what
gets translated to machine code.</p>
<p>The <span class="caps">JIT</span> is going to call the <span class="caps">API</span> of Lightening, creating more complex
operations by combining Lightening’s instructions and Lightening is going to
convert those operations to their machine code by a simple translation, filling
the array of numbers and returning its address as a function pointer we can
call later.</p>
<p>Of course, it is much more complex than that because it needs to solve many
other problems we’ll talk about later, but that’s the idea. And the idea
doesn’t sound too difficult, once you have in mind what we talked about previously.</p>
<h3 id="problems">Lessons learned</h3>
<p>There are many problems that a machine code generation library like that can
encounter, but they are not exclusive to those libraries. These kinds of
problems can also appear in compilers, assemblers and many other things.</p>
<p>The lessons I learned come as problems I encountered during these days of
digging and implementing, with some possible solutions or thoughts about them.</p>
<h4 id="large-imm">Problem: Large immediates</h4>
<p>Large <em>immediates</em> are one of the most obvious but boring issues in this world,
and they apply to many cases.</p>
<p>In the <a href="#what">example above</a> we encoded an <code>addi</code> instruction that added <code>56</code>,
an <em>immediate</em>, to a register, and we said the <em>immediate</em> had a 12 bit space
in the instruction. Registers in <span class="caps">RISC</span>-V are 32 bit (in <span class="caps">RV32</span>) or 64 bit (in
<span class="caps">RV64</span>) wide, so we can work with larger values, but we are limited to <code>12</code>-bit
immediates in <code>addi</code> and all the other <code>I</code>-type instructions.</p>
<p>Why is that? Well, <span class="caps">RISC</span>-V instructions are 32 bit and they need to be able to
pack much more information than the <em>immediate</em> they use, so the <em>immediates</em>
can’t be as large as we want. The fixed instruction size is a design decision
that keeps the processor simple, but other processors have other design
decisions<sup id="fnref:x86"><a class="footnote-ref" href="#fn:x86">4</a></sup> around this.</p>
<h5 id="multi-inst">Solution: multiple instruction expansion</h5>
<p>There are several solutions for this, but the most obvious one is to use more
than one instruction to operate on an immediate.</p>
<p>If we want to load a 64-bit value, we can add, shift left, add, shift left,
add… until we fill a whole register with the value we were looking for.</p>
<p>This means a simple addition can be expanded to many instructions. In some
cases they are going to be just a few, but as the <em>immediates</em> get big we may
need more than eight instructions, very well encoded, to write the immediate to
a register and be able to operate with it.</p>
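<p>For the 32-bit case the classic expansion is a <code>lui</code> plus an <code>addi</code>. The tricky part is that <code>addi</code> sign-extends its 12-bit <em>immediate</em>, so the upper part has to compensate. A sketch (the <code>expand_li</code> helper is hypothetical; real assemblers do something like this when expanding the <code>li</code> pseudoinstruction):</p>
<pre class="highlight"><code class="language-clike">#include<stdint.h>
#include<stdio.h>

// Split a 32-bit constant into a lui+addi pair
void expand_li(int32_t value) {
    // Sign-extend the low 12 bits; if bit 11 is set this is negative...
    int32_t lo = ((value & 0xFFF) ^ 0x800) - 0x800;
    // ...so the upper 20 bits absorb the difference
    uint32_t hi = ((uint32_t)(value - lo)) >> 12;
    printf("lui  a0, 0x%x\n", hi);
    printf("addi a0, a0, %d\n", lo);
}

int main(void) {
    expand_li(0x12345FFF);  // bit 11 set: lui 0x12346, addi -1
    return 0;
}
</code></pre>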
<h5 id="constants">Solution: constant insertion</h5>
<p>This is not a solution we can use everywhere, but we can use it in the context
we are in right now (code is stored in memory and all that, remember).
Consider this <span class="caps">RV64</span> code:</p>
<pre class="highlight"><code class="language-clike">auipc t0, 0 // x[t0] = PC + 0
ld t0, 12(t0) // x[t0] = mem[ x[t0] + 12 ]
jal zero, 3 // PC = PC + 3
0xAAAAAAAAAAAAAAAA // This is a 64 bit literal
addi t0, t0, 1 // x[t0] = x[t0] + 1
// What's the value of t0 here?
</code></pre>
<p>The code has some comments on the right that I’m going to use through the whole
post, so get used to them. The <code>x</code> means register access (base registers are
called X registers in <span class="caps">RISC</span>-V), and <code>mem</code> is memory. <code>PC</code> (the program counter) is
written in uppercase and not as if it were a register because it’s not
accessible by the programmer: we need to treat it as a global variable we can
only set using jumps or read using <code>auipc</code>.</p>
<p><span class="caps">RISC</span>-V instructions are 32 bit long (4 bytes), so you can get what the offset
in the <code>ld</code> instruction does, right?</p>
<p>Basically we are loading a doubleword (<code>ld</code>) at the position of the
<code>0xAAAAAAAAAAAAAAAA</code> in the <code>t0</code> register and adding <code>1</code> to it. So the answer
to the question is <code>0xAAAAAAAAAAAAAAAB</code>.</p>
<p>But can you see the trick we are using?</p>
<p>The <code>jal</code> instruction is jumping over the constant so we can’t execute it by
accident (which would cause an <code>Illegal Instruction</code> error), and using the <code>ld</code>
instruction we are able to load a big constant to a register. A constant which
<strong>is mixed with the code</strong>, as any immediate would be, but without being
associated with any instruction.</p>
<p>If we know the code we are generating is a function, we can always wait until
the return instruction and insert all the constants after it, so they are
perfectly separated and we don’t insert jumps to avoid executing the constants
by accident. For that case, we need to change the values of the <code>auipc</code> and the
<code>ld</code> accordingly, making them point to the correct address, which has some
associated issues we need to talk about now.</p>
<hr>
<div style="
text-align: center;
font-size: smaller;
padding-left: 3em;
padding-right: 3em;
padding-top: 1em;
padding-bottom: 1em;
border-top: 1px solid var(--border-color);
border-bottom: 1px solid var(--border-color)">
Keep in mind you can hire <a href="https://elenq.tech">ElenQ
Technology</a> if you like this kind of material. <br/>
We teach with this mixture of passion and awkward charisma. We also code and
research.
</div>
<hr>
<h4 id="addr-off">Problem: Unknown addresses and offsets</h4>
<p>Addresses and offsets are a pain in the ass because you may not know them
when you expect to.</p>
<p>Let’s consider an unconditional jump like the one of the previous example. The
number we introduce is the amount of instructions to jump from the program
counter: an offset. The <em>immediate</em> offset can be positive, for forward jumps,
or negative, for backward jumps.</p>
<pre class="highlight"><code class="language-clike">jal zero, 3 // PC = PC + 3
</code></pre>
<p>Generating this jump assumes that you know where you need to jump: you want to
jump 3 instructions to <em>the future</em>.</p>
<p>But imagine you are assembling a file, a real assembly file that is not an
oversimplification of the assembly, like what we did in the previous example. A
real assembly file with <em>labels</em>:</p>
<pre class="highlight"><code class="language-clike">add a0, a0, t1 // I don't care about this instruction
j end // Unconditional jump to `end`
// Some code here
end: // Label `end`
ret // return
</code></pre>
<p>If you are assembling this file line by line, you can actually assemble the
<code>add</code> in the first line, because you know <em>everything</em> from it, but you are
unable to emit the <code>j end</code> because you don’t know where <code>end</code> is <em>yet</em>.</p>
<p>If this assembly is written in a file you can always preprocess the whole file,
get the labels, associate them with their addresses and then assemble the whole
thing, but you are not always in this situation.</p>
<p>Lightening, for instance, generates the code as you call the <span class="caps">API</span>, so it doesn’t
know where your jump points to until you call the <span class="caps">API</span> for the label later.</p>
<p>Compilers may encounter this issue too, when they are using separate
compilation and linking steps. You must be able to compile one source file on
its own but you may not know where global variables appear, because they
might be in a different file, and you only know that at link time.</p>
<h5 id="relocs">Solution: relocations</h5>
<p>There’s one simple way to solve it: introduce a fake offset or address and
patch it later, when we know the position of the symbol. That’s what
relocations do.</p>
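<p>A toy version of the idea, sketched in C (a hypothetical helper, not the format any real toolchain uses): emit an instruction with a zeroed <em>immediate</em>, remember its position, and patch the field once the value is known:</p>
<pre class="highlight"><code class="language-clike">#include<stdint.h>
#include<stdio.h>
#include<stddef.h>

// Patch the 12-bit immediate field of an already-emitted I-type instruction
void patch_itype_imm(uint32_t *code, size_t idx, int32_t imm) {
    code[idx] = (code[idx] & 0x000FFFFF)          // keep rs1/funct3/rd/opcode
              | (((uint32_t)imm & 0xFFF) << 20);  // write the immediate
}

int main(void) {
    uint32_t code[1];
    // Emit addi a0, zero, 0 -- the immediate is not known yet
    code[0] = (0u << 20) | (0u << 15) | (0u << 12) | (10u << 7) | 0x13u;
    size_t reloc = 0;  // remember what needs patching, like a relocation

    patch_itype_imm(code, reloc, 56);  // later: now we know the value
    printf("0x%08x\n", code[0]);       // same word as addi a0, zero, 56
    return 0;
}
</code></pre>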
<h6 id="relocs-c">Example: C compilers</h6>
<p>Relocations are a mechanism to pass information between the compiler and
the linker; you can actually see them in the object files generated by your
compiler. Make a simple file with a global variable and compile it. Something
like this:</p>
<pre class="highlight"><code class="language-clike">int global_symbol;
int main(int argc, char* argv[]){
    return global_symbol != 0;
}
</code></pre>
<p>If you compile it with <code>gcc -c</code>, you can inspect relocations in the result with
<code>objdump</code>, using the <code>-r</code> flag alongside <code>-d</code> to disassemble. In <span class="caps">RISC</span>-V
you’ll find things like <code>R_RISCV_HI20</code> or <code>R_RISCV_LO12</code> where the relocations
are located. They are ways to encode <em>immediates</em> in <code>U</code>-type instructions and
<code>I</code>-type instructions respectively. In my case I get something like this (it’s
not the full result):</p>
<pre class="highlight"><code class="language-clike"> 6: 00000797 auipc a5,0x0
6: R_RISCV_PCREL_HI20 global_symbol
6: R_RISCV_RELAX *ABS*
a: 00078793 addi a5,a5,0x0
a: R_RISCV_PCREL_LO12_I .L0
a: R_RISCV_RELAX *ABS*
e: 639c ld a5,0(a5)
</code></pre>
<p>There are two types of relocations here, but we are going to talk about
<code>R_RISCV_RELAX</code> later. You can see my relocations have <code>PCREL</code> in the middle:
that just means they are relative to the program counter.</p>
<p>If you just inspect the binary with the <code>-d</code> you won’t see the relocations and
the result will look like nonsense code<sup id="fnref:nonsense-code"><a class="footnote-ref" href="#fn:nonsense-code">5</a></sup>:</p>
<pre class="highlight"><code class="language-clike"> 6: 00000797 auipc a5,0x0 // x[a5] = PC + 0
a: 00078793 addi a5,a5,0x0 // x[a5] = x[a5] + 0
e: 639c ld a5,0(a5) // x[a5] = mem[ x[a5] + 0 ]
</code></pre>
<p>This adds <code>0</code> to the program counter and stores the result in <code>a5</code>, then adds <code>0</code>
to <code>a5</code>, and loads a doubleword into <code>a5</code> from the address in <code>a5</code>. But the
address in <code>a5</code> at the moment of the load is nothing but the program counter at
the <code>auipc</code> instruction. Weird.</p>
<p>The relocation is going to point to the <code>auipc</code> and the <code>addi</code>, and tell the
linker it has to replace the zeros with another value. Which one? The address of
the global variable. If we replace the zeros with a combination that is able to
load the address of the global variable the code will work. That’s what the
relocation does here.</p>
<p>So, as we don’t know where to point, we insert anything (zeros) and we fix the
instructions once we know where they need to point.</p>
<h6 id="relocs-lightening">Example: Lightening</h6>
<p>The same approach is followed in Lightening, and you can follow it in your own
assembler, library or anything that has a similar problem. Let’s consider some
code using Lightening (obtained from <code>tests/beqr.c</code>, comments added by me):</p>
<pre class="highlight"><code class="language-clike">// Make a function that loads two arguments
jit_load_args_2(j, jit_operand_gpr (JIT_OPERAND_ABI_WORD, JIT_R0),
jit_operand_gpr (JIT_OPERAND_ABI_WORD, JIT_R1));
jit_reloc_t r = jit_beqr(j, JIT_R0, JIT_R1); // branch if equal registers
jit_leave_jit_abi(j, 0, 0, align); // end ABI context
jit_reti(j, 0); // return 0
jit_patch_here(j, r); // make the branch jump here
jit_leave_jit_abi(j, 0, 0, align); // end ABI context
jit_reti(j, 1); // return 1
// Obtain the function we created
jit_word_t (*f)(jit_word_t, jit_word_t) = jit_end(j, NULL);
// Test if it works
ASSERT(f(0, 0) == 1); // 0 == 0 so it jumps -> returns 1
ASSERT(f(0, 1) == 0); // 0 != 1 so it doesn't jump -> returns 0
</code></pre>
<p>In this example we see how we generate machine code statement by statement, so
there’s no way to know where the <code>beqr</code> needs to jump until we have generated
all the code before its target.</p>
<p>You see the <code>beqr</code> function doesn’t receive the target address or offset as an
argument, but it returns a <code>jit_reloc_t</code>, which other functions like <code>reti</code>
don’t return.</p>
<p>That <code>jit_reloc_t</code> is what we patch later with <code>jit_patch_here</code>,
indicating where the branch needs to jump. The <code>jit_patch_here</code> function is going
to correct the bits we set to zero because we didn’t know the target at that moment.</p>
<p>There are different kinds of relocations, as it happened in the previous
example with the compilers, because different instruction formats need to be
patched in different ways. In the case of Lightening, the relocation has a type
associated with it, so we can check and act accordingly.</p>
<h4 id="jumps">Problem: Long jumps</h4>
<p>As we saw, some jumps encode the target as an <em>immediate</em>. This has a couple of
implications that we described previously:</p>
<ul>
<li>The jump target could be larger than the space we have for the immediate.</li>
<li>Sometimes we can’t know the target until we reach the position where the jump
points to.</li>
</ul>
<p>Both issues can be combined together in a killer combo. Consider this code:</p>
<pre class="highlight"><code class="language-clike">j label // jump to label
// A lot of instructions here
label:
// this is the target of the jump
</code></pre>
<p>In <span class="caps">RISC</span>-V the <code>j</code> pseudoinstruction is resolved to <code>jal</code>, which has a <code>21</code> bit
(signed) space for the jump target. If we have a hella lot of instructions
between the jump and the target we may need more bits for the jump than the
space we actually have.</p>
<p>Again, in the case where we can preprocess everything there’s no problem, but if
we are assembling the instructions as they come we are going to have issues.
We realize too late that we can’t jump that far, because by the time we reach
the label we have already inserted a 21 bit jump and too many instructions.
Patching the jump is not enough, because we didn’t leave enough space to insert
the offset we need.</p>
<h5 id="always-largest">Solution: always insert the largest jump possible</h5>
<p>There’s an obvious solution: always insert the largest possible jump and patch
the whole jump later.</p>
<p>In <span class="caps">RISC</span>-V <code>jalr</code> jumps to the absolute address that is stored on a register
with an optional 12 bit (signed) offset. Combined with the <code>auipc</code> (add upper
immediate to program counter) it lets us make 32 bit relative jumps in just 2
instructions. Let’s explain that in code just in case:</p>
<pre class="highlight"><code class="language-clike">auipc t0, offset_hi // x[t0] = PC + (offset_hi<<12)
jalr zero, offset_lo(t0) // PC = x[t0] + offset_lo
</code></pre>
<p>If we consider <code>offset</code> as a value we know, we can split it in two blocks: the
highest 20 bits as <code>offset_hi</code> and the lowest 12 bits as <code>offset_lo</code>, and use
them to jump to any address in the 32 bit range from the current position, using
just 2 instructions.</p>
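As a sanity check, the split can be written down in a few lines of Python (a sketch; <code>split_offset</code> is a hypothetical helper, not part of any toolchain). The subtle part is that <code>jalr</code> sign-extends its 12 bit immediate, so when bit 11 of the low part is set we have to bump the upper part to compensate:

```python
def split_offset(offset):
    # Low 12 bits, reinterpreted as a signed value (jalr sign-extends).
    lo = offset & 0xFFF
    if lo & 0x800:
        lo -= 0x1000
    # Whatever remains goes into auipc's 20 bit upper immediate.
    hi = (offset - lo) >> 12
    assert (hi << 12) + lo == offset
    return hi, lo

# Example: an offset whose low part has bit 11 set.
print(split_offset(0x1801))  # hi = 2, lo = -0x7FF
```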
<p>In 32 bit machines, this jump is the largest jump possible, because the machine
can only address 32 bits, so we can be sure that any relative (or absolute,
using <code>lui</code> instead of <code>auipc</code>) jump we want to make will fit in place. The only
thing we have to take into account is to patch both instructions when we find the
target, not only one.</p>
<h6 id="relaxation">Optimization: pointer relaxation</h6>
<p>But using the largest possible jumps can lead to inefficiencies because we use
two instructions for jumps that can potentially fit in just one.</p>
<p>We can use something we saw before for that: relocations. More specifically,
in the case of the <span class="caps">GCC</span> toolchain, we can use the <code>R_RISCV_RELAX</code> that appeared before.</p>
<p>The relaxation relocation is going to tell the next step, which can be the
linker or anything else depending on the context we are working in, that the
pointer can be relaxed: in the case of the <code>auipc</code> + <code>jalr</code>, possibly by
replacing both instructions with a 21 bit jump like <code>jal</code>.</p>
<p>So we start with the longest jump possible, but when we actually know the
target of the jump, we can reduce it to something smaller that needs fewer instructions.</p>
<h6 id="relax-example">Example: relaxed global variable access in C compilers</h6>
<p>Global variables, as we saw before, are some of those points where compilers
need to use relocations and let the linker clean the result.</p>
<p>Global variables don’t necessarily involve jumps but they do involve pointers
for the loads and stores needed to operate with them. In the final executables,
global variables are part of the <code>.data</code> segment, because they are known at
compilation time, so we can exploit that fact a little and relax our weird
<code>auipc</code> + <em>load/store</em> combos.</p>
<p><span class="caps">RISC</span>-V has many registers, so we can use them for things that may not be the
norm in other platforms where registers are scarce. In this case, we can
exploit the <code>gp</code> (global pointer) register on <span class="caps">RISC</span>-V to improve access to
global variables. We can cache the address of the <code>.data</code> segment of the
program in the <code>gp</code> register and, as we know most global variables are
going to be near (within a 12 bit offset of) the beginning of the <code>.data</code> segment, we
are probably going to be able to remove some of the <code>auipc</code>s we inserted before.</p>
<p>So a simple load of a global 64 bit variable to a register:</p>
<pre class="highlight"><code class="language-clike">auipc t0, offset_to_global_hi // x[t0] = PC + offset_to_global_hi << 12
ld t0, offset_to_global_lo(t0) // x[t0] = mem[ x[t0] + offset_to_global_lo ]
</code></pre>
<p>Is optimized to this:</p>
<pre class="highlight"><code class="language-clike">ld t0, offset_from_data(gp) // x[t0] = mem[ x[gp] + offset_from_data ]
</code></pre>
<p>Of course, the offsets have to be calculated and all that, but this is not that difficult.</p>
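The decision itself can be sketched like this in Python (hypothetical helper and register choices; the real relaxation happens inside the linker, guided by <code>R_RISCV_RELAX</code>):

```python
def relax_global_load(var_addr, gp, pc):
    # If the variable is within gp's signed 12 bit reach, one
    # gp-relative load is enough; otherwise keep the auipc + ld pair.
    off = var_addr - gp
    if -2048 <= off < 2048:
        return [("ld", "t0", off, "gp")]
    # Out of reach: fall back to the PC-relative two-instruction form.
    pcrel = var_addr - pc
    lo = pcrel & 0xFFF
    if lo & 0x800:
        lo -= 0x1000
    hi = (pcrel - lo) >> 12
    return [("auipc", "t0", hi), ("ld", "t0", lo, "t0")]
```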
<h5 id="veneer">Solution: Veneers</h5>
<p>There are other solutions that don’t involve aggressively messing around with
the code we generated earlier, like removing instructions, which can be
pretty bad because you have to shift the array of instructions you generated to
close the gaps the pointer relaxation leaves.</p>
<p>Veneers are non-destructive, and they involve no instruction reorganization, so
they are interesting for those cases where you need to generate the code as you go.</p>
<p>Let’s explain them with an example:</p>
<pre class="highlight"><code class="language-clike">beq a0, a1, branch // Jump to `branch` if x[a0] == x[a1]
// Instructions...
branch:
// Branch target
</code></pre>
<p>As we saw previously, if we insert too many instructions between the jump
and the target we screw it. What we didn’t mention is that as we assemble
instructions one by one we can keep track of the amount of instructions we
are inserting.</p>
<p>Having that in mind, we can take decisions in time, right before it’s too late.
We can combine that knowledge with the <a href="#constants">constant insertion</a> method
introduced before to insert full-range jumps if needed, right before we exhaust
the possible offset of the original instruction.</p>
<p>Of course, we need to patch the original instruction to jump to the code we are
just going to insert, and we need to add some protections around the veneer to
make it only accessible to the original jump.</p>
<pre class="highlight"><code class="language-clike">beq a0, a1, veneer // Jump to `veneer` if x[a0] == x[a1]
// Many instructions, but not too many!
// Here we realize we are running out of offset range so we insert a helper
// block that lets us jump further.
j avoid // Jump to `avoid` so the normal execution flow
// doesn't fall in the veneer
veneer:
auipc t0,0 // x[t0] = PC + 0
ld t0,12(t0) // x[t0] = mem[ x[t0] + 12 ]
jalr zero,0(t0) // PC = x[t0]
ADDRESS(branch) // Literal address of `branch` label
avoid:
// As many instructions as we want
branch:
// Branch target
</code></pre>
<p>As it happened with constant insertion, there are positions where the veneer
insertion can be optimized a little, like right after a return or an
unconditional jump, so we don’t need the protection (<code>j avoid</code> in the example).</p>
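To show the bookkeeping, here is a toy one-pass emitter in Python. It is purely illustrative: the instruction tuples and names are made up, and the range check is a simplified stand-in for the real ±4 KiB reach of RISC-V’s 13 bit conditional branches:

```python
BRANCH_RANGE = 4096  # conditional branches reach roughly +-4 KiB

def assemble(stream):
    # Items are tuples like ("beq", label) or ("nop",); every
    # instruction is 4 bytes.
    code, pending = [], []
    for item in stream:
        # Before emitting, check unresolved branches: if one is close
        # to the end of its reach, drop a veneer here and redirect it.
        for br in list(pending):
            if len(code) * 4 - br["pos"] > BRANCH_RANGE - 16:
                code.append(("j", "over"))  # protect the fall-through
                veneer_index = len(code)
                code.append(("long-jump-to", br["label"]))  # the veneer
                code[br["index"]] = ("beq", f"veneer-{veneer_index}")
                pending.remove(br)
        if item[0] == "beq":
            pending.append({"pos": len(code) * 4,
                            "index": len(code),
                            "label": item[1]})
        code.append(item)
    return code
```

With a short program no veneer ever appears; with enough instructions between the branch and its target, the emitter drops one in just before the branch runs out of reach and rewrites the branch to point at it.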
<p>The bad thing about veneers is that they insert a bunch of instructions in the
out-of-range cases, and the jumps are done in two steps, which has a negative
effect on performance because they drop the pre-processed instructions <a href="https://en.wikipedia.org/wiki/Instruction_pipelining">in
the pipeline</a>.</p>
<p>Of course, the veneers themselves have to be patched too, because we won’t know
the target (<code>branch</code> in the example) until we reach it. But, in the case of the
veneer we can be 100% sure that we are going to be able to point to the target.</p>
<h6 id="lightening-veneer">Example: Lightening’s constant pools</h6>
<p>Lightening uses veneers for the jumps<sup id="fnref:veneer-if-needed"><a class="footnote-ref" href="#fn:veneer-if-needed">6</a></sup>, but they are part of
Lightening’s constant pool mechanism. Constant pools work the same for
constant insertion as for veneers, because veneers are basically constants.
Remember, code is numbers!</p>
<p>Basically anything that might be inserted as a constant, which can be a veneer
or just a number or whatever, is queued to the constant pool. The library is
going to emit instructions and check on each instruction if it needs to emit
any of the constants of the pool.</p>
<p>The constant pool and each of the entries in the pool have associated
information that tells the emitter whether they need to be emitted now or
whether they can wait, so the emitter can decide when to insert them.</p>
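A stripped-down model of that queue might look like this (invented names and structure; Lightening’s real implementation is in C and considerably more involved):

```python
class ConstantPool:
    # Each entry records a deadline: the last emitting position at
    # which it is still in range of the instruction that refers to it.
    def __init__(self):
        self.entries = []  # list of (deadline, value)

    def add(self, value, here, reach):
        self.entries.append((here + reach, value))

    def flush_due(self, here, margin=16):
        # Return the entries that must be emitted now, keep the rest.
        due = [v for d, v in self.entries if here + margin >= d]
        self.entries = [(d, v) for d, v in self.entries
                        if here + margin < d]
        return due
```

The emitter calls <code>flush_due</code> before each instruction; anything returned gets written into the instruction stream right there.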
<p>The literal pool entries have, of course, an associated relocation that
contains information about the original jump or load instructions we may need to
patch, as we already saw. So, in the case of a veneer emission, we need to
patch the original jump to point to the veneer and remember that the veneer needs
to be patched later, when we find its target.</p>
<p>The mechanism is not complex, but it’s not simple either. There are several
kinds of relocations, depending on what we want to do with them, different kinds
of patches we need to apply, address calculations and all those things that
require a level of attention to detail I’m not prepared to talk about.</p>
<h4 id="reg-access">Problem: Register access</h4>
<p>You may have seen a problematic point in some of the solutions we worked with:
we are using registers.</p>
<p>It’s not a problem by itself, but using registers might be really problematic
if we are inserting code between the instructions someone else wrote: we
can’t control the register use of the original program, so we might be
changing the values inside the registers with the magic tricks we sneakily inserted.</p>
<p>Imagine we use, say, the <code>t0</code> register in our veneer but the original program uses
that register for something else. That’s a problem. We are messing with the
value in the register and potentially (surely) breaking the program.</p>
<h5 id="stack">Solution: use the stack</h5>
<p>The most obvious solution you can think of is to use the stack. We can surround
our veneers or code insertions with some protection code that saves the values
of the registers on the stack and restores them when we finish.</p>
<p>It’s a simple solution in your mind, but if you need to deal with jumps it
can get messy: you may need to restore the register far away in the code and
keep track of everything. It can be complicated.</p>
<p>On the other hand, memory access is slow and boring and we don’t like that kind
of thing in our lives. We need more dynamite.</p>
<h5 id="controlled-regs">Solution: controlled register access</h5>
<p>The other solution we can provide is to keep control of the registers that are
being accessed and use others for our intervention.</p>
<p>A simple way to do this is to provide functions to get and release temporary
registers, instead of letting the programmers do whatever they want. This makes
sure that all the register access is controlled and we are not changing the
values of any register in use.</p>
<p>The main problem comes when the programmer needs all the registers
for their things and then we can’t really use any for our magic tricks. But we
can always keep at least one register for us and only for us (throwing an error
at the programmer when they use it) or even combine the use of the stack with
this solution.</p>
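A get/release interface can be as small as this sketch (invented names, not any real library’s API):

```python
class TempRegs:
    # Hand out temporary registers explicitly so inserted code can
    # never clobber a register the program is already using.
    def __init__(self, regs=("t0", "t1", "t2")):
        self.free = list(regs)
        self.used = set()

    def get(self):
        if not self.free:
            raise RuntimeError("no temporary registers available")
        reg = self.free.pop()
        self.used.add(reg)
        return reg

    def release(self, reg):
        self.used.discard(reg)
        self.free.append(reg)
```

Any code the emitter inserts asks <code>get</code> for its scratch registers and calls <code>release</code> when it is done, so two insertions can never silently share one.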
<p>If we are directly working with assembly code, where we can’t force the
programmer to use the interface we want, we can choose the solutions that don’t
involve register access so we don’t need to analyze the code to deduce if the
programmer is using the registers or not. Avoiding the problem is sometimes the
best solution.</p>
<p>In the case of libraries like Lightening, controlled register access is a must
because the programmer can’t control how its (virtual) instructions are translated
to machine code instructions: each machine has its own peculiarities and
details. In many cases they need to make use of temporary registers and, as the
instructions are built incrementally, preventing instructions from peeing on
each other is important.</p>
<hr>
<div style="
text-align: center;
font-size: smaller;
padding-left: 3em;
padding-right: 3em;
padding-top: 1em;
padding-bottom: 1em;
border-top: 1px solid var(--border-color);
border-bottom: 1px solid var(--border-color)">
Please, consider supporting me on <a href="https://liberapay.com/ekaitz">Liberapay</a> to
encourage my free software work.
</div>
<hr>
<h3 id="final">Final thoughts</h3>
<p>I know these are just a few things, but they are enough to let you make your
first program that involves machine code generation to a certain level.</p>
<p>I’m not a computer scientist but a telecommunication engineer<sup id="fnref:engineer"><a class="footnote-ref" href="#fn:engineer">7</a></sup>, so I
may put the focus on things that are obvious to the average reader of this
kind of post, while at the same time flying over things I consider basic
due to my studies but the average reader doesn’t. In any case,
feel free to <a href="https://ekaitz.elenq.tech/pages/about.html">contact me</a> if you
have questions or corrections.</p>
<p>Some of the tricks and lessons I included here are more important than others,
but the most important thing is to start thinking in these terms. Try to
understand the problems you face when you have separate compilation, assume
the fact that you can’t know the future… The mindset is the most important
point of all this and, once you have it, everything comes easier.</p>
<p>It’s also a lot of fun to realize code is just numbers in memory you can mess
around with. I hope you keep it in your brain forever.</p>
<p>I hope this post sheds some light on the dark hole that machine code
generation is, and makes you try to make your own findings in this beautiful
area of compilers, machines and extremely long blog entries.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:harvard">
<p>One of those peculiarities is the <a href="https://en.wikipedia.org/wiki/Harvard_architecture">Harvard
Architecture</a> that is not
going to let us make the fantastic trick I’m going to show you now. Harvard
Architecture is popular on microcontrollers. <a class="footnote-backref" href="#fnref:harvard" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:lisp">
<p>LISPers are always right. <a class="footnote-backref" href="#fnref:lisp" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:guile">
<p>You can read more about <a href="https://www.gnu.org/software/guile/manual/html_node/Just_002dIn_002dTime-Native-Code.html">how it works here</a>. <a class="footnote-backref" href="#fnref:guile" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:x86">
<p>In x86 not all the instructions have the same length and some can
encode larger <em>immediates</em>. <a class="footnote-backref" href="#fnref:x86" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
<li id="fn:nonsense-code">
<p><code>addi a5,a5,0x0</code> adds 0 to <code>a5</code> and stores the result in <code>a5</code>,
so it just moves it. <span class="caps">RISC</span>-V has a pseudoinstruction for that: <code>mv a5,a5</code>,
which expands to the <code>addi</code>. <code>objdump</code> is going to write <code>mv</code> in its
output, because it tries to be clever, but that hides the actual
instruction we have. I changed it to the actual instruction so we can
understand this better. <a class="footnote-backref" href="#fnref:nonsense-code" title="Jump back to footnote 5 in the text">↩</a></p>
</li>
<li id="fn:veneer-if-needed">
<p>Only in the architectures that need them. <code>x86</code> does not
need constant pools or veneers because the <span class="caps">ISA</span> is complex enough to handle
the problematic cases, adding levels of complexity ISAs like <span class="caps">RISC</span>-V or <span class="caps">ARM</span>
didn’t want to deal with. <span class="caps">RISC</span> vs <span class="caps">CISC</span>, y’know… <a class="footnote-backref" href="#fnref:veneer-if-needed" title="Jump back to footnote 6 in the text">↩</a></p>
</li>
<li id="fn:engineer">
<p>So, for all that software developers that write blog posts like
“Are we really engineers?” or stuff like that: <strong>I am</strong>, thanks for the
interest. <span class="caps">LOL</span> <a class="footnote-backref" href="#fnref:engineer" title="Jump back to footnote 7 in the text">↩</a></p>
</li>
</ol>
</div>
<h2>RISC-V Adventures II: hex0</h2>
<p>2021-06-08, by Ekaitz Zárraga</p>
<p>A love story about trust, machine code, hexadecimal notation and weird
instruction formats, with an epic unexpected solution coming back from the afterlife.</p><p><a href="https://github.com/oriansj/stage0">Stage0</a> is a crazy project that is pretty
well aligned with our vision of trust, bootstrappable software and whatnot.</p>
<p>During the last two weeks we have been working on the port of Stage0 to
<span class="caps">RISC</span>-V, providing the very first step of the process, so we came here to talk
about it, including a fantastic software necromancy moment you are going to enjoy.</p>
<h3>The origin of the times</h3>
<p>Once upon a time, software was written in machine code. Directly expressing the
machine instructions by the hands of the <em>programmers</em>. That was long, long,
time ago.</p>
<p>One day some programmer decided to write a translator that mapped that machine
code to something more human readable and created what we call <em>assembly
language</em> today. It gained popularity and programmers decided to add more and
more functionalities to the <em>assembly language</em> until the point that what they
created was not a one-to-one mapping with machine code anymore.</p>
<p>That’s how the first <em>programming languages</em> were born.</p>
<p>Their power was so immense that programmers decided to rewrite all their tools
using the new <em>programming languages</em>, they even wrote newer <em>programming
languages</em> with them.</p>
<p>But power corrupts the mind of the fool. Blinded by the power of <em>programming
languages</em>, most of the programmers forgot the origin of the times, and
forgetting the history is always a mistake.</p>
<p><em>Epic music starts…</em></p>
<h3>The problem</h3>
<blockquote>
<p>Warning: I oversimplified during the beginning of the post, but now… Oh
boy! I’m going to flatten this shit.</p>
</blockquote>
<p>Well, we need auditable software. I don’t think anyone can deny that fact.</p>
<p>But what does “auditable software” mean? Isn’t free software enough?</p>
<p>It is true that the best way to audit stuff is to read the code of the
programs. It’s the classic way to know if a program is doing what we
want it to. But, how can you be really sure the code you are reading is the one
that ships with your program?</p>
<p>You can’t! In general it’s impossible to know. There are many reasons, but I
will oversimplify and give you just some thoughts and let the people from
<a href="http://bootstrappable.org/">bootstrappable</a> do the dirty job.</p>
<ol>
<li>
<p>The compilation process is not reproducible, so the same source can result
in different binaries. You can’t just compare different binaries to make
sure your compiler is compiled correctly.</p>
</li>
<li>
<p>We have no way to solve the chicken-egg problem. The recipe to build the
compiler version X is to get the sources of the compiler version X and
compile them with the compiler version X-1. But how do you get the compiler
version X-1? Rinse and repeat.<br>
Also… Where’s the first version of your compiler? Does it run in modern
machines with modern operating systems?</p>
</li>
<li>
<p>As there’s no real way to get your compilers compiled by yourself, there’s
no real way to be sure that the compilers are emitting the code they are
supposed to. You have to <a href="https://www.cs.umass.edu/~emery/classes/cmpsci691st/readings/Sec/Reflections-on-Trusting-Trust.pdf"><strong>trust
them</strong></a>,
and you can’t audit what you need to trust.</p>
</li>
</ol>
<p>The first point is where projects like Nix and Guix make sense: they try to
create reproducible stuff. Not only for compilation processes but also for
scientific studies (that have to be reproduced by other people, because…
that’s how science works, isn’t it?) and other things. Being able to create
identical environments where you can ensure that their (compilation) output is
going to be identical is extremely important, but I’ll leave that for now.</p>
<p>The second and third points are two different problems but they result in the
same thing: software distributions ship huge binary blobs (hundreds of megabytes) as
a starting point (bash, gcc…) so the users have no chance to check if those
binaries are corrupt.</p>
<p><a href="https://www.gnu.org/software/mes/"><span class="caps">GNU</span> Mes</a> is a Scheme interpreter and C
compiler that was designed to reduce the size of the binaries you need to ship
with your distro. Mes has successfully reduced the size of the binaries that
need to be shipped with distros like Guix, but the project is more ambitious
than that.</p>
<h3>Full-source bootstrap</h3>
<p><a href="https://savannah.nongnu.org/projects/stage0/">Stage-0</a> is a project that is
tackling the same “trusting trust” problem but from the opposite perspective,
starting at the low level rather than with the high-level approach that <span class="caps">GNU</span> Mes uses.</p>
<p>Both projects work together to provide a greater goal: the full-source
bootstrap. The whole bootstrap is started from source, with no binaries
involved, so the distros don’t need to ship binaries anymore.</p>
<p>But how is that?</p>
<h3>Hex0</h3>
<p>Stage0 starts in the low level, the lowest possible, and builds more complex
programs from there, step by step.</p>
<p>The first step, Hex0, is a self-hosting “assembler”. I put the word assembler
in quotes because I think it’s a very strong word for this: it’s an <span class="caps">ELF</span> file
written in hexadecimal, with extra comments.</p>
<p>Hex0 is able to compile itself to a binary <span class="caps">ELF</span> file, converting the Hexadecimal
values to the binary values and stripping the comments.</p>
<p>We still have to compile the first Hex0 with something, but that’s not as
difficult as compiling the first <span class="caps">GCC</span>: Hex0 can be compiled by
very simple programs or even by hand, because it contains literally what is
going to be written in the final <span class="caps">ELF</span>.</p>
<p>The comments on the Hex0 files describe the instructions on each of the lines
of the <span class="caps">ELF</span> file so the resulting files can be audited, instruction by
instruction, with the manual of the <span class="caps">ISA</span> as a reference.</p>
<p>This starting point is more than enough to build on top of. We just need to add
more functionalities to the next steps: labels, constants… Until we are able
to compile a simple C compiler, Mes or anything.</p>
<p>It’s a clever solution for a crazy problem.</p>
<h3>Hex0 in <span class="caps">RISC</span>-V: my experience</h3>
<p>So this is my blog and I come here to talk about myself!<sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup></p>
<p>Some weeks ago I had the chance to make that first step, Hex0, for <span class="caps">RISC</span>-V (64
bit). You can take a look to <a href="https://github.com/oriansj/bootstrap-seeds/pull/2/files">the code here</a>.</p>
<p>There you can see I added three files: the assembly file, the hex0 file and the
binary of the compiled hex0. They are basically the same thing, but they are
included for readability.</p>
<h4>The assembly</h4>
<p>The first step for me was to write the assembly file. It’s easy once you know
how to make system calls in <span class="caps">POSIX</span>.</p>
<p><strong><span class="caps">POSIX</span> system calls in <span class="caps">RISC</span>-V</strong> are pretty easy:</p>
<ul>
<li>Load the arguments for the call in registers <code>a0</code>, <code>a1</code>…</li>
<li>Load the <em>syscall number</em> in <code>a7</code></li>
<li>Run <code>ecall</code></li>
</ul>
<p>The result of the system call comes in <code>a0</code>.</p>
<p><strong>Input arguments</strong> are also important, because we need to be able to tell
Hex0 which file we want to compile and where to put its output.</p>
<p>That’s pretty easy: input arguments are placed on the stack, so we can load
them by <em>pop</em>-ing them. As in any C program, the first element we get is the
number of arguments and the rest are the arguments themselves.</p>
<p>Putting all together, if you take a look to the first block of the program:</p>
<pre class="highlight"><code class="language-asm">_start:
ld a0, 0(sp) # Get number of the args
ld a1, 8(sp) # Get program name
ld a2, 16(sp) # Input file name
...
# Open input file and store FD in s2
li a7, 56 # sys_openat
li a0, -100 # AT_FDCWD
mv a1, a2 # input file
li a2, 0 # read only
ecall
mv s2, a0 # Save fd in for later
</code></pre>
<p>In the first chunk we are reading the stack, element by element, and in the
second we are opening the input file, using the filename we just obtained from
the stack.</p>
<p>Simple.</p>
<p>As a note, I’d like to remind you to finish your program, because if you don’t
it will continue to execute the memory after it and it’ll explode in your face:</p>
<pre class="highlight"><code class="language-asm">terminate:
# Terminate program with 0 return code
li a7, 93 # sys_exit
li a0, 0 # Return code 0
ecall
</code></pre>
<p>This tells the <span class="caps">OS</span> to finish the execution.</p>
<p>The internals of the assembly file are simple, so I won’t explain them in detail.
It basically iterates character by character, removing the comments and
converting each pair of hex digits to a byte.</p>
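In Python, the core of that job could be sketched like this (a toy model of what the hex0 assembler does, assuming <code>#</code> and <code>;</code> comment markers; the real thing works byte by byte through syscalls):

```python
def hex0_compile(source):
    # Keep pairs of hex digits, drop comments, emit raw bytes.
    out = bytearray()
    digits = ""
    for line in source.splitlines():
        for marker in ("#", ";"):
            line = line.split(marker)[0]
        for char in line:
            if char in "0123456789abcdefABCDEF":
                digits += char
                if len(digits) == 2:
                    out.append(int(digits, 16))
                    digits = ""
    return bytes(out)
```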
<p>Read it and tell me if you need help understanding it!<sup id="fnref:contact"><a class="footnote-ref" href="#fn:contact">2</a></sup></p>
<h4>The conversion to the Hex0</h4>
<p>Hex0, as I said, is an <span class="caps">ELF</span> file, written in hexadecimal, so we need to compile
our assembly file to binary and represent each of the instructions in
hexadecimal. And we need to resolve all the labels to final addresses.</p>
<p>There’s no easy way to do it. I started doing it by hand, reading the <span class="caps">RISC</span>-V
spec and converting the instructions one by one. But I ran into several
difficulties doing that.</p>
<p>Pseudoinstructions are expanded to more than one instruction so we need to be
careful in the comments and explain that correctly. Also, we need to resolve
the addresses accordingly. For example:</p>
<pre class="highlight"><code class="language-asm">la a1, buffer
</code></pre>
<p>This is a pseudoinstruction, and we need to resolve it to:</p>
<pre class="highlight"><code class="language-asm">auipc a1, 0
addi a1, a1, $OFFSET
</code></pre>
<p>Where <code>$OFFSET</code> is the offset from that instruction to the label <code>buffer</code>.</p>
<p>These kinds of expansions change our perception of the amount of instructions we
have, so we have to be extremely careful. I didn’t even mention the case where
the offset is very large! That’s another story (thankfully we don’t have to
deal with that yet).</p>
<p>Once the pseudoinstructions are expanded we need to convert them to their hex
values, and I swear it’s the most boring task I ever did in my life. Basically
because <span class="caps">RISC</span>-V instructions are not easy to map to their binary
representation (for reasons related to the hardware implementation).</p>
<h5>The eagles are coming!</h5>
<p>But I had a trick, a deus ex machina that would save my life. During the last
months I’ve been randomly working on a Scheme compiler for <span class="caps">RISC</span>-V assembly and
that made me start making a <a href="http://git.elenq.tech/pysc-v/"><span class="caps">RISC</span>-V assembly interpreter and compiler in
python</a>. It’s still an early <span class="caps">WIP</span>, and was
almost abandoned, but it has the basic machinery that lets me compile simple
instructions to hex.</p>
<p>With this dirty glue code I was able to compile the instructions one by one:</p>
<pre class="highlight"><code class="language-python">from registers.RV32I import *
from InstructionSets.RV64I import *

Regs = RegistersRV32I()

def x(registerName):
    return Regs.getPos(registerName)

def compile(instruction):
    hexrepr = hex(instruction.compile().value)
    hexval = hexrepr[2:]
    if len(hexval) < 8:
        hexval = "0" * (8 - len(hexval)) + hexval
    final = ""
    for i in range(0, 8, 2):
        final += hexval[i:i+2] + " "
    final = final.rstrip().upper()
    return " ".join(reversed(final.split(" ")))
</code></pre>
<p>I just needed to open a python shell and write something like:</p>
<pre class="highlight"><code class="language-python">compile( addi(x("a0"), x("a1"), 12) )
</code></pre>
<p>And that would compile that instruction for me, giving me the output in a
beautiful hexadecimal format.</p>
<pre class="highlight"><code class="language-hex">13 85 C5 00
</code></pre>
<p>Not the best <span class="caps">UX</span> but usable enough for a small file like this.</p>
<h5>The addresses</h5>
<p>The addresses are still something to solve.</p>
<p>I’m an idiot so I counted the instructions by hand and then realized I had to
expand some pseudoinstructions I forgot, so all the branch instructions were
broken. <em>Yes, I’m like that</em>.</p>
<p>Try to be smarter than I am. Use this trick:</p>
<p>Leave all the instructions that use addresses set to a wrong address, like 0 or
something, until you have converted the whole file. Once you have that, resolve
the addresses. That way you’ll make sure every pseudoinstruction is expanded and
you’ll be able to use tools that will help you choose the addresses correctly.</p>
<p>The trick I used was to add the <span class="caps">ELF</span> header, compile the file and then inspect
the resulting binary.</p>
<p>For the compilation there are two choices: we can assemble the assembly file we
wrote previously into a binary and use it as the hex0 assembler, or we can use
the high-level C prototype that Stage0 provides. Either way, they have to give
the same result.</p>
<p>I still don’t know why <code>objdump</code> is unable to process the binaries of the hex0
files, but <span class="caps">GDB</span> can handle them, so… launch <span class="caps">GDB</span> as <a href="https://ekaitz.elenq.tech/lightening.html">I explained in the
previous post</a> and disassemble the
whole file<sup id="fnref:where"><a class="footnote-ref" href="#fn:where">3</a></sup>.</p>
<p>It’ll look like this:</p>
<pre class="highlight"><code class="language-asm">0x0000000000600078: ld a0,0(sp)
0x000000000060007c: ld a1,8(sp)
0x0000000000600080: ld a2,16(sp)
0x0000000000600084: li s4,0
0x0000000000600088: li s5,0
0x000000000060008c: li a7,56
0x0000000000600090: li a0,-100
0x0000000000600094: mv a1,a2
0x0000000000600098: li a2,0
0x000000000060009c: ecall
...
</code></pre>
<p>With that disassembled result, we can literally do some math with the
addresses and fix all the instructions. We just need to subtract the current
address from the target address in the branches, so we get the offset.</p>
<blockquote>
<p><span class="caps">NOTE</span>: Be careful with the <code>la</code> pseudoinstruction (<code>auipc + addi</code>). The base
address here is the one of the <code>auipc</code>, not the one of the <code>addi</code>.</p>
</blockquote>
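<p>The math above can be sketched like this in Python (the function names and example addresses here are mine, not part of the Stage0 sources):</p>
<pre class="highlight"><code class="language-python">def branch_offset(branch_addr, target_addr):
    # PC-relative branches encode target minus the branch's own address
    return target_addr - branch_addr

def la_parts(auipc_addr, target_addr):
    # la rd, target expands to auipc + addi; the base address is the
    # auipc's, not the addi's.  Split the delta into the 20-bit upper
    # immediate for auipc and the signed 12-bit immediate for addi.
    delta = target_addr - auipc_addr
    hi = (delta + 0x800) >> 12
    lo = delta - (hi << 12)
    return hi, lo
</code></pre>
<p>For instance, a branch at <code>0x600088</code> jumping back to <code>0x600078</code> encodes the offset <code>-16</code>.</p>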
<h5>The <span class="caps">ELF</span> header</h5>
<p>If we don’t know how to make the <span class="caps">ELF</span> header we can’t do the previous step,
so we’d better say something about it.</p>
<p>Other files of the project are hex0 files too, so they also have a heavily
commented <span class="caps">ELF</span> header we can use as a reference. Also, <a href="https://en.wikipedia.org/wiki/Executable_and_Linkable_Format#File_header">Wikipedia has a great
explanation of it</a>.</p>
<p>The main field we need to change is <code>e_machine</code>. We need to set it to
<code>0xF3</code>, indicating <span class="caps">RISC</span>-V. We also need to make sure the 64-bit class flag
is set for <span class="caps">RV64</span>, and remember to check the endianness.</p>
<blockquote>
<p><span class="caps">NOTE</span>: Big endian is the most natural way to write the file by hand. If you
want to go for little endian this might get weird to write. The Python script
above uses little endian; note the <code>reversed</code> call in it.</p>
</blockquote>
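<p>For illustration, here is a Python sketch of how the first fields of a little-endian 64-bit <span class="caps">ELF</span> header for <span class="caps">RISC</span>-V fit together (field values follow the public <span class="caps">ELF</span> layout; this is not the project’s actual commented header):</p>
<pre class="highlight"><code class="language-python">import struct

# e_ident: magic, 64-bit class (2), little endian (1), ELF version (1), padding
e_ident = b"\x7fELF" + bytes([2, 1, 1, 0]) + b"\x00" * 8

e_type    = 0x02  # ET_EXEC: executable file
e_machine = 0xF3  # EM_RISCV
e_version = 1

# "<" means little endian: 0xF3 is emitted as the byte pair F3 00
header_start = e_ident + struct.pack("&lt;HHI", e_type, e_machine, e_version)
</code></pre>
<p>Writing it by hand means doing that byte reversal yourself, which is why big endian feels more natural on paper.</p>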
<h5>Debugging</h5>
<p>Once you have everything ready you need to make sure it’s doing what it’s
supposed to.</p>
<p>My first working program was failing to use an output file. Someone in the
<code>#bootstrappable</code> <span class="caps">IRC</span> channel (I’m sorry, I can’t remember who it was) told me to
<code>strace</code> the program to see what was going on, and with that, plus some debugging
with <span class="caps">GDB</span>’s <code>layout asm</code>, I was able to figure out that one instruction was using a
wrong register.</p>
<p>These tools are important because the whole process is done by hand, so there
are many chances to screw up somewhere.</p>
<p><code>strace</code> is extremely handy for this specific program because most of its
functionality is based on system calls. If you filter the output
correctly you can see everything the program does, accurately.</p>
<hr>
<div style="
text-align: center;
font-size: smaller;
padding-left: 3em;
padding-right: 3em;
padding-top: 1em;
padding-bottom: 1em;
border-top: 1px solid var(--border-color);
border-bottom: 1px solid var(--border-color)">
Remember you can hire <a href="https://elenq.tech">ElenQ
Technology</a> to help you with your research, development or training. <br/>
If you want to encourage my free software work you can support me on <a
href="https://liberapay.com/ekaitz">Liberapay</a>.
</div>
<hr>
<h3>Final thoughts</h3>
<p>This contribution has been a lot of fun. It let me understand a little bit
more about the ecosystem around the full-source bootstrap, which is kind of
complex and includes some other stuff I didn’t even mention.</p>
<p>I learned a lot from this: I now have a deeper understanding of the instruction
formats in <span class="caps">RISC</span>-V, and I picked up some cool <span class="caps">GDB</span> tricks that are always useful.</p>
<p>Stage0 has a really interesting approach to auditability that’s worth thinking
about. They build everything from a commented binary file (that’s basically
what hex0 is) that acts like a seed, so we can audit everything, including the
very first step. The solution of having the contents of the <span class="caps">ELF</span> file directly
written in hexadecimal is enough to ensure we can certify the contents are what
we expect, and having every instruction commented with its assembly counterpart
gives us the chance to go to the <span class="caps">ISA</span> and check if that’s actually what it is
supposed to be. Perfectly auditable.</p>
<p>Using this first step as a building block for anything else ensures that we
never need to rely on a binary file whose origin we can’t verify.</p>
<p>Really interesting stuff.</p>
<p>Now we have this very first step ported to an open instruction set, which also
opens the door to auditability from the hardware perspective. Now we can
start thinking about having an auditable software stack in a device we designed
ourselves, so we can audit it too. This is huge.</p>
<p>Now we need to keep pushing in this direction, porting all the rest of the
steps of Stage0, Mes and many other projects, if we want to reach full <span class="caps">RISC</span>-V
support. This is just one small step in that direction.</p>
<p>Hey! I almost forgot! And thanks to this I had the chance to work a little bit
more on my assembly interpreter, and recover it from the darkness. That’s also
great. Isn’t it?</p>
<blockquote>
<p>Well, so we learned some things today, but the most important thing is that all the
stupid things we do, all the random projects we work on, all the experiences
we have in life are not just a <em>waste of time</em>. They may prove useful
in the future, but you don’t know when…<br>
What is sure is that if you don’t stay creative and active, you’ll never
have any experience to learn from.</p>
</blockquote>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>Not really, in fact, I use my experience as a vehicle to introduce you to
great projects and interesting pieces of knowledge. But ssssh, don’t tell
anyone. <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:contact">
<p>Contact me, seriously. I have my <a href="https://ekaitz.elenq.tech/pages/about.html">contact info here</a> <a class="footnote-backref" href="#fnref:contact" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:where">
<p>If you want to know where to start to disassemble, you can just ask
<code>where</code> to <span class="caps">GDB</span>. <a class="footnote-backref" href="#fnref:where" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
</ol>
</div>RISC-V Adventures: Lightening2021-05-19T00:00:00+03:002021-05-19T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2021-05-19:/lightening.html<p>The port of Lightening, the code generation library used in
Guile Scheme, and other adventures on the low level world of <span class="caps">RISC</span>-V.</p><p>In <a href="https://ekaitz.elenq.tech/2020.html">the latest post</a> I summarized the last
year because I wanted to talk about what I’m doing <strong>now</strong>. At this very moment
I just realized that almost half of 2021 is already gone, so following
the breadcrumbs until this day could be a difficult task. That’s why I won’t
give you more context than this: <span class="caps">RISC</span>-V is a deep, <em>deep</em>, hole.</p>
<p>I told you I was researching programming languages and that made me research
a little bit about ISAs. That’s how I started reading about <span class="caps">RISC</span>-V, and I
realized learning about it was a great idea for many reasons: it’s a new thing
and as an R&D engineer I should keep up to date, and the book I chose is really
good<sup id="fnref:book"><a class="footnote-ref" href="#fn:book">1</a></sup> and gives a great description of the design decisions behind
<span class="caps">RISC</span>-V.</p>
<p>From that, and I don’t really know how, I started taking part in the effort of
porting Guix to <span class="caps">RISC</span>-V. One of the things I’m working on right now is the
port of the machine code generation library that Guile uses, called
<code>lightening</code>, to <span class="caps">RISC</span>-V, and that’s what I’m talking about today.</p>
<h3>The lightening</h3>
<p>Lightening is a lightweight fork of <a href="https://www.gnu.org/software/lightning/"><span class="caps">GNU</span>
Lightning</a>, a machine code generation
library that can be used for many things that need to abstract over the target
<span class="caps">CPU</span>, like <span class="caps">JIT</span> compilers.</p>
<p>The design of <span class="caps">GNU</span> Lightning is easy to understand. It exposes a set of
instructions inspired by <span class="caps">RISC</span> machines; you use those, the library
maps them to actual machine instructions on the target <span class="caps">CPU</span> and returns you a
pointer to the generated function. Simple stuff.</p>
<p>The code is not that easy to understand: it makes a pretty aggressive and
clever use of C macros that I’m not used to reading, so it is a little bit
hard for me.</p>
<p>I could try to explain the reasons behind the fork, but <a href="https://wingolog.org/archives/2019/05/24/lightening-run-time-code-generation">the guy who did it,
who is also the maintainer of Guile, explains it much better than I
could</a>.
But at least I can summarize: Lightening is simpler and it fits better what
Guile needs for its <span class="caps">JIT</span> compiler.</p>
<p>Boom! Lightened!</p>
<h3>The process</h3>
<p>So Lightening is basically simpler, but the idea is the same. But how do you
port a library like that to another architecture?</p>
<p>The idea is kind of simple, but we need to talk about the basics first.</p>
<p>Lightening (and <span class="caps">GNU</span> Lightning too, but we are going to specifically talk about
Lightening from here) emulates a fake <span class="caps">RISC</span> machine with its functions. It
provides <code>movr</code>, <code>movi</code>, <code>addr</code> and so on. Basically, all those are C functions
you call, but they actually look like assembly. Look at a random example here
taken from the <a href="https://gitlab.com/wingo/lightening/-/blob/main/tests/addr.c#L6"><code>tests/addr.c</code> file</a>:</p>
<pre class="highlight"><code class="language-clike">jit_begin(j, arena_base, arena_size);
size_t align = jit_enter_jit_abi(j, 0, 0, 0);
jit_load_args_2(j, jit_operand_gpr (JIT_OPERAND_ABI_WORD, JIT_R0),
                jit_operand_gpr (JIT_OPERAND_ABI_WORD, JIT_R1));
jit_addr(j, JIT_R0, JIT_R0, JIT_R1);
jit_leave_jit_abi(j, 0, 0, align);
jit_retr(j, JIT_R0);
size_t size = 0;
void* ret = jit_end(j, &size);
int (*f)(int, int) = ret;
ASSERT(f(42, 69) == 111);
</code></pre>
<p>Basically you can see we get the <code>f</code> function from the calls to <code>jit_WHATEVER</code>,
which include the call to the preparation of the arguments, <code>jit_load_args_2</code>,
and the actual body of the function: <code>jit_addr</code>. The word <code>addr</code> comes from
<em>add</em> and <em>r</em>egisters, so you can understand what it does: it adds the contents of
two registers and stores the result in another register.</p>
<p>The registers have understandable names like <code>JIT_R0</code> and <code>JIT_R1</code>, which are
basically the register number (the <code>R</code> comes from “register”).</p>
<p>So, if you check the <code>jit_addr</code> line you can see it’s adding the
contents of register <code>0</code> and register <code>1</code> and storing the result in
register <code>0</code> (the first argument is the destination).</p>
<p>That’s pretty similar to <span class="caps">RISC</span>-V’s <code>add</code> instruction, isn’t it?</p>
<p>Well, it’s basically the same thing. The only problem is that we have to emit
the machine code associated with the <code>add</code>, not just writing it down in text,
and we also need to declare which are the registers <code>JIT_R0</code> and <code>JIT_R1</code> in
our actual machine.</p>
<p>Thankfully, the library already has all the machinery for all that. There
are functions that emit the code for us, and we can also add some <code>define</code>s to
map <code>JIT_R0</code> to the <span class="caps">RISCV</span> <code>a0</code> register, and so on.</p>
<p>We just need to make new files for <span class="caps">RISC</span>-V, define the mappings and add a little
bit of glue around.</p>
<h3>The problems</h3>
<p>All that sounds simple and easy (on purpose), but it’s not <em>that</em> easy.</p>
<p>Some instructions that Lightening provides don’t have a simple mapping to
<span class="caps">RISC</span>-V and we need to play around with them.</p>
<p>There’s an interesting example: <code>movi</code> (move immediate to register).</p>
<p>Loading an immediate into a register sounds extremely simple,
but it’s more complex than it looks. <span class="caps">RISC</span>-V assembly has a
pseudoinstruction for that, called <code>li</code> (load immediate), that can be literally
mapped to <code>movi</code>. The main problem is that pseudoinstructions <em>don’t really
exist</em>.</p>
<p>You all know there are <span class="caps">CISC</span> and <span class="caps">RISC</span> machines. <span class="caps">CISC</span> machines were a way to make
simpler compilers, pushing that complexity to the hardware. <span class="caps">RISC</span> machines are
the other way around.</p>
<p>The <span class="caps">RISC</span> hardware tends to be simple, with few instructions; the
compiler is the one that has to do the dirty job, trying to make the
programmer’s life better.</p>
<p>Pseudoinstructions are a case of that. The programmer only wants to load a
constant into a register, but real life can be very depressing. When you want to
load an immediate you don’t want to think about its size; if it fits in a
register you are fine, aren’t you?</p>
<p>Pseudoinstructions are expanded to actual instructions by the assembler, so you
don’t need to worry about those details. In fact, <span class="caps">RISC</span>-V doesn’t really have
move instructions, they are all pseudoinstructions that are expanded to
something like:</p>
<pre class="highlight"><code>addi destination, source, 0
</code></pre>
<p>Which means “add 0 to source and store the result in destination”.</p>
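<p>As a sketch (in Python rather than Lightening’s C, with register numbers following the standard <span class="caps">RISC</span>-V <span class="caps">ABI</span>), that expansion can be encoded like this:</p>
<pre class="highlight"><code class="language-python">def addi(rd, rs1, imm):
    # RISC-V I-type encoding: imm[11:0] | rs1 | funct3=000 | rd | opcode 0010011
    return ((imm & 0xFFF) << 20) | (rs1 << 15) | (rd << 7) | 0b0010011

def mv(rd, rs):
    # mv rd, rs is just addi rd, rs, 0
    return addi(rd, rs, 0)
</code></pre>
<p>For instance, <code>addi a0, a1, 12</code> (registers 10 and 11) encodes to <code>0x00C58513</code>.</p>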
<p>The <code>li</code> pseudoinstruction is a very interesting case, because the expansion is
kind of complex; it’s not just a one-to-one conversion.</p>
<p>In <span class="caps">RISC</span>-V all the instructions are 32 bits long (or 16 if you take into account the
compressed instruction extension) and the registers are 32 bits wide in <span class="caps">RV32</span> and
64 bits wide in <span class="caps">RV64</span>. You see the problem, right? No 32-bit instruction is able to
load a full register at once, because that would mean all the bits
available in the instruction (or more!) would need to be used to store the immediate.</p>
<p>Depending on the size of the immediate you want to load, the <code>li</code> instruction
can be expanded to just one instruction (<code>addi</code>), two (<code>lui</code> and <code>addi</code>) or, if
you are on <span class="caps">RV64</span>, to a series of up to eight instructions (<code>lui</code>, <code>addi</code>, <code>slli</code>,
<code>addi</code>, <code>slli</code>, <code>addi</code>, <code>slli</code>, <code>addi</code>). There are also sign extensions in the
middle that make the whole process even funnier.</p>
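<p>To make the two-instruction case concrete, here is a simplified Python sketch (not Lightening’s actual implementation) of the 32-bit expansion; the subtlety is that <code>addi</code> sign-extends its 12-bit immediate, so the upper part has to be compensated:</p>
<pre class="highlight"><code class="language-python">def expand_li32(imm):
    # Sketch: expand `li rd, imm` for a 32-bit immediate into
    # (mnemonic, immediate) pairs; real code must also emit the registers.
    lo = imm & 0xFFF
    if lo >= 0x800:      # addi would sign-extend this and subtract,
        lo -= 0x1000     # so carry one into the upper part to compensate
    hi = ((imm - lo) >> 12) & 0xFFFFF
    if hi == 0:
        return [("addi", lo)]           # fits in a single instruction
    return [("lui", hi), ("addi", lo)]  # lui loads hi << 12, addi adds lo
</code></pre>
<p>For example, <code>li rd, 0x12345678</code> becomes a <code>lui</code> with <code>0x12345</code> followed by an <code>addi</code> with <code>0x678</code>.</p>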
<p>Of course, as we are generating the machine code, we can’t rely on an assembler
to do the dirty job for us: we need to expand everything ourselves.</p>
<p>So something that looked extremely simple, the implementation of an obvious
instruction, can get really messy, and we need a reasonable way to check whether we
did the expansions correctly.</p>
<p>And we didn’t talk yet about those instructions that don’t have a clear mapping
to the machine!</p>
<p>Don’t worry: we won’t. I just wanted to point out the need for proper tools for this task.</p>
<h3>The debugging</h3>
<p>The debugging process is not as complex as I thought it was going to be, but
my setup is a little bit of a mess, basically because I’m on Guix, which
doesn’t have proper support for <span class="caps">RISC</span>-V, so I can’t really test on my machine
(if there’s a way, please let me know!).</p>
<p>I’m using an external Debian Sid machine (see acknowledgements below) for it.</p>
<p>I basically followed the <a href="https://wiki.debian.org/RISC-V#Cross_compilation">Debian
tutorial</a> for cross
compilation environments and Qemu, and everything is perfectly set up for the task.</p>
<p>Next: how to debug the code?</p>
<p>I’m using Qemu as a target for <span class="caps">GDB</span>, so I can run a binary on Qemu like this:</p>
<pre class="highlight"><code class="language-sh">qemu-riscv64-static -g 1234 test-riscv-movi
</code></pre>
<p>Now I can attach <span class="caps">GDB</span> to that port and disassemble the <code>*f</code> function that was
returned from Lightening to see if the expansion is correct:</p>
<pre class="highlight"><code>$ gdb-multiarch
GNU gdb (Debian 10.1-2) 10.1.90.20210103-git
...
For help, type "help".
Type "apropos word" to search for commands related to "word".
(gdb) file lightening/tests/test-riscv-movi
Reading symbols from lightening/tests/test-riscv-movi...
(gdb) target remote :1234
Remote debugging using :1234
0x0000000000010538 in _start ()
(gdb) break movi.c:15
Breakpoint 1 at 0x1d956: file movi.c, line 15.
(gdb) continue
Continuing.
Breakpoint 1, run_test (j=0x82e90, arena_base=0x4000801000
"\023\001\201\377#0\021", arena_size=4096) at movi.c:15
15 ASSERT(f() == 0xa500a500);
(gdb) disassemble *f,+100
Dump of assembler code from 0x4000801000 to 0x4000801064:
0x0000004000801000: addi sp,sp,-8
0x0000004000801004: sd ra,0(sp)
0x0000004000801008: lui a0,0x0
0x000000400080100c: slli a0,a0,0x20
0x0000004000801010: srli a0,a0,0x21
0x0000004000801014: mv a0,a0
0x0000004000801018: slli a0,a0,0xb
0x000000400080101c: addi a0,a0,660 # 0x294
0x0000004000801020: slli a0,a0,0xb
0x0000004000801024: addi a0,a0,20
0x0000004000801028: slli a0,a0,0xb
0x000000400080102c: addi a0,a0,1280
0x0000004000801030: ld ra,0(sp)
0x0000004000801034: addi sp,sp,8
0x0000004000801038: mv a0,a0
0x000000400080103c: ret
0x0000004000801040: unimp
...
</code></pre>
<p>Of course, I can debug the library code normally, but the generated code has to
be checked like this, because there’s no debug symbol associated with it and
<span class="caps">GDB</span> gets lost in there.</p>
<p>Important stuff. Take notes.</p>
<hr>
<div style="
text-align: center;
font-size: smaller;
padding-left: 3em;
padding-right: 3em;
padding-top: 1em;
padding-bottom: 1em;
border-top: 1px solid var(--border-color);
border-bottom: 1px solid var(--border-color)">
This free software work is also work. It needs funding!<br/>
Remember you can hire <a href="https://elenq.tech">ElenQ
Technology</a> to help you with your research, development or training. <br/>
If you want to encourage my free software work you can support me on <a
href="https://liberapay.com/ekaitz">Liberapay</a>.
</div>
<hr>
<h3>The acknowledgements</h3>
<p>It’s weird to have acknowledgments in a random blog post like this one, but I
have to thank my friend <a href="https://56k.es/">Fanta</a> for setting up a Debian
machine I can use for all this.</p>
<p>Also, I’d like to thank Andy Wingo for the disassembly trick you just read.
Yeah, there was no chance I’d have discovered that by myself!</p>
<h3>The code</h3>
<p>The whole process can be followed in the project’s GitLab, where I opened a
merge request. Feel free to comment and propose changes.</p>
<p><a href="https://gitlab.com/wingo/lightening/-/merge_requests/14/commits">Here’s the link</a>.</p>
<h3>The future</h3>
<p>There’s still plenty of work to do. I only implemented the basics of the <span class="caps">ALU</span> and
some configuration of the <span class="caps">RISC</span>-V context, like the registers and all that, but
I’d say the project is headed in the right direction.</p>
<p>I don’t know if I’m going to be able to spend as much time as I want on it,
but I’m surely going to keep adding new instructions and eventually try to wrap
my head around how jumps are implemented.</p>
<p>It’s going to be a lot of fun, that’s for sure.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:book">
<p>It’s available for free in some languages and it’s 20 bucks in
English. Totally worth it:<br>
<a href="http://www.riscvbook.com/">http://www.riscvbook.com/</a> <a class="footnote-backref" href="#fnref:book" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
</ol>
</div>Review of 20202021-05-16T00:00:00+03:002021-05-16T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2021-05-16:/2020.html<p>The review of our year 2020 at ElenQ Technology.</p><p>It’s been a while since the previous post here, and it’s not because I don’t
have anything to talk about. I’ve been working on many things since the
previous one.</p>
<p>I wanted to write specifically about something I’m doing these days, but that’s
difficult to contextualize if there’s a full year gap in the middle. So I
decided to talk about the 2020 and make a short review about what we did so we
can look forward and see what can we build from this.</p>
<h4>2020 at ElenQ Technology</h4>
<p>2020 has been harsh for everyone, including ElenQ Technology. We started the
year with a lot of energy and we were pretty busy with courses here and there.
But then the pandemic came and all the in-person training stopped, so we lost
our main income source, which is also one of the kinds of work I personally enjoy the most.</p>
<p>So, after finishing our course on <em>Modern C++</em> in July (we’ll talk about that
in a future post), right after we were freed from the lockdown here, everything
stopped. No more in-person courses, no more clients, nothing.</p>
<p>We knew that the pandemic was affecting the economy so we were well aware that
there were few chances to get clients in the rest of the year. Thankfully, we
had some work to do: <a href="https://publishing.elenq.tech/en/">ElenQ Publishing</a>.</p>
<p>We spent the summer and part of the autumn preparing the books, the printing
and making the paperwork as well as the tools we needed for the website and
future books. By November 13 we already had every book shipped and the
website was almost ready. At the beginning of December, the website was
finished and published.</p>
<p>It was more work than we expected, but now we have a complete set of tools for
future publications that can cover any point of the process with
almost no human interaction. We automated almost everything, and those things
we didn’t automate are simple once you know how to do them.</p>
<p>Of course, as engineers, we only consider automating things that we are going
to repeat so you can think about all this work as a plan to keep publishing new
material in the future.</p>
<p>It’s really interesting to mention that our whole process is reproducible as we
are using <a href="https://guix.gnu.org">Guix</a> as a tool, so no matter what happens we
could still go back in time and remake the books exactly as they were when we
published them.</p>
<p>As you see, at a company level, most of our work in 2020 was focused on
teaching and making the books (another form of teaching), because it’s
something I personally enjoy a lot and I’d say it’s more fulfilling than
anything else I’ve done. But it was sadly affected by the pandemic, so we need
to reorganize our strategy a little bit.</p>
<h4>Personal level</h4>
<p>Of course, I spend time on other things too. A great part of my job is to
randomly research anything I find interesting, so I can keep my mind fresh for
the possible projects that may come. This gives me tools and ideas, and also
lets me learn from other people.</p>
<p>During the year I spent some time contributing to Guix, for <a href="https://ekaitz.elenq.tech/donations-guix-01.html">reasons I already
discussed here</a>. The most
notable contributions were the addition of a really interesting package that
was missing, Meshlab, and the fix of a package that had been failing to
compile for months: FreeCAD.</p>
<p>Being locked at home, I also had the chance to go back to electronics, which
is a huge part of what I studied at university but something I never had the chance to
work on at a professional level. I even designed some PCBs, and produced
and soldered them with the highest level of quality possible. It was a great experience.</p>
<p>On the other hand, I also needed some time to relax and try to recover from
some longstanding health issues I’ve been dealing with, that also deteriorated
because of the pandemic.</p>
<p>After some time practicing yoga and taking care of my body, I feel much better
in general; even if my issues are still there, at least they are not aggravated
by the bad posture and the physical stress that working at a computer can
provoke. So, if you are open to a suggestion: stretch, do some strength
exercises and try to keep your body in shape, especially if you work in an
office or any other kind of sedentary job that involves repetitive
movements like using a mouse or typing on a keyboard.</p>
<h5>December</h5>
<p>As I mentioned, our work with ElenQ Publishing was done at the beginning of
December. We approached that as a chance to stop and think.</p>
<p>During the last three years I had few chances to focus on a specific subject
for a long time; I had to quickly jump from one thing to another in order to
be able to reach all the projects we had.</p>
<p>I was frustrated because of that. I’m easily distracted and it’s hard for me to
pay attention to the same thing for a while, but I really like to understand
things <strong>deeply</strong>. Those who know me or attended my courses know it,
and my everyday life, full of stress and various stimuli, was making me unable
to concentrate.</p>
<p>I had moments of attention and clearness of mind during the pandemic (and due
to the pandemic) that made me feel at peace, so I wanted to pursue that kind of
frustration-less life on purpose, not only when it comes on its own.</p>
<p>So that’s what I did. I just needed something to investigate, something I had been
interested in since the very beginning of my career: programming languages.</p>
<p>I collected some books on compiler implementation and started reading them;
then I realized I was interested in operating system implementation, so I read
about that too. Both things need to run somewhere, so I also spent some time
digging into various architectures and their instruction sets, and so on.</p>
<p>I started developing a simple <a href="https://github.com/ekaitz-zarraga/blas">Scheme
implementation</a> (only started, not
finished or anything) that served as an excuse to have a goal in mind during the
process. Also, I decided to <a href="https://twitch.tv/ekaitzza">live stream</a> my
research process so I could share my findings with others and let them give
me their thoughts and help me go slowly, paying attention to the interesting details.</p>
<p>And let me tell you, compiler implementation is often a difficult subject for
me, especially the theory, because my background lacks some of the concepts
that Computer Science students have and that I have to study from scratch<sup id="fnref:note"><a class="footnote-ref" href="#fn:note">1</a></sup>.</p>
<p>Having the chance to tackle a difficult long-term task helped me forget and not
worry about the <em>bad</em> year we had as a company, in which we only had actual
paid work during the first half of the year. I was just grateful to be able to
sustain myself long enough to have the chance to breathe and spend more time
with myself, doing something I don’t always have the chance to do, regardless
of everything we, individually and collectively, were going through.</p>
<p>I hope you had some moments of relief too.</p>
<h5>What I learned</h5>
<p>I obviously learned many things during the year (books have been read!), but I
don’t want to focus on that.</p>
<p>Sometimes the most important thing is not the goal, but the process. You learn
more from the travel than from the arrival, right?</p>
<p>I like to think that I learned to care more about myself in 2020. I’m still
sick, and my recovery got stuck as I was literally stuck at home, but that’s
just a temporary issue, because I’m taking care of myself. Maybe not every day,
but almost every day I take care of myself. That’s what counts.</p>
<p>2020 taught me how to make a publishing house. That’s an important piece of
knowledge, but I consider it more valuable that I reclaimed my time and my attention.
That taught me an important lesson by itself and it also helped me learn
about myself.</p>
<p>I learned that I was feeling alone in my interests. I had no one to share my
interests with. I know it is surprising to you, but basically nobody is
interested in how garbage collectors, processors or anything like that work.
Most people don’t even care about what they are. Crazy, huh?</p>
<p>Sharing my findings, my research and my errors with other people makes me feel
better. I feel someone is there, on the other side. It helps me avoid
the frustration and the lack of motivation I have been feeling in recent years.</p>
<p>The streaming helped with that<sup id="fnref:english"><a class="footnote-ref" href="#fn:english">3</a></sup>: I had people reacting instantly; some
sent me papers to read and ideas, and others proposed interesting things to do.
That feels good. It helped me remember that I’m not alone.</p>
<p>If 2020 taught me anything, it’s that I, or we, need others to feel better.
We need to take care of people<sup id="fnref:people"><a class="footnote-ref" href="#fn:people">4</a></sup>, because life is much better with them.</p>
<p>On top of many things, being conscious that I was researching <strong>deep</strong> opened
the door to applying that depth in my everyday life more often. Not that I
wasn’t doing that before (those who know me are aware that I’m kind of an
intense guy), but now I’m more conscious about it and I can selectively choose
to go deeper into my thoughts and feelings.</p>
<p>This time for myself reminded me how intense I was back then and how much I enjoyed
being a dedicated person.</p>
<h4>So what</h4>
<p>As I said, at a company level I decided to use that time to arrange a new
strategy. I wouldn’t say I changed it that much, because I was at peace when it
was developed, almost 4 years ago, but it let me rethink it taking into account
my professional and personal experience of the recent years.</p>
<p>Collaborating on free software projects has shown me that I feel comfortable
with larger codebases and more complex concepts that were too much for me in
the past. Now I feel more confident about that.</p>
<p>Of course, this came with practice and time, but also after years of stressful
work and random research that is not really fulfilling. I don’t mean that you
need to spend time on that to be able to tackle bigger projects. I mean that my
past is part of what I am now, and even the bad times can help forge a better future.</p>
<p>I decided to keep researching the way I was, because it’s something that makes
me feel good, and to work more slowly, paying attention to the details as I
like to do.</p>
<p>I’ll try to share more about my work, on a technical and a personal level. I’ll
keep streaming for some time, and I’ll try to use this blog more, as I did in
the past.</p>
<p>So, as I was saying, all this year helped me remember about important things,
and forget a little bit about urgent things.</p>
<blockquote>
<p><span class="dquo">“</span>Instead of swimming fast trying to reach as far as I could, pumping my
blood, splashing water around and having to take a short breath between each
arm stroke, now I want to dive. I’m far enough from the coast, already.</p>
<p>I want to stay in the surface until I’m ready, having some rest and breathing
as much as I want, and then, I’ll dive. I’ll discover the colors of the
coral reef, the sea creatures and even the deepest darkness if I feel like
it. When I’m done or I’m tired, I’ll go back to the surface, take a deep
breath and have some rest, feeling the sun in my face, until the next immersion.</p>
<p>I’m not going anywhere. I’m not in a hurry anymore.”</p>
</blockquote>
<div class="footnote">
<hr>
<ol>
<li id="fn:note">
<p>But hey, I’m much more comfortable with low level stuff like ISAs and
all that. My degree is not useless after all. <a class="footnote-backref" href="#fnref:note" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:blog">
<p>In this blog, by contrast, I can’t really know how many people read
or interact with what I write. So I encourage you to contact me and share
ideas! <a class="footnote-backref" href="#fnref:blog" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:english">
<p>Making the videos also helped me to feel more confident about my
English (people understand what I say!) and that is helping me tackle larger
projects that involve people from different places. <a class="footnote-backref" href="#fnref:english" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:people">
<p>More now, that we have some heavy shit going on out there. <a class="footnote-backref" href="#fnref:people" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
</ol>
</div>Our own git server2020-07-09T00:00:00+03:002020-07-09T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2020-07-09:/git-repo.html<p>How to set up a git server with simple tools</p><p>Having some rest these days after some really hard-working months… I decided I
wanted to solve something that had been on my to-do list for a long time. A really
long time.</p>
<p>I wanted to have my own git repository for ElenQ and my personal projects
(which are the same thing because I take ElenQ <em>very</em> personally) and so I did.</p>
<p>You may think: so you installed Gitea or something and you are done, right?</p>
<p>But the answer is no.</p>
<p>That would be the normal approach if I didn’t have the arbitrary constraints I
imposed on myself. This text is about those weird constraints and the random
thoughts I have about all this, and also about my current setup and how to
build it. The second part serves as a <em>tutorial</em> for myself, in case I screw up
and need to start over, and also as a way to consciously think about what I did.</p>
<h3>Context: random thoughts</h3>
<p>For me, code is not a social network and it shouldn’t be. I understand why
github is the way it is, but for me it’s just code. I don’t need to show how
much I code; I don’t need to follow, like, star, fork, or even share my opinion
about other people’s work publicly. That’s completely unrelated to the job.</p>
<p>Large projects like github are changing the way we collaborate. I’m not against
that, but it looks like we’re starting to forget that git doesn’t need anything
else to function. There’s no need for pull/merge requests, web servers or anything else.</p>
<p>Web interfaces for code are cool, but nothing is better than your own editor.
I realized I just clone the repositories I want to dig into and search my
local clone with my own tools so… why bother having a powerful<sup id="fnref:powerful"><a class="footnote-ref" href="#fn:powerful">1</a></sup> web interface?</p>
<p>I don’t like to be forced to register in a platform just for sending patches or
taking part in a project. Why do I <strong>need</strong> to have a github account?</p>
<p>ElenQ Technology currently uses a free gitlab account, but recently I’ve
started to be concerned about gitlab’s business practices, so I prefer to start
migrating away from it. I’ve seen that they always send you to the login page when
you hit a 404, and other weird behaviours of that kind that don’t look like they
happened by accident<sup id="fnref:gitlab"><a class="footnote-ref" href="#fn:gitlab">2</a></sup>. Of course, there’s also the fact that they are a
startup and all that. I don’t really trust them. But that’s a different story.
I still like the fact that their community edition is free software. It’s a
business model we should see more of.</p>
<p>Gitea and Gogs are easy to install, which is a must for me, and they are simple
and useful, but they replicate the same model. It’s much better to self-host your
code than to rely on a third party, no doubt. But that makes the login problem
even harder: the more Gitea or Gogs instances we create, the more separate places
there are to register in<sup id="fnref:forgefed"><a class="footnote-ref" href="#fn:forgefed">3</a></sup>.</p>
<p>Those free software tools solve the problem of the centralization in different
scales, but they are small social networks, still.</p>
<h3>Possible replacements</h3>
<p>I’m ok with sending an email and I’m ok with receiving emails from people.</p>
<p>With git’s email workflow (<code>git send-email</code>, or <code>git format-patch</code> if you don’t
want to configure your connection to your email account) you can send patches
via email that can be applied to the code directly. That’s more than enough for
many projects.</p>
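<p>As a sketch of that workflow (the address and file name below are made-up examples): prepare the patch with <code>git format-patch</code> and, if you have an <code>[sendemail]</code> section in your git config, mail it directly with <code>git send-email</code>:</p>
<pre class="highlight"><code class="language-bash"># Turn the last commit into a mailable patch file (works offline):
$ git format-patch -1 HEAD
0001-some-change.patch
# With your email account configured, send it to the maintainer:
$ git send-email --to=maintainer@example.com 0001-some-change.patch
</code></pre>
<p>On the other side, the maintainer applies it with <code>git am 0001-some-change.patch</code>, keeping authorship and commit message intact.</p>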
<p>Issues, suggestions and questions can be sent via email with no hassle, too.</p>
<p>The possibility to clone the repositories via git protocol gives people the
chance to check the code freely in their editor of choice, without being
tracked while they browse<sup id="fnref:editor"><a class="footnote-ref" href="#fn:editor">4</a></sup>.</p>
<h3>Problems</h3>
<p>Issue management makes perfect sense to me, and it’s a process that is cool to
have in the open. It helps people take part in projects, check the status of
the project, collaborate more effectively, share the bugs they find
and so on. But, for the kind of projects I have, issue management is more of a
problem than a solution. I’ve been receiving spam in my gitlab issues for a
while. To be honest, there has been more spam than real issues in my gitlab account.</p>
<p>There isn’t any easy way to fully replace an issue management tool, though.
Maybe a good use of the <code>README.md</code> and some extra files in the repository can
help. People are still able to reach out and share their bug reports via email
without being publicly exposed.</p>
<p>That’s also a thing: if you let people interact freely on an issue board you
need some moderation (which requires skills and effort). It is true that people
may come up with very interesting ideas when working together, but it’s also true
that this only happens in very popular projects<sup id="fnref:popularity"><a class="footnote-ref" href="#fn:popularity">5</a></sup>. Handling that privately
helps to avoid misunderstandings you can’t control.</p>
<p>Apart from that, we have to admit that sharing repositories only via the git
protocol has exactly <code>0</code> discoverability, so we have to list them on a website or
something. Maybe not for interacting with them, but at least to show them.</p>
<p>Git is able to serve a website via gitweb too, but it’s basic, a little bit
hard to configure and not too fast. Also, it could be more visually appealing by default.</p>
<p>On the owner’s side, it’s interesting to be able to decide which repositories
you want to share with the public. Being able to give permissions to specific
people without giving them permissions to the whole server is also nice. And if
the permissions can be set on specific branches of the repositories, even better.</p>
<h3>Other option</h3>
<p><a href="https://fossil-scm.org/home/doc/trunk/www/index.wiki">Fossil-scm</a> is really
interesting. It comes with support for issues and wikis, and devnotes are a
great idea I’m sure I could take advantage of.</p>
<p>But the tool itself is not as good as git in my opinion.</p>
<p>Fossil uses SQLite databases for its things (it’s developed by SQLite’s
developers), which is cool sometimes but at other times not as good as it
sounds. Maybe I’m getting too used to plain text files?</p>
<p>I tried to configure a multi-repository fossil for the server in the past and
gave up, but that’s probably my fault rather than theirs.</p>
<p>If you are interested in trying something new, you should take a look at
fossil. If you do, please, <a href="https://ekaitz.elenq.tech/pages/about.html">contact
me</a> and tell me about your experience
with it.</p>
<h3>My solution</h3>
<p>For the permissions I used Gitolite, an authorization layer that
makes heavy use of ssh. It uses a management repository where the administrator
can add users’ public keys and each project’s permissions and metadata.</p>
<p>It basically creates ssh pseudo-sessions that are locked into gitolite-shell,
which decides whether the user has access to the repo or not. An interesting use
of ssh. <a href="https://gitolite.com/gitolite/overview.html">Read more on the website</a>, they explain it much better
than I can.</p>
<p>For the website I chose <code>cgit</code>, which is famous for being fast (cached by
default) and reliable, and turned out to be easy to configure.</p>
<p>Both projects are on the order of a few thousand lines of code, which is an
amount I could manage to read and edit if I wanted to.</p>
<h3>How to configure</h3>
<p>Well, this is the reminder for myself, but it can be useful for you too.</p>
<p>I installed both projects from debian’s package repository.</p>
<h4>Gitolite</h4>
<p>By default, the debian package creates a <code>gitolite3</code> user, so you have to take
that into account if you want to make gitolite work on a debian machine (other
machines will have other details to check).</p>
<p>Gitolite’s debian package also asks for the administrator’s public ssh key, so
you have to provide it sooner or later. Once that’s done you’ll get a fantastic
<code>/var/lib/gitolite3</code> folder with everything you need. You’ll see that folder
contains a <code>projects.list</code> file, which lists the git repositories, a
<code>repositories</code> folder with the repositories, a <code>.gitolite</code> folder and a
<code>.gitolite.rc</code> file. The last one needs some changes in order to work correctly
with cgit:</p>
<h4>Enable cgit access to the repos</h4>
<p>Set <code>.gitolite.rc</code><span class="quo">‘</span>s <code>UMASK</code> to <code>0027</code> to give group access to new repositories;
that will let other users in the group (cgit and git-daemon) read the repositories.</p>
<p>You probably don’t want to share the gitolite-admin repository so leave it with
the permissions it came with. If you screw up here or there don’t be afraid to
<code>chmod</code> any repository later.</p>
<p>You also need to make <code>GIT_CONFIG_KEYS</code> more permissive (<code>.*</code> if you are crazy
enough) if you want Gitolite to be able to pass git configuration along. That way
you’ll be able to set the gitweb description in the repository’s config, which cgit can read.</p>
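<p>For reference, the relevant lines end up looking something like this inside the <code>%RC</code> hash of <code>/var/lib/gitolite3/.gitolite.rc</code> (the pattern here is a restrictive example: it only allows the <code>gitweb.*</code> keys cgit reads):</p>
<pre class="highlight"><code class="language-perl"># inside %RC = ( ... ):
UMASK             =>  0027,
GIT_CONFIG_KEYS   =>  'gitweb\..*',
</code></pre>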
<h4>Enable git unauthenticated clone</h4>
<p>There are a couple of ways to do this. The first is to set the <span class="caps">HTTP</span> mode, which
is something I didn’t do, but you can check how to do it in the docs.</p>
<p>I used git-daemon for git based unauthenticated clones. It’s simple but you
may need to create your own systemd service or something:</p>
<pre class="highlight"><code class="language-systemd-service"># git.service file
[Unit]
Description=Start Git Daemon
[Service]
ExecStart=/usr/bin/git daemon --base-path=/var/lib/gitolite3/repositories --reuseaddr /var/lib/gitolite3/repositories
Restart=always
RestartSec=500ms
StandardOutput=syslog
StandardError=syslog
SyslogIdentifier=git-daemon
User=gitdaemon
Group=gitolite3
[Install]
WantedBy=multi-user.target
</code></pre>
<p>Once you do that you should register it with systemd using <code>systemctl enable
/path/to/git.service</code> or something like that. Once added, you can start it.</p>
<p>But that’s not going to show any repository, because you didn’t export any. If
you want to export them, Gitolite has a specific configuration option you have
to set in the <code>gitolite-admin</code> repo. You have to give the user <code>daemon</code> read access:</p>
<pre class="highlight"><code class="language-perl">repo testing
# daemon is what adds the daemon-export
R = daemon
# You should add some extra people too...
# This is for cgit:
config gitweb.description = For testing purpose.
config gitweb.owner = Ekaitz
</code></pre>
<p>When you add the <code>daemon</code> access, Gitolite adds a <code>git-daemon-export-ok</code> file to
the repository that tells git-daemon the project can be shared. It won’t be
possible to push to it anyway, because we didn’t allow that in the git-daemon configuration.</p>
<h4>cgit</h4>
<p>Some cgit configuration does the rest. This is my example configuration on
cgit. I’ll probably change it soon, but there it goes:</p>
<pre class="highlight"><code class="language-bash"># cgit config
# see cgitrc(5) for details
css=/cgit.css
logo=/cgit.png
footer=/usr/share/cgit/footer.html
repository-sort=age
# if cgit messes up links, use a virtual-root.
# For example, cgit.example.org/ has this value:
virtual-root=/
clone-url=git://$HTTP_HOST/$CGIT_REPO_URL
# gitolite3@$HTTP_HOST:$CGIT_REPO_URL
enable-index-links=1
enable-index-owner=1
enable-git-config=1
enable-gitweb-owner=1
remove-suffix=1
# Readmes to use
# readme=README.md
# you can set more of them here like README.rst and stuff, but all of them
# require some rendering I didn't want to configure.
# Set title and description
root-title=ElenQ Technology
root-desc=Software repository for ElenQ
root-readme=/usr/share/cgit/root-readme.html
project-list=/var/lib/gitolite3/projects.list
scan-path=/var/lib/gitolite3/repositories
# Mimetypes
mimetype.gif=image/gif
mimetype.html=text/html
mimetype.jpg=image/jpeg
mimetype.jpeg=image/jpeg
mimetype.pdf=application/pdf
mimetype.png=image/png
mimetype.svg=image/svg+xml
</code></pre>
<p>But cgit is still unable to see the projects, because it’s not part of the
<code>gitolite3</code> group. Make it part of the <code>gitolite3</code> group with <code>usermod</code> or something.</p>
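<p>Something like this does the trick on debian, assuming cgit runs through fcgiwrap as <code>www-data</code> (check which user your setup actually uses):</p>
<pre class="highlight"><code class="language-bash"># Let the cgit/fcgiwrap user read gitolite's repositories:
$ sudo usermod -aG gitolite3 www-data
# Verify the group membership:
$ groups www-data
</code></pre>
<p>Remember the group only has read access to repositories created after the <code>UMASK</code> change, so <code>chmod</code> the older ones if needed.</p>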
<p>Also, cgit is a web application you have to plug into your existing setup. I
have an nginx-based config, so I need to add cgit to it. Cgit can work with uWSGI
or fcgiwrap; I chose the latter for no real reason:</p>
<pre class="highlight"><code class="language-nginx">server {
listen 80;
listen [::]:80;
server_name git.elenq.tech;
root /usr/share/cgit;
try_files $uri @cgit;
location @cgit {
include fastcgi_params;
fastcgi_param SCRIPT_FILENAME /usr/lib/cgit/cgit.cgi;
fastcgi_param PATH_INFO $uri;
fastcgi_param QUERY_STRING $args;
fastcgi_param HTTP_HOST $server_name;
fastcgi_pass unix:/run/fcgiwrap.socket;
}
}
</code></pre>
<p>Also you may be interested on <span class="caps">HTTPS</span> support, but you know how to add that
(certbot does, and it’s not hard to do).</p>
<h3>Closing words</h3>
<p>Now it’s live at <a href="https://git.elenq.tech">https://git.elenq.tech</a>. If you were
wondering, cloning and pushing from there is crazy fast, and the server that hosts
it is the cheapest server possible. It’s much faster than github, or at
least that’s my impression.</p>
<p>So yeah… That’s most of it. </p>
<p>I just wanted to share some thoughts about software development workflow and
find an excuse to write down my configuration, since I had trouble finding any
explanation that put together all the points I needed.</p>
<p>And I think I did, didn’t I?</p>
<p>Stay safe.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:powerful">
<p>There’s no power for free. Powerful also means resource-intensive. <a class="footnote-backref" href="#fnref:powerful" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:gitlab">
<p>cHeCKiNg YOur brOWsER bEfOrE AcCeSsIng gitLAB.cOm. <a class="footnote-backref" href="#fnref:gitlab" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:forgefed">
<p>Maybe not. Forgefed is doing a good job:
<a href="https://forgefed.peers.community/">https://forgefed.peers.community/</a> <a class="footnote-backref" href="#fnref:forgefed" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:editor">
<p>It also depends on the editor you choose. Choose wisely. <a class="footnote-backref" href="#fnref:editor" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
<li id="fn:popularity">
<p>If a project I make reaches that kind of popularity, I’ll open
a tool for that kind of discussion or maintain a mirror somewhere else. <a class="footnote-backref" href="#fnref:popularity" title="Jump back to footnote 5 in the text">↩</a></p>
</li>
</ol>
</div>ElenQ Donations — Chibi Scheme2020-05-31T00:00:00+03:002020-05-31T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2020-05-31:/donations-chibi-02.html<p>Donation to Chibi Scheme programming language</p><p>In a previous post I already talked about why I consider it important to donate
money or time to Free Software projects.</p>
<p>This time I want to talk about my recent contributions to Chibi Scheme’s
standard library.</p>
<p>Chibi Scheme is an <span class="caps">R7RS</span> scheme implementation that was designed to work as an
extension or scripting language. It’s just a small library you can embed.
That’s similar to Lua, but with <em>a lot</em> of parentheses<sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup>.</p>
<p>For those that are not familiar with Scheme: it’s just a programming language
you should read about. You’ll probably discover all those new cool things you
have in your programming language of choice are not that new<sup id="fnref:2"><a class="footnote-ref" href="#fn:2">2</a></sup>.</p>
<p>There’s a detail I’d like to talk about, though. Contrary to other programming
language definitions or standards, Scheme’s <span class="caps">R7RS</span> report is something you can
read yourself. It’s less than 100 pages long<sup id="fnref:3"><a class="footnote-ref" href="#fn:3">3</a></sup>.</p>
<p>If you combine that with the design decisions that Alex Shinn (who also took
part on the <span class="caps">R7RS</span> definition) took on Chibi Scheme, you end up with a <em>simple</em>
programming language you can actually read.</p>
<p>That’s important.</p>
<p>You might wonder why you should care about the readability of a programming
language if you are just a user. The answer is simple too: free software relies
on the fact that you can audit and improve or customize it. If you are not able
to read it you can’t exercise your rights by yourself, and you’ll always need to
rely on someone else. That’s not intrinsically bad; it’s the only solution
that non-programmer users have. Programmers need to trust other people in other
things as well, so that’s not a major issue.</p>
<p>Problems come when projects get so complicated —and I mean
millions-of-lines-of-code complicated here— only large companies have enough
resources to tackle the task of editing the code. In those cases, software is
not really free anymore, because <em>in practice</em> you are unable to use your
rights and you can’t afford to find someone else to do it for you.</p>
<p>We started to get used to that, though.</p>
<p>Something I learned as a sculptor is that the tools that fit you best are those
that you made, you hacked or you got used to. As programmers, we are supposed
to know how to program, so we are supposed to be able to make and hack our
tools, but we are choosing to get used to the ones that others built.</p>
<p>The first step to control your workspace, and consequently your own job is to
control your tools<sup id="fnref:4"><a class="footnote-ref" href="#fn:4">4</a></sup>.</p>
<p>I’d love to say those are the reasons why I use Chibi Scheme, but that’s not
<em>totally</em> true. I don’t know why I use it. I just <em>like it</em>.</p>
<p>Anyway, the other day I realized Chibi Scheme’s <span class="caps">JSON</span> parser was unable to parse
Spanish accents so I was unable to control <a href="http://en.goteo.org/project/elenq-publishing">ElenQ
Publishing’s</a> book’s metadata
correctly. That’s a problem.</p>
<p>As the language is simple, I was able to read the standard library and propose
a change that would let the <span class="caps">JSON</span> parser use <span class="caps">UTF</span>-8 characters.</p>
<p><a href="https://github.com/ashinn/chibi-scheme/pull/643">https://github.com/ashinn/chibi-scheme/pull/643</a></p>
<p>During the process I checked <a href="https://github.com/python/cpython/blob/master/Lib/json/decoder.py#L117">CPython’s <span class="caps">JSON</span> parser
implementation</a>
and realized I could do better by adding surrogate pair support. So I decided
to add that too.</p>
<p><a href="https://github.com/ashinn/chibi-scheme/pull/644">https://github.com/ashinn/chibi-scheme/pull/644</a></p>
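<p>For context, a surrogate pair is how <span class="caps">JSON</span> escapes characters outside the Basic Multilingual Plane: you subtract <code>0x10000</code> from the code point and split the remaining 20 bits into two halves of 10 bits each. A quick sanity check for U+1F600 (the grinning face emoji), using plain shell arithmetic:</p>
<pre class="highlight"><code class="language-bash"># Split U+1F600 into its JSON surrogate pair:
$ cp=$((0x1F600 - 0x10000))
$ printf '\\u%04X\n' $((0xD800 + cp / 1024))   # high surrogate
\uD83D
$ printf '\\u%04X\n' $((0xDC00 + cp % 1024))   # low surrogate
\uDE00
</code></pre>
<p>So <code>"\uD83D\uDE00"</code> in a <span class="caps">JSON</span> document decodes back to that single character, and a parser has to recombine the pair instead of treating the halves as two separate characters.</p>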
<p>Once my changes were merged, I realized it was a good idea to keep going and
add a <span class="caps">JSON</span> encoder, which hadn’t been developed yet. So I did.</p>
<p><a href="https://github.com/ashinn/chibi-scheme/pull/648">https://github.com/ashinn/chibi-scheme/pull/648</a></p>
<p>While I was testing my <span class="caps">JSON</span> encoder I realized there was an issue with floating
point numbers in the <span class="caps">JSON</span> parser. So I fixed that too.</p>
<p><a href="https://github.com/ashinn/chibi-scheme/pull/647">https://github.com/ashinn/chibi-scheme/pull/647</a></p>
<p>I also fixed some random indentation issue I found:</p>
<p><a href="https://github.com/ashinn/chibi-scheme/pull/646">https://github.com/ashinn/chibi-scheme/pull/646</a></p>
<p>I didn’t really need to do all of that, but I did it anyway. I just wanted
to keep Chibi Scheme healthy while opening the door to some future
contributions. Now I have a little more control over my tooling, and I feel
more comfortable with the fact that I might need to make some changes to
Chibi’s code in the future.</p>
<p>It doesn’t need to be perfect, either. I’m sure it isn’t, because I hadn’t
written C code since I was at university and I had zero experience working on
Chibi-Scheme’s internals. My code was just enough to make the features happen;
now, with Alex’s changes, the code is running fine and <strong>everyone</strong> can benefit
from this.</p>
<p>So, the message I take from this can be summarized in these points:</p>
<ul>
<li>Use tools you can read and edit like Chibi Scheme or even CPython, which is a
large codebase but is surprisingly easy to read.</li>
<li>Programming languages (or their stdlib) should never be considered something
untouchable. Touch them, change them, make them fit your needs.</li>
<li>Don’t be afraid of tackling something that may seem hard on the first look.</li>
<li>You don’t have to be perfect.</li>
<li>Spend time and energy on stuff that matters.</li>
</ul>
<p>Hope all this —the post and the code— is useful.</p>
<p>Being useful is the greatest goal of an engineer, after all.</p>
<p>Take care.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>And no <code>end</code>-s. Less typing for sure: good for your hands. <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:2">
<p>And maybe not that cool neither. <a class="footnote-backref" href="#fnref:2" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:3">
<p>I’m not going to talk about the implications of that fact. It’s obvious
there must be some kind of trade-off comparing to other standards that are
more than one thousand pages long. I’ll just recommend you to read it, it’s
pretty good: <a href="https://small.r7rs.org/attachment/r7rs.pdf">https://small.r7rs.org/attachment/r7rs.pdf</a> <a class="footnote-backref" href="#fnref:3" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:4">
<p>In fact, that’s the second. I’m supposing we all know what we are doing. <a class="footnote-backref" href="#fnref:4" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
</ol>
</div>ElenQ Donations — Intro + GNU Guix2020-05-25T00:00:00+03:002020-05-25T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2020-05-25:/donations-guix-01.html<p>Recent ElenQ Technology donation to the great <span class="caps">GNU</span> Guix package manager
and software distribution</p><p>I consider my work part of my responsibility to make this world a better place,
so since the early days of the company I decided to donate as much as I
could to the free software projects I was using for my work, in order to help
the ecosystem be sustainable.</p>
<p>Many times, free software projects that are extensively used by companies
are considered just <em>free</em> products that don’t carry any kind of responsibility
with them. It is fine to use free software for your own goals (that’s
freedom 0), but it’s not morally acceptable to base your business model on a
project that independent developers made with no funds (or with very low funds)
and not even consider helping them.</p>
<p>We already <a href="https://www.propublica.org/article/the-worlds-email-encryption-software-relies-on-one-guy-who-is-going-broke">had cases</a> of free software developers keeping their
projects running at their own expense while the whole <del>fucking</del>
world just <em>uses</em> the software they make without thinking about their conditions.</p>
<p>ElenQ Technology has been founded by a not-very common individual. That’s obvious.</p>
<p>Sadly, sometimes ElenQ Technology simply can’t afford to donate a part of our
income<sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup>. But I can code.</p>
<p>I can always carve some time out of my projects and my free time to
support free software projects that make <del>my</del> our life easier.</p>
<h3><span class="caps">GNU</span> Guix</h3>
<p><a href="https://guix.gnu.org"><span class="caps">GNU</span> Guix</a> is one of those projects. I started using it a
couple of months ago as a package manager and now I moved to the full software distribution.</p>
<p>For those who don’t know Guix yet, it’s a package manager and a software
distribution like Nix and NixOS are. They are based on the same principle and
have the same core.</p>
<p>The innovation they bring is a transactional package manager that eases
rollbacks and the creation of isolated environments. In the case of the software
distribution, the whole system can be described by an easy-to-write file that
can also be version controlled, so you can always recover an older configuration if
you need to.</p>
<p>All the packages and system descriptions are defined in code. In Nix, they are
defined in the Nix programming language (a <span class="caps">DSL</span> for that purpose). In Guix, they
are defined in Guile (Scheme), a general-purpose programming language.</p>
<p>As my work at ElenQ forces me to visit many different codebases and use a wide
variety of software in short-term projects, Guix is very handy for me. I can
create a new isolated environment, code in it and, once I’m done, remove it
from my system in the cleanest way.</p>
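<p>As a sketch of that workflow (written against the Guix of that time, where the command was <code>guix environment</code>; newer releases ship <code>guix shell</code> for the same job):</p>
<pre class="highlight"><code class="language-bash"># Throwaway environment with some tools available, without
# touching my user profile:
$ guix environment --ad-hoc python gcc-toolchain
# ...hack away: the tools only exist inside this sub-shell...
$ exit
# Back outside nothing remains; reclaim the disk space if you want:
$ guix gc
</code></pre>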
<p>Also, package definitions are easy and straight to the point, so I can package
anything I want with just a few lines of code.</p>
<p>It’s an interesting project for system administrators too. Machines are easy to
replicate with it and it’s easy to go back if you screwed up in the configuration.</p>
<p>Further than that, they are working really hard on reproducible builds and the
chain of trust that modern software needs.</p>
<p>You should check the project yourselves better for more detailed info.</p>
<h3>So what?</h3>
<p>Since I use the project I’ve started to take part in it, packaging new code and
sending simple patches. The more I get involved in it, the more I will do. I’m not
really used to Guix yet, so I haven’t dug into the code deeply enough and I’m not
able to code very complex stuff in it.</p>
<p>At the moment, I’m trying to package <strong>Meshlab</strong>, a 3D mesh editing software
you’ve probably heard about.</p>
<p>For that I packaged (already merged) <code>openctm</code>, <code>lib3ds</code> and <code>openkinect</code> (in
its three flavors: C/C++ lib, python bindings and open-cv bindings). And during
that time I also found a couple of details I could improve and I made some
patches for them too.</p>
<p>In the past I also contributed a few package patches, to the <code>chicken</code> and
<code>chibi</code> scheme implementations and the <code>kitty</code> terminal emulator. You can find all
of them by searching my name in the issue board at the following link:</p>
<p><a href="https://issues.guix.gnu.org">https://issues.guix.gnu.org</a></p>
<p><span class="caps">GNU</span> Guix is not a very big project and it doesn’t have a large userbase that
can help it grow fast and reliably, so it needs some extra help, from me and
surely from you. They have been a very welcoming community, so I encourage you
to take part if you are interested in it.</p>
<p>I hope this helps to spark your interest in helping any project you like, and
maybe in pressuring your company to spend some resources on the projects it uses.</p>
<p>Stay safe.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>You can change that <a href="mailto:hello@elenq.tech">hiring us</a>. <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
</ol>
</div>ElenQ Publishing2020-02-18T00:00:00+02:002020-02-18T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2020-02-18:/elenq-publishing.html<p>ElenQ Publishing is a technical book publishing project that aims to open
the door of technical knowledge to everyone.</p><p>Hi all,</p>
<p>On the 18th of February 2020 the crowdfunding campaign for ElenQ Publishing
started, and I’d like to talk a bit about it.</p>
<p><a href="http://en.goteo.org/project/elenq-publishing">http://en.goteo.org/project/elenq-publishing</a></p>
<h2>The platform</h2>
<p>First of all, I want to talk to you about the crowdfunding platform:
<a href="http://goteo.org">http://goteo.org</a>.</p>
<p>Goteo is a platform for social crowdfunding that aims to support projects with a
social goal. The software it runs is Free Software licensed under the Affero
<span class="caps">GPL</span> license, meaning that if you want to make your own crowdfunding platform you can
use the code that Goteo shares, as long as you provide the <del>users</del>
people with the source code that is running on your platform (server and client).</p>
<p>The Goteo Foundation, the maintainers of the code and the people behind goteo.org,
fund themselves with 5% of the crowdfunded money from the campaigns, a fair
price for their services. They also receive some help from different government
entities, like Barcelona’s local government or the Spanish Education, Culture and
Sports Ministry, because of their social impact.</p>
<p>At least in Spain, people who take part in the campaigns run on goteo.org
have the chance to declare that they donated money for social goals and get some
money back in their tax returns.</p>
<p>For those who want to run a campaign, Goteo reviews it and gives feedback, they
make campaign management easier and they are really focused on being
multilingual. They support translations for the campaigns, and the <span class="caps">UI</span> is available
in many languages, including all the Spanish regional languages and some
extra ones.</p>
<p>This platform is perfectly aligned with the philosophy of ElenQ Technology.</p>
<h2>The story</h2>
<p>I don’t talk about it enough, but I’ve been teaching informatics-related topics
since I started ElenQ Technology. In these 3 years I have given many courses:
introductory python, advanced python, data analysis, web scraping, bitcoin
and blockchain<sup id="fnref:blockchain"><a class="footnote-ref" href="#fn:blockchain">1</a></sup>, introductory clojure… And some more I can’t
remember at the moment. All of those were given in different contexts, from
courses for young unemployed people to courses for engineers in research
centers. Also, it looks like I’m going to keep teaching, because I like it and
the students say I’m good at it.</p>
<p>But this is not good enough. It helps me make a living, but it’s not enough.
I want to do my best to correct many of the issues I found on this 3-year journey.</p>
<p>I realized I have a tested course structure and materials I want to share.</p>
<p>I realized many people’s English level is not good enough to learn
technology by themselves. They need someone like me to serve as a bridge.
They are isolated from knowledge because of the place they come from and the
culture they have.</p>
<p>I realized all the technical publications I was reading were written from the
same perspective on technology. That made sense, because the authors all
came from the same context. I would like to have more diverse people writing
about technology, and the only way to get there is to make technology more accessible.</p>
<p>I realized that, in my local context, access to knowledge is broken in many
ways that, it seems, nobody is willing to change:</p>
<ul>
<li>
<p>In my area, government-backed courses only focus on groups of people who are
likely to get a job soon anyway. This way the government can say the jobs were
obtained thanks to the money invested in the courses, and win elections
with that. Young people who finished university this year are likely to get
a job in the next year. This doesn’t mean they don’t need the
course<sup id="fnref:course"><a class="footnote-ref" href="#fn:course">2</a></sup>, but the course itself won’t really affect their
employability. What about people with <strong>real</strong> employability
problems<sup id="fnref:employability-problems"><a class="footnote-ref" href="#fn:employability-problems">3</a></sup>?</p>
</li>
<li>
<p>Some people have individual problems that don’t fit the goals of social
campaigns run by the government or other entities, because those campaigns focus
on large groups of society with similar problems, not on the individual.
It makes sense, because they can help more efficiently that way, but the net
has some holes we should repair.</p>
</li>
<li>
<p>There is no structural support for people who just want to learn new things
with no further intention. Universities are drifting. Now they are just places
where you get a paper that helps you get a job, but they are not fulfilling the
goals of knowledge they should. They are not places where you find knowledge
anymore. Many people don’t want to think or learn, but some do, and we are
preventing them from doing it.</p>
</li>
<li>
<p>In other places the problem of education is even worse and they don’t have
resources (or will<sup id="fnref:will"><a class="footnote-ref" href="#fn:will">4</a></sup>) to solve it. Individuals shouldn’t suffer from
that. It’s our responsibility to help everyone have all the chances to
develop themselves as much as they want, regardless of their context.</p>
</li>
</ul>
<p>In general, all these points can be summarized in one: knowledge should be free
(libre). If it’s not free it’s not knowledge, it’s just something that makes you
more powerful than others: it’s injustice.</p>
<p>I realized many of these things could be solved with a good repository of
knowledge in different languages, and, as I teach and I like to
write, I thought it would be interesting to work a little more on the notes I give
my students and turn them into a book.</p>
<p>With <a href="https://ekaitz.elenq.tech/templates-released.html">a little bit of
effort</a>, I can make a book
that can be published on the web, as a physical book and as an easy-to-print <span class="caps">PDF</span>
that anyone can print and copy at a local print shop. I can make it reach any
place in the world.</p>
<p>Not only that. Some people designed a license that lets others create new
content on top of what I did and requires them to share what they did under the
same license: Creative Commons.</p>
<p>So, with some effort and some funding (and a smile on my face) I can create a
publishing project where I gather all the knowledge that my job makes me deal
with, and I can share it in a way that is ethical and respects everyone’s circumstances.</p>
<p>This is something I want to try. It’s something I <em>need</em> to try.</p>
<h2>The campaign</h2>
<p>That’s why I’m trying it.</p>
<p>This campaign is the first attempt to make this happen. If it’s successful, it
will let me spend some of my time giving love to the content I want to
release, and it will let me talk with people who have knowledge in areas I don’t
and help them publish it.</p>
<p>The campaign offers physical books, but they are just a vehicle for
publishing the content in a way that is easy to share. The physical objects are
just a way to raise funds.</p>
<p>The main goal is not just to make books: it’s the creation of an infrastructure
to share knowledge, one that I can use for the things I research but that can
also be used for things other people research. All the content is going to be
published raw in a repository that anyone can audit, review, improve or
use as the base for a new project.</p>
<p>Once the infrastructure is ready, publishing new books should be a piece of
cake. This first project is going to teach us how to handle the paperwork for the
<span class="caps">ISBN</span> and the book registration, and it is going to give us time to create the
website where the content is going to be stored. Once all those points are
ready, the rest is “just” writing and publishing.</p>
<p>The campaign’s goals are separated in two levels: the minimum and the optimum.</p>
<p>The minimum is the publishing of the books in Spanish, my mother tongue,
and it covers all the infrastructure costs for that. This way Spanish-speaking
people will have at least some technical books in their language.</p>
<p>The optimum ensures the publishing in English<sup id="fnref:english"><a class="footnote-ref" href="#fn:english">5</a></sup>. This goal enables
translation into other languages, because many people probably know either
Spanish or English, two of the most widely spoken languages in
the world, and that way they would be able to translate the books into their
mother tongue to help their own community. I’m not able to supervise a
translation into a language I don’t know, but I am able to produce reliable
material in both of these languages for people to build on.</p>
<p>As you see, the goals have a really interesting point of contradiction: I want
to provide technical material for people who don’t speak English, yet I’m
producing it in English to get there. Funny, huh?</p>
<p>I’m just trying to be as practical as I can.</p>
<p>There are many points I’d like to consider, many other translations I’d love to
do, but I need to focus my effort on something useful in the short term because
if it happens to be useful it’s going to push me to keep working on it in the
future, providing more and more material and translations.</p>
<h2>The feelings</h2>
<p>I’ve liked the idea of crowdfunding ever since I heard about it, and I’ve been
planning to run a campaign for years. Electronic devices, software, miniature games
and collections… Everything was a candidate for a crowdfunding campaign in my
mind, but I didn’t want to disappoint the patrons, so I never started one.</p>
<p>This time I think it’s possible to provide good material. The content is
already defined and tested in my courses, it only needs to be updated; the
goals are simple and doable; and I have good people around me who are
going to help me with everything.</p>
<p>This project helped me connect with the people I love and that’s one of the
best things in life. I know everything is going to be fine with their help and support.</p>
<p>Since I started with ElenQ Technology I have had the chance to meet many good people
with incredible skills and love in their hearts. This project is somehow the
result of the love they gave me, because it filled me with the courage I needed.</p>
<p>Just wanted to express my gratitude<sup id="fnref:gratitude"><a class="footnote-ref" href="#fn:gratitude">6</a></sup>.</p>
<p>Thank you all.</p>
<hr style="border-width: 1px; height: 4em;">
<h4>If you want to take part…</h4>
<p>There are many ways you can help, but crowdfunding campaigns make it look like the only
one is giving money, and that’s not true. Also, there are many ways to give
money, and some are more effective than others.</p>
<p>For people who are not able to provide funds but want to take part, there are
also very helpful tasks:</p>
<ul>
<li>
<p>Sharing the campaign with people, communities, universities, libraries and so
on that might be interested is always good.</p>
</li>
<li>
<p>Once the contents are released in our repository, reviewing the content or
improving it will help us a lot.</p>
</li>
<li>
<p>Translating the content to other languages will help other communities we
can’t directly help. Your language skills are valuable.</p>
</li>
<li>
<p>Your love and support are always welcome and help us keep up the good work.</p>
</li>
</ul>
<p>For the people that want to provide monetary help, there are points to consider:</p>
<ul>
<li>
<p>The best way to help is to give money without asking for any physical good
(the first reward is for that), because the books have an associated printing
and shipping cost. They are going to be released online anyway, so if you
don’t really like the idea of owning a physical object, you can still take
part and get the result of the campaign.</p>
</li>
<li>
<p>The second best way is to give money and ask for the physical good(s). In the
case of this campaign, the more books you order, the cheaper their production is
(bulk orders and economies of scale, you know…).</p>
</li>
<li>
<p>One of the highest costs is shipping, so picking up the goods in
person<sup id="fnref:coffee"><a class="footnote-ref" href="#fn:coffee">7</a></sup> (Bilbao) or grouping shipments reduces the costs and makes
the donation more efficient. It’s better for us if a group of friends makes
just one big order instead of many small ones.</p>
</li>
</ul>
<p>That said, here goes the link to the campaign if you want to take part.</p>
<p><a href="http://en.goteo.org/project/elenq-publishing">http://en.goteo.org/project/elenq-publishing</a></p>
<p>Thank you.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:blockchain">
<p>Don’t judge me too fast: the course was a technical explanation
of every detail of how bitcoin works. I wanted students to learn the
cryptography behind it and all the good design ideas bitcoin has, while I tried
to make them critical of blockchain technology during the
blockchain boom. <a class="footnote-backref" href="#fnref:blockchain" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:course">
<p>They need it, even more if it’s a course like mine where I talk to them about
working with ethics and being independent. :D <a class="footnote-backref" href="#fnref:course" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:employability-problems">
<p>Say: women, unemployed people in their 50s pushed
out of their jobs by
<a href="https://en.wikipedia.org/wiki/Deindustrialization">de-industrialization</a>,
immigrants, people with disabilities, people who just got out of jail… <a class="footnote-backref" href="#fnref:employability-problems" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:will">
<p><span class="caps">USA</span>, I’m looking at you. <a class="footnote-backref" href="#fnref:will" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
<li id="fn:english">
<p>Don’t worry, I know my English is bad and I’m not going to
translate them, a professional service will (under my supervision for the
terminology and stuff). <a class="footnote-backref" href="#fnref:english" title="Jump back to footnote 5 in the text">↩</a></p>
</li>
<li id="fn:gratitude">
<p>I don’t know why… I’m like that I guess. <a class="footnote-backref" href="#fnref:gratitude" title="Jump back to footnote 6 in the text">↩</a></p>
</li>
<li id="fn:coffee">
<p>If you get the books in person I’ll take a coffee/tea with you. <a class="footnote-backref" href="#fnref:coffee" title="Jump back to footnote 7 in the text">↩</a></p>
</li>
</ol>
</div>ElenQ Publishing2020-02-18T00:00:00+02:002020-02-18T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2020-02-18:/elenq-publishing-es.html<p>ElenQ Publishing es un proyecto de publicaciones técnicas que pretende abrir
la puerta del conocimiento técnico a cualquiera.</p><p>Saludos,</p>
<p>Hoy 18 de febrero de 2020 ha empezado la campaña de ElenQ Publishing y me
gustaría hablar un poco sobre ello.</p>
<p><a href="http://goteo.org/project/elenq-publishing">http://goteo.org/project/elenq-publishing</a></p>
<h2>La plataforma</h2>
<p>En primer lugar, me gustaría destacar la plataforma en la que se ha publicado:</p>
<p><a href="http://goteo.org">http://goteo.org</a></p>
<p>Goteo es una plataforma para campañas de mecenazgo con un fondo social.
Funciona sobre código libre publicado con licencia Affero <span class="caps">GPL</span>, que permite la
reutilización y la extensión del mismo siempre y cuando el código de la
plataforma (tanto servidor como cliente) esté disponible para <del>los
usuarios</del> las personas que la usen.</p>
<p>La Fundación Goteo se encarga de mantener el código de goteo y de gestionar
goteo.org. La fundación se financia con el 5% del dinero obtenido por las
campañas, una cifra bastante justa por sus servicios, y por el apoyo que
reciben de entidades públicas como el Ayuntamiento de Barcelona y el Ministerio
de Educación, Cultura y Deporte de España.</p>
<p>Además, al menos en España y no sé si en otros países, las personas que
participen aportando dinero en campañas de Goteo pueden declararlo en su
Declaración de la Renta y desgravar por donación a fines sociales.</p>
<p>Para los que quieran hacer una campaña, la Fundación Goteo revisa el contenido
y aporta recomendaciones e ideas. Además, la plataforma está pensada para
aceptar varios idiomas y la interfaz está traducida a todos los idiomas
regionales de España y algunos otros más.</p>
<p>En general, encaja muy bien con la perspectiva ética de ElenQ Technology.</p>
<h2>La historia</h2>
<p>No hablo mucho sobre ello, quizás menos de lo que debería, pero he estado
dedicándome a dar cursos relacionados con la informática durante estos primeros
3 años de ElenQ Technology. He dado cursos de diversos temas: introducción a
python, python avanzado, bitcoin y blockchain<sup id="fnref:blockchain"><a class="footnote-ref" href="#fn:blockchain">1</a></sup>, clojure… Y
algunos más que no recuerdo. He trabajado en muchos contextos distintos desde
cursos para jóvenes en desempleo a cursos para ingenieros en centros de
investigación. Parece, además, que voy a seguir haciéndolo, porque es un
trabajo que disfruto y los alumnos suelen decirme que se me da bien.</p>
<p>Por mucho que me ayude a ganarme el pan, creo que esto no es suficiente. En mi
día a día veo muchos problemas que me gustaría resolver.</p>
<p>Me di cuenta de que tengo materiales y cursos ya probados que quiero compartir.</p>
<p>Me di cuenta de que la gente que no estudia estos temas por su cuenta no lo hace
por vagancia o porque no sea suficientemente inteligente. Muchos no
lo hacen porque tienen problemas con el inglés y necesitan a alguien como yo
que les sirva de puente. Están aislados por el lugar en el que nacieron o por
la cultura que tienen. Esto es inadmisible.</p>
<p>Me di cuenta de que las publicaciones técnicas que leía estaban escritas, en
general, desde la misma perspectiva. Tiene sentido, porque la mayor parte de
los autores parten del mismo contexto. Me gustaría tener unas publicaciones
técnicas más diversas y la única forma de conseguir esto es hacer que la
tecnología sea más accesible.</p>
<p>Me di cuenta de que en mi contexto local el sistema educativo tiene problemas
evidentes que parece que no hay ningún interés en resolver:</p>
<ul>
<li>
<p>En mi provincia, los cursos subvencionados sólo se centran en colectivos que
es probable que consigan un trabajo en el corto plazo. De este modo, los
políticos responsables pueden decir que consiguieron el trabajo gracias a
ellos y seguir ganando elecciones. Los jóvenes que acaban de terminar la
universidad conseguirán un trabajo en el corto plazo independientemente de
que realicen o no realicen cursos subvencionados. Esto no significa que no
los necesiten<sup id="fnref:cursos"><a class="footnote-ref" href="#fn:cursos">2</a></sup>, significa que no afectará a su empleabilidad. ¿Qué
pasa con quienes tienen <strong>verdaderos problemas</strong> para conseguir
empleo<sup id="fnref:problemas"><a class="footnote-ref" href="#fn:problemas">3</a></sup>?</p>
</li>
<li>
<p>Algunas personas tienen problemas individuales que no encajan en los
objetivos de las campañas sociales de gobiernos y otras entidades porque se
fijan en grandes grupos de personas con problemas similares y no en personas
individuales. Tiene sentido que lo hagan, porque de este modo su ayuda es más
eficiente, pero esta red de soporte tiene orificios que deberíamos resolver.</p>
</li>
<li>
<p>No hay soporte estructural para personas que simplemente quieren aprender por
el placer de hacerlo y no con un fin laboral. La universidad está derivando.
Hoy en día no es mucho más que un lugar que te da un papel con el que luego
es más fácil conseguir un trabajo, pero no está satisfaciendo las necesidades
intelectuales de la sociedad como debería. Ya no es un lugar donde encontrar
conocimiento. Muchas personas no quieren aprender ni pensar, pero otras sí
que quieren y les estamos impidiendo hacerlo.</p>
</li>
<li>
<p>En otros lugares el problema de la educación es incluso peor y no tienen
recursos (o voluntad<sup id="fnref:voluntad"><a class="footnote-ref" href="#fn:voluntad">4</a></sup>) para solucionarlo. Las personas no deberían
sufrir las consecuencias de un sistema que no funciona, sea por la razón que
sea. Es nuestra responsabilidad ayudar a todo el mundo a tener oportunidades
para desarrollarse tanto como quiera independientemente de su contexto.</p>
</li>
</ul>
<p>En general, todos estos puntos vienen a resumirse en uno: El conocimiento tiene
que ser libre. Si no es libre no se puede considerar conocimiento, es sólo algo
que me hace más fuerte que los demás: es injusticia.</p>
<p>Me di cuenta de que todas estas cosas pueden solventarse (o tratar de solventarse)
con un buen repositorio de conocimiento en varios idiomas y, ya que doy
clases y me gusta escribir, considero interesante dedicar algo más de tiempo a
los apuntes que entrego a mis alumnos y darles forma de libro.</p>
<p>Con <a href="https://ekaitz.elenq.tech/templates-released.html">un poco de esfuerzo</a>,
puedo crear un libro que se puede publicar en la web, en un libro físico y un
<span class="caps">PDF</span> fácil de imprimir y de copiar en una copistería cercana. Puedo hacer que
esto llegue a cualquier lugar del mundo.</p>
<p>No sólo eso. Alguien se ha tomado la molestia de crear una licencia que permite
que los contenidos que publique sean editados y mejorados siempre que el
producto resultante tenga la misma condición: Creative Commons.</p>
<p>Por tanto, con un poco de esfuerzo y un poco de presupuesto (y una sonrisa)
puedo crear un proyecto de publicación que albergue el conocimiento que a
diario me encuentro gracias a mi trabajo y poder así compartirlo de forma ética
y accesible, respetando las circunstancias de las personas que quieran consumirlo.</p>
<p>Esto es algo que, evidentemente, quiero intentar. Es algo que <em>tengo que</em> intentar.</p>
<h2>La campaña</h2>
<p>Es por eso que lo estoy intentando.</p>
<p>Esta campaña es el primer intento para hacer de esto una realidad. Si tiene
éxito, me permitirá dedicar un poco de tiempo a dar amor a los contenidos que
quiero publicar y me permitirá publicar el conocimiento de otros.</p>
<p>El objetivo de publicar libros físicos es sólo un vehículo para poder
publicarlos de modo que sea fácil de compartir. Los objetos físicos son sólo
una forma de conseguir los fondos.</p>
<p>La idea principal no es hacer unos libros, es crear una infraestructura para
compartir contenido que me permita compartir lo que investigo y pueda servir de
plataforma para que otros compartan lo que ellos investigan. Todo el contenido
será publicado en crudo en un repositorio que cualquiera pueda auditar,
revisar, mejorar o crear proyectos derivados desde éste.</p>
<p>Una vez que la infraestructura esté disponible, publicar nuevos contenidos será
extremadamente sencillo. El primer proyecto nos enseñará a tratar con el
papeleo necesario para conseguir los <span class="caps">ISBN</span>, el registro del libro, etc. y para
crear las herramientas necesarias para publicar (una web…). Una vez resueltos
estos puntos, el resto es “sólo” escribir y publicar.</p>
<p>Los objetivos de la campaña están separados en dos niveles: el mínimo y el óptimo.</p>
<p>El mínimo trata de publicar los libros en español, el idioma en el que pienso,
y cubre los gastos de infraestructura para estos. De este modo, todas las
personas de habla hispana podrán tener libros técnicos en su idioma.</p>
<p>El objetivo óptimo asegura la publicación de los libros en inglés<sup id="fnref:ingles"><a class="footnote-ref" href="#fn:ingles">5</a></sup> con
el fin de habilitar la traducción a otros idiomas. Muchas personas son capaces
de hablar, además de su propio idioma, inglés o español, ya que son dos de los
idiomas más hablados del mundo. De este modo, la probabilidad de que alguien
pueda tomar nuestras publicaciones y traducirlas a otros idiomas aumenta de
forma radical, facilitando así que ayuden a sus comunidades de una forma en la
que nosotros no estamos capacitados. Puedo hacer lo posible para ayudar en
traducciones a los idiomas que conozco, pero no tengo más alcance que ése.
Aportar una buena base en un idioma común permite que estas traducciones
espontáneas surjan de forma independiente.</p>
<p>Me encantan estas contradicciones: quiero aportar material para acabar con la
hegemonía del inglés y lo publico en inglés. Es gracioso. ¿Verdad?</p>
<p>Sólo intento ser lo más práctico posible y elegir qué batallas puedo librar.</p>
<p>Hay muchas cosas que me gustaría revisar, muchas traducciones que me gustaría
hacer, etc. pero necesito fijarme en lo que puedo aportar a corto plazo y, si
resulta útil, crecer desde ahí, ya que me dará fuerzas para seguir trabajando
en el futuro.</p>
<h2>Los sentimientos</h2>
<p>La idea de los crowdfunding lleva años en mi cabeza. Llevo años tratando de
hacer alguno: de electrónica, software, juegos y colecciones de miniaturas…
Todas mis aficiones eran candidatas a ser un crowdfunding, pero nunca me animé
a hacerlos porque no quería decepcionar a los mecenas. Me daba vértigo.</p>
<p>En esta ocasión creo que puedo aportar buen material. El contenido está probado
en mis cursos, sólo necesita actualizarse, los objetivos son simples y
realizables y estoy rodeado de buenas personas que me ayudan con todo.</p>
<p>Este proyecto me ha ayudado a conectar con las personas que quiero y eso es
algo maravilloso. Sé que todo va a salir bien con su ayuda y su apoyo.</p>
<p>Desde que empecé ElenQ Technology, he tenido oportunidad de crear vínculos con
gente maravillosa a la que tengo mucho respeto. Este proyecto es, de alguna
manera, el resultado del amor que me dan, porque me ha hecho superar el miedo
al fracaso. Con su ayuda, creo que puedo conseguir todo lo que me proponga.</p>
<p>Sólo quería expresar mi gratitud<sup id="fnref:gratitud"><a class="footnote-ref" href="#fn:gratitud">6</a></sup>.</p>
<p>Gracias.</p>
<hr style="border-width: 1px; height: 4em;">
<h4>Si quieres colaborar…</h4>
<p>Hay muchas formas de colaborar pero la realidad es que los crowdfunding dan la
impresión de que la única es la monetaria. Eso no es cierto. Además, algunas
formas de aportar fondos son más efectivas que otras.</p>
<p>Para las personas que quieran ayudar sin hacer aportaciones monetarias hay
unas tareas que serían muy útiles:</p>
<ul>
<li>
<p>Compartir la campaña con personas, comunidades, universidades y librerías que
puedan estar interesadas siempre es útil. Yo prefiero compartir con quien
pueda tener interés más que insistir de forma aleatoria en las redes sociales.</p>
</li>
<li>
<p>Una vez los contenidos estén disponibles en el repositorio, revisarlos y
mejorarlos nos ayuda mucho. La tarea de revisar libros es tediosa y aburrida
y cualquier ayuda que podamos tener ahí será bienvenida.</p>
</li>
<li>
<p>Traducir el contenido a otros idiomas es importante. Nosotros a título
personal no podemos ayudar en esa tarea, si lo haces, ayudas a tu comunidad
de forma directa.</p>
</li>
<li>
<p>Tu amor y tu apoyo es siempre bienvenido y nos ayuda a tener fuerzas en los
momentos donde el trabajo se acumula.</p>
</li>
</ul>
<p>Para los que quieran y puedan permitirse una ayuda monetaria, hay algunos
detalles a considerar:</p>
<ul>
<li>
<p>La mejor manera de ayudar es no pedir el producto físico (para eso está la
primera recompensa). De este modo, ayudas sin crear ningún tipo de gasto
material y de igual modo podrás acceder al contenido una vez se publique.
Evidentemente, los libros físicos tienen algo de romanticismo y por eso se ofertan.</p>
</li>
<li>
<p>La segunda mejor manera es pedir los productos físicos. En el caso de esta
campaña, cuantos más libros produzcamos menos coste tendrá cada unidad
(economía de escala).</p>
</li>
<li>
<p>Uno de los mayores gastos de la campaña es el envío. Recoger los libros en
mano<sup id="fnref:cafe"><a class="footnote-ref" href="#fn:cafe">7</a></sup> (Bilbao) o agrupar pedidos reduce el coste del envío y hace que
la donación sea más eficiente. Es mejor para nosotros (y para ti, debido a
los descuentos) si se forma un grupo de personas y hacen pedidos conjuntos.</p>
</li>
</ul>
<p>Dicho esto, aquí va el link de la campaña por si te apetece participar:</p>
<p><a href="http://goteo.org/project/elenq-publishing">http://goteo.org/project/elenq-publishing</a></p>
<p>Muchas gracias.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:blockchain">
<p>No me juzguéis demasiado rápido: el curso estaba enfocado desde
una perspectiva técnica que trataba de fomentar el pensamiento crítico en los
tiempos del boom del blockchain. <a class="footnote-backref" href="#fnref:blockchain" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:cursos">
<p>Los necesita, sobre todo si son cursos como los que yo hago en los
que hablo de ética y de cómo crear tecnología independiente :D <a class="footnote-backref" href="#fnref:cursos" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:problemas">
<p>Hablo de las mujeres, de los desempleados de más de 50 años que
fueron expulsados de sus puestos de trabajo por la desindustrialización, las
personas con discapacidad, inmigrantes, las que acaban de salir de la
cárcel… <a class="footnote-backref" href="#fnref:problemas" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:voluntad">
<p>Un saludo para los Estados Unidos de Norteamérica. <a class="footnote-backref" href="#fnref:voluntad" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
<li id="fn:ingles">
<p>Tranquilidad, no seré yo el traductor. Soy consciente de que mi
nivel de inglés no es suficiente para una tarea así. Sí que trataré de
supervisar que la terminología, etc. es la correcta, pero no llego a mucho
más. <a class="footnote-backref" href="#fnref:ingles" title="Jump back to footnote 5 in the text">↩</a></p>
</li>
<li id="fn:gratitud">
<p>Soy así, yo qué sé. <a class="footnote-backref" href="#fnref:gratitud" title="Jump back to footnote 6 in the text">↩</a></p>
</li>
<li id="fn:cafe">
<p>Si te acercas a la ciudad para recibirlo en mano te invito a un café o
un té y charlamos un rato si te apetece. <a class="footnote-backref" href="#fnref:cafe" title="Jump back to footnote 7 in the text">↩</a></p>
</li>
</ol>
</div>Dark or Light: choose one2020-01-15T00:00:00+02:002020-01-15T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2020-01-15:/dark-light-theme.html<p>How I implemented dark-light theme switcher in this blog.</p><p>Since this afternoon this blog has a way to change between dark and light themes.</p>
<p>I made this because my eyes hurt when I visit really light websites from a
window with a dark background. My desktop environment is configured to show
everything with a dark background and I spend most of my time in the terminal,
so my eyes get used to the dark background and the light ones hurt, especially
at night.</p>
<p>I realized one of the sites that made my eyes hurt was my own website. I
can’t fix the whole web, but at least I can fix my site and write down what I
did <strong>to encourage you to fix yours</strong>.</p>
<h2>User preference</h2>
<p>First things first: since 2019 <span class="caps">CSS</span> has a new <em>mediaquery</em> that lets you know whether
the visitor has a dark or a light background configured in their system. I was
introduced to this thanks to <a href="https://mastodon.social/@sheenobu/103149905102239911">@sheenobu</a>, who took the time to
answer my message and make all this happen<sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup>.</p>
<p><a href="https://developer.mozilla.org/en-US/docs/Web/CSS/@media/prefers-color-scheme">Here you have some documentation</a> about that <em>mediaquery</em>
called <code>prefers-color-scheme</code>, but in summary it can take three values
—<code>light</code>, <code>dark</code> and <code>no-preference</code>— that are quite self-explanatory in my opinion.</p>
<p>So if you mix that with a little bit of <a href="mdn-custom-pro"><span class="caps">CSS</span> custom properties
magic</a> (<span class="caps">AKA</span> variables) you can just parametrize the whole
color scheme and then use the <em>mediaquery</em> to choose the variables you want to
use. That’s fine.</p>
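<p>As an illustration of that mix (a simplified sketch with made-up variable
names and color values, not this blog’s actual stylesheet), the whole idea fits
in a few lines:</p>
<pre><code class="language-css">/* Parametrize the palette with custom properties; light is the default */
:root {
  --bg: #fffff8;
  --fg: #1a1a1a;
}

/* Override the variables when the visitor prefers a dark scheme */
@media (prefers-color-scheme: dark) {
  :root {
    --bg: #1a1a1a;
    --fg: #fffff8;
  }
}

/* The rest of the stylesheet only ever refers to the variables */
body {
  background-color: var(--bg);
  color: var(--fg);
}
</code></pre>
<p>Every rule reads colors through <code>var()</code>, so swapping the whole scheme is
just a matter of redefining two custom properties.</p>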
<h2>Locked in your preference</h2>
<p>The problem comes when you want to be able to let the user change from one
color scheme to other.</p>
<p>The <code>prefers-color-scheme</code> mediaquery gets the user’s preference but, at least in
Firefox<sup id="fnref:2"><a class="footnote-ref" href="#fn:2">2</a></sup>, it’s not easy for users to change to a light theme if they want to.
They are locked into what they chose for their <span class="caps">OS</span>.</p>
<p>Sometimes it’s interesting to let readers change the theme by themselves,
for multiple reasons. As each developer or designer chooses the colors they
want, that may lead to color-scheme inconsistencies between the system colors
and the sites, or to readability issues. Also, readability is subject to
personal preference: I like to use dark backgrounds, but sometimes I prefer to
read from a lighter background if the ambient light is stronger.</p>
<h2>Letting visitors contradict themselves</h2>
<p>In order to let the visitors go against that blood pact they signed with their
<span class="caps">OS</span>, we need some JavaScript<sup id="fnref:3"><a class="footnote-ref" href="#fn:3">3</a></sup>.</p>
<blockquote>
<p><strong>Note about my personal taste:</strong> I avoid the use of JavaScript in places
where it is not needed. I consider it unnecessary for blogs or websites that show
you information in a format supported by the web (text, audio, video…).
Also, I consider it <strong>really</strong> important to think about users who don’t
want to or can’t run JavaScript.</p>
<p>Most of my sites don’t use JavaScript at all; modern <span class="caps">CSS</span> and <span class="caps">HTML</span> are more
than enough for most applications. Webpages with a heavy use of
JavaScript are a threat to accessibility and make bots, spiders and
non-canonical browsers hard to implement<sup id="fnref:4"><a class="footnote-ref" href="#fn:4">4</a></sup>.<br>
This blog makes use of JavaScript for two different things:</p>
<ul>
<li>The theme change I’m talking about in this post</li>
<li>Source code highlighting</li>
</ul>
<p>In both cases the blog is prepared to work perfectly for users with
JavaScript disabled. When JavaScript is disabled, code blocks respect the
<span class="caps">HTML</span> tags for code declaration but carry no extra tags or styles. In
the case of the theme control, when JavaScript is disabled, the website uses
the visitor’s default preference, leaving the option to change the theme
in the hands of the browser or the operating system. Most of the time, these
design decisions work in favour of users that access the web from browsers
that don’t need any kind of styling (terminal browsers, screen readers…),
helping the browser find the content more easily.</p>
</blockquote>
<p>When I was about to start implementing it I remembered a <a href="https://medium.com/@mwichary/dark-theme-in-a-day-3518dde2955a">Medium post by Marcin
Wichary</a> that explains the process very well. I used it as
a reference, but I added a couple of points I want to share with you. I’ll also
try to cover everything the author talks about in my own words, just in case
someone doesn’t want to access Medium<sup id="fnref:5"><a class="footnote-ref" href="#fn:5">5</a></sup>.</p>
<p>The first difference from the reference post is what I told you about in the
previous section. The post is from 2018, and the <code>prefers-color-scheme</code>
mediaquery is from 2019, so it’s not mentioned in the post<sup id="fnref:6"><a class="footnote-ref" href="#fn:6">6</a></sup>.</p>
<p>The mentioned post also has an introduction to <span class="caps">CSS</span> Custom Properties and their
use. I already gave you a link to the <span class="caps">MDN</span> Web documentation and I don’t feel
informed enough to explain anything about <span class="caps">CSS</span> to you, so better go
there and read.</p>
<p>That said, the first problem we have to solve is having some property that lets
<span class="caps">CSS</span> know which theme is in use. That can be implemented like the article does,
adding a <code>data-</code><em>something</em> attribute to the <code>html</code> element that can then be matched
in <span class="caps">CSS</span> like this:</p>
<pre class="highlight"><code class="language-css">html[data-theme='dark'] {
    /* Your dark theme style here */
}
html[data-theme='light'] {
    /* Your light theme style here */
}
</code></pre>
<blockquote>
<p><span class="caps">WARNING</span>: Be careful with the precedence of this rule: you have to put it
after the <code>prefers-color-scheme</code> mediaquery to make the cascade work as it
should. If you put it before, the mediaquery may override this
configuration and the change will pass unnoticed.</p>
</blockquote>
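<p>To make the ordering concrete, here is a minimal sketch of a stylesheet layout that respects that rule (the variable names and colors are mine, not this blog’s actual <span class="caps">CSS</span>):</p>
<pre class="highlight"><code class="language-css">/* 1. Defaults: the light palette */
:root {
    --bg: #ffffff;
    --fg: #222222;
}
/* 2. OS preference, used when the visitor hasn't chosen anything yet */
@media (prefers-color-scheme: dark) {
    :root {
        --bg: #222222;
        --fg: #eeeeee;
    }
}
/* 3. Explicit choice, declared last so the cascade favors it */
html[data-theme='light'] {
    --bg: #ffffff;
    --fg: #222222;
}
html[data-theme='dark'] {
    --bg: #222222;
    --fg: #eeeeee;
}
</code></pre>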
<p>But now you have to deal with the attribute and make it change whenever the
visitor selects one configuration or the other. As I said, you need JavaScript for
that. Setting the attribute is as simple as this vanilla JavaScript line:</p>
<pre class="highlight"><code class="language-clike">document.documentElement.setAttribute('data-theme', color);
</code></pre>
<p>Good. Now it’s quite easy to start, right? Add a button, put an event listener
on it, and whenever it’s pressed change the theme by setting the attribute with
the line I just showed you. For instance:</p>
<pre class="highlight"><code class="language-clike">var theme_switch = document.getElementById('dark-light-switch');
function change(color){
    document.documentElement.setAttribute('data-theme', color);
    theme_switch.setAttribute('color', color);
}
function theme_change_requested(){
    var color = theme_switch.getAttribute('color');
    if(color=='light')
        change('dark');
    else
        change('light');
}
theme_switch.addEventListener('click', theme_change_requested);
</code></pre>
<p>We selected an element that will act as a theme switcher and added an event
listener to it. Whenever it’s clicked it will run the <code>theme_change_requested</code>
function that will change the color from the current to the other. Easy.</p>
<p>Problems come now.</p>
<h3>Get the initial color</h3>
<p>In order to start that process, you have to be able to know the current theme
in use, so that you can set the necessary attribute on the <code>html</code> tag and the
initial look of the theme switcher (in this blog, a sun or a moon).</p>
<p>This current theme inspection turns out to be tricky. (You can query the
<code>prefers-color-scheme</code> mediaquery from JavaScript with <code>window.matchMedia</code>, but
that only tells you the preference, not what the <span class="caps">CSS</span> actually applied.) You can
bypass that by getting something you know is going to be present in your <span class="caps">CSS</span> and
reading it. In my case I used the <code>background-color</code> of the <code>body</code>, because I
set the background to white in the light color scheme, as you can see in the
<code>getCurrentColor</code> function:</p>
<pre class="highlight"><code class="language-clike">var theme_switch = document.getElementById('dark-light-switch');
function change(color){
    document.documentElement.setAttribute('data-theme', color);
    theme_switch.setAttribute('color', color);
}
function theme_change_requested(){
    var color = theme_switch.getAttribute('color');
    if(color=='light')
        change('dark');
    else
        change('light');
}
function getCurrentColor(){
    // This is dependent on the CSS, be careful
    var body = document.getElementsByTagName('BODY')[0];
    var background = getComputedStyle(body).getPropertyValue('background-color');
    if(background == 'rgb(255, 255, 255)') {
        return 'light';
    } else {
        return 'dark';
    }
}
function init( color ){
    change(color);
    theme_switch.setAttribute('color', color);
}
init( getCurrentColor() );
theme_switch.addEventListener('click', theme_change_requested);
</code></pre>
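<p>By the way, comparing against the literal string <code>'rgb(255, 255, 255)'</code> is fragile: the moment your light background stops being pure white, the detection silently breaks. A more general helper (a sketch of mine, not what this blog actually runs) classifies any computed <code>background-color</code> by its relative luminance:</p>
<pre class="highlight"><code class="language-clike">// Classify an 'rgb(r, g, b)' string as light or dark using relative
// luminance instead of an exact string comparison.
function isLightColor(rgbString) {
    var parts = rgbString.match(/\d+/g).map(Number);
    // Rec. 709 luma coefficients, normalized to the 0..1 range
    var luminance = (0.2126 * parts[0] + 0.7152 * parts[1] + 0.0722 * parts[2]) / 255;
    return luminance > 0.5;
}
</code></pre>
<p><code>getCurrentColor</code> could then return <code>isLightColor(background) ? 'light' : 'dark'</code> instead of doing the string comparison.</p>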
<p>Now, with this new code you are able to get the current theme when the page
loads and prepare your button and your <code>html</code> tag to start with the color
scheme the visitor has configured by default.</p>
<h3>Page-change amnesia</h3>
<p>Once you have all of the above working you’ll realize the website forgets the
visitor’s decision when they navigate from one page to another. It makes
perfect sense: nothing persists the selection between page loads.</p>
<p>We can make use of <code>localStorage</code> for this. With the following line we can set
the <code>'color'</code> item in <code>localStorage</code> to the color the visitor chose:</p>
<pre class="highlight"><code class="language-clike">localStorage.setItem('color', color);
</code></pre>
<p>Updating the <code>getCurrentColor</code> function, we can read the color from
<code>localStorage</code> first and, if it’s not set, fall back to the strategy we used
before with the <code>body</code><span class="quo">‘</span>s <code>background-color</code>. This is the updated <code>getCurrentColor</code>
function:</p>
<pre class="highlight"><code class="language-clike">function getCurrentColor(){
    // Color was set before in localStorage
    var storage_color = localStorage.getItem('color');
    if(storage_color !== null){
        return storage_color;
    }
    // If localStorage is not set, check the background of the page
    // This is dependent on the CSS, be careful
    var background = getComputedStyle(body).getPropertyValue('background-color');
    if(background == 'rgb(255, 255, 255)') {
        return 'light';
    } else {
        return 'dark';
    }
}
</code></pre>
<p>With this function we can know which color the user has configured, or the color
they chose with our color selector button, but we still have to activate the theme
if the user has chosen one that is not the one in their preferences. Updating
the <code>init</code> and <code>change</code> functions this way is more than enough for that:</p>
<pre class="highlight"><code class="language-clike">function init( color ){
    change(color, true);
    localStorage.setItem('color', color); // CHANGED!
    theme_switch.setAttribute('color', color);
}
function change(color, nowait){
    document.documentElement.setAttribute('data-theme', color);
    theme_switch.setAttribute('color', color);
    localStorage.setItem('color', color); // CHANGED!
}
</code></pre>
<h3>Smooth transitions</h3>
<p>In the article I had as a reference, the author takes a simple but very effective
approach to theme transitions, using the following <span class="caps">CSS</span>:</p>
<pre class="highlight"><code class="language-css">html.color-theme-in-transition,
html.color-theme-in-transition *,
html.color-theme-in-transition *:before,
html.color-theme-in-transition *:after {
    transition: all 750ms !important;
    transition-delay: 0 !important;
}
</code></pre>
<p>The article also explains how to activate the transition: the following piece
of JavaScript activates it and deactivates it one second later:</p>
<pre class="highlight"><code class="language-clike">window.setTimeout(function() {
    document.documentElement.classList.remove('color-theme-in-transition');
}, 1000);
document.documentElement.classList.add('color-theme-in-transition');
</code></pre>
<p>We have to be careful with where we add this, because we may be forcing
transitions during navigation, and that’s really annoying. Updating the <code>change</code>
function with the transition is not enough; we need a way to discard the
transition for the changes produced by the <code>init</code> function. We can exploit the
fact that <span class="caps">JS</span> arguments are optional for that. Of course, the transition must be
added in the <code>change</code> function.</p>
<pre class="highlight"><code class="language-clike">function init( color ){
    change(color, true); // Add true for nowait
    localStorage.setItem('color', color);
    theme_switch.setAttribute('color', color);
}
function change(color, nowait){ // Add the nowait argument
    // Discard the transition if nowait is set
    if(nowait !== true){
        window.setTimeout(function() {
            document.documentElement.classList.remove('color-theme-in-transition');
        }, 1000);
        document.documentElement.classList.add('color-theme-in-transition');
    }
    document.documentElement.setAttribute('data-theme', color);
    theme_switch.setAttribute('color', color);
    localStorage.setItem('color', color);
}
</code></pre>
<p>Now, with all this, we are able to make the website remember the user’s
configuration from one page to another.</p>
<h2>Wrapping up</h2>
<p>With this configuration we are able to:</p>
<ul>
<li>Get the visitor’s configuration based on the <span class="caps">OS</span> color theme: dark or light.</li>
<li>Let the visitor change their mind by choosing a different color scheme.</li>
<li>Get the initial color of the page to be able to initialize the buttons.</li>
<li>Make the website remember the color scheme selection from one page to another
using <code>localStorage</code>.</li>
<li>Add smooth transitions, but don’t activate them on page changes to avoid weird flashes.</li>
</ul>
<p>And that’s all.</p>
<p>No! It isn’t! We also had some fun talking about philosophy, accessibility and
sites you shouldn’t visit. In fact, all the color theme stuff was an excuse to
talk about it, but <em>sssssssh</em> don’t tell anyone.</p>
<p>If, after knowing that, you are still interested in the excuse itself, all the
code together looks like this:</p>
<pre class="highlight"><code class="language-clike">var theme_switch = document.getElementById('dark-light-switch');
var body = document.getElementsByTagName('BODY')[0];
function init( color ){
    change(color, true);
    localStorage.setItem('color', color);
    theme_switch.setAttribute('color', color);
}
function change(color, nowait){
    // Discard the transition if nowait is set
    if(nowait !== true){
        window.setTimeout(function() {
            document.documentElement.classList.remove('color-theme-in-transition');
        }, 1000);
        document.documentElement.classList.add('color-theme-in-transition');
    }
    document.documentElement.setAttribute('data-theme', color);
    theme_switch.setAttribute('color', color);
    localStorage.setItem('color', color);
}
function theme_change_requested(){
    var color = theme_switch.getAttribute('color');
    if(color=='light')
        change('dark');
    else
        change('light');
}
function getCurrentColor(){
    // Color was set before in localStorage
    var storage_color = localStorage.getItem('color');
    if(storage_color !== null){
        return storage_color;
    }
    // If localStorage is not set, check the background of the page
    // This is dependent on the CSS, be careful
    var background = getComputedStyle(body).getPropertyValue('background-color');
    if(background == 'rgb(255, 255, 255)') {
        return 'light';
    } else {
        return 'dark';
    }
}
init( getCurrentColor() );
theme_switch.addEventListener('click', theme_change_requested);
</code></pre>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>Thanks for being there! <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:2">
<p>The way to change that is to access <code>about:config</code> and update the
<code>ui.systemUsesDarkTheme</code> field: <code>1</code> means <code>true</code> and <code>0</code> means <code>false</code>. Be
careful, because it’s not a boolean field but an integer field (I don’t know
why, don’t ask me). This change affects <strong>all</strong> the tabs. <a class="footnote-backref" href="#fnref:2" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:3">
<p>This isn’t true in <em>every</em> context. We need it here because this is a
static website, which means the content you read is already created on the
server side before you ask for it. If it weren’t, all this could be simpler: just
load a different <span class="caps">CSS</span> depending on the user’s choice. The static counterpart
of this approach would be to create the <em>whole website</em> once per color scheme
and leave the copies in different folders like <code>domain/dark/whatever.html</code> and
<code>domain/light/whatever.html</code>, which is not practical at all and carries tons of
extra problems. <a class="footnote-backref" href="#fnref:3" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:4">
<p>As <em>everyone</em> wants to have a good rank in the search engines <em>more than
anything else</em>, Google made a lot of decisions about how websites should be
built in order to be indexed properly. With the market share they had
(almost 100%) they had the power to force developers and designers to make
websites the way Google liked. That was obviously bad, but it had some good
consequences: websites were easy to scrape or read by a robot with low
resources (which was what Google wanted). But some years ago Google
announced their spider was able to run JavaScript, and that set free all those
developers and designers who wanted their websites to have a good
ranking: they don’t have any other limit to the use of JavaScript right
now (because they don’t really care about anything else). That made many
pages impossible to read by clients that don’t use JavaScript and made the
process of accessing websites automatically or with non-canonical browsers
impossible in many cases. <em>Thank you, developers and designers, for breaking
the Web</em>. <a class="footnote-backref" href="#fnref:4" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
<li id="fn:5">
<p>There are so many reasons to avoid Medium that <a href="https://nomedium.dev/">someone made a specific
website for them</a>. Also, some interesting free
software projects decided to migrate away from it and wrote about it, that’s
<a href="https://blog.elementary.io/welcome-to-the-new-blog/#the-decline-of-medium">the case of ElementaryOS</a>. <a href="https://mastodon.social/@ekaitz_zarraga/103498743276277093">I asked in the fediverse about
this</a> and many people sent me articles and links. Thanks to all! <a class="footnote-backref" href="#fnref:5" title="Jump back to footnote 5 in the text">↩</a></p>
</li>
<li id="fn:6">
<p>Too bad Marcin, you were unable to see the future. <a class="footnote-backref" href="#fnref:6" title="Jump back to footnote 6 in the text">↩</a></p>
</li>
</ol>
</div>Screencasts: discussing with ffmpeg2020-01-11T00:00:00+02:002020-01-11T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2020-01-11:/ffmpeg-screencast.html<p>Screencasts in <code>ffmpeg</code>, having some fun solving issues thanks to
other people</p><p>When you battle using your arguments that’s called a <em>discussion</em>… That’s
exactly what I’ve been doing for a couple of days with <code>ffmpeg</code>: I’ve been
using arguments trying to reach an understanding.</p>
<p>I wanted to record my screen, and a couple of cameras, for reasons you’ll know
about later, and I didn’t want to play around with new <span class="caps">GUI</span> programs and
configuration so I decided to go for a simple <code>ffmpeg</code> setup.</p>
<p>You’ve probably played with <code>ffmpeg</code> in the past. It has tons of different
input arguments and options. It’s crazy.</p>
<p>Most of my previous uses of it were just video conversions, which are as
simple as choosing the right extensions for the files, but when it comes to video
and audio recording it gets complicated. I have no idea about video and audio
encodings and I don’t really have the time to dig into such an exciting topic.
I searched the Internet for examples and I found some: cool.</p>
<p>I played around with multiple inputs and outputs, I changed arguments I can’t
even remember now and it kinda worked until I decided to record my voice at the
same time. <strong>Delay</strong>.</p>
<p>What to do then?</p>
<p>Just go to the internet and keep searching.</p>
<p>I found a project called <code>vokoscreen</code> that is now archived because it’s
migrating from <code>ffmpeg</code> to <code>gstreamer</code> (I also struggled with gstreamer in the
past, but that’s another story), but it worked fine. It was in the repos of my
distro, it only asked me to install one dependency, a couple of megs only… Great!</p>
<p>I tried to make a screencast and the audio worked like a charm. I went for the
code, read it and <a href="https://github.com/vkohaupt/vokoscreen/blob/b5865a85561baa46e627c09cf77efb7369516327/screencast.cpp#L2718">realized the arguments it uses to call <code>ffmpeg</code> are easy to
find</a>.</p>
<p>Even better, in the program itself there’s a button to show a log of what it
does and it dumps the exact call it does.</p>
<p>With that and some extra things I learned from the investigation in the deep
abyss of the Internet, boom! There it goes:</p>
<code class="language-bash">ffmpeg">
<pre class="highlight"><code class="language-bash">ffmpeg \
    -y -f x11grab -draw_mouse 1 -framerate 25 -video_size 1920x1080 -i :0+0,0 \
    -f alsa -ac 2 -i default \
    -pix_fmt yuv420p -c:v libx264 -preset veryfast \
    -c:a libmp3lame -q:v 1 -s 1920x1080 -f matroska \
    output.mkv
</code></pre>
<p>Today, in half an hour, I solved the thing I’d been struggling with for a couple
of days. But I think I wouldn’t have been able to solve it if I hadn’t struggled
with it these last days… I don’t know.</p>
<p>The good thing is I learned a couple of things from this, which I’ll write down
here to avoid forgetting them:</p>
<h3>Multiple inputs</h3>
<p>Like in the example command, <code>ffmpeg</code> can take multiple inputs. In the case
of the example, they are <code>x11grab</code> (my screen) and <code>alsa</code><span class="quo">‘</span>s default input (the
microphone). More inputs can be combined, like music playing in the background
or whatever you want.</p>
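<p>For instance, mixing the microphone with background music could look something like this (an untested sketch of mine; the filter graph syntax is worth double-checking against the ffmpeg documentation):</p>
<pre class="highlight"><code class="language-bash"># Input 0 is the screen, input 1 the microphone, input 2 a music file;
# amix merges the two audio streams into one
ffmpeg \
    -f x11grab -framerate 25 -video_size 1920x1080 -i :0+0,0 \
    -f alsa -i default \
    -i music.mp3 \
    -filter_complex '[1:a][2:a]amix=inputs=2[a]' \
    -map 0:v -map '[a]' \
    -c:v libx264 -c:a libmp3lame \
    output.mkv
</code></pre>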
<h3>Multiple outputs</h3>
<p>You can also specify multiple outputs, just like with multiple inputs, but in
the output part of the command<sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup>.</p>
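<p>For example, a single grab can be written to two files at once, each with its own encoding options placed right before its output name (the filenames here are made up):</p>
<pre class="highlight"><code class="language-bash"># One input, two outputs: a full-size recording and a scaled-down copy.
# Each output file takes the options that precede it.
ffmpeg -f x11grab -framerate 25 -video_size 1920x1080 -i :0+0,0 \
    -c:v libx264 full.mkv \
    -vf scale=960:540 -c:v libx264 small.mkv
</code></pre>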
<h4>Pipe</h4>
<p>You can even pipe the command to a different one, like:</p>
<pre><code>ffmpeg [...] - | ffplay -i -
</code></pre>
<p>In this case you can use one of the outputs to record to a file and another
one for <code>ffplay</code>, which plays the video on screen.</p>
<p>This is useful if you want to record from a webcam and see what you
are recording.</p>
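<p>Putting both ideas together, recording a webcam to a file while previewing it could look roughly like this (a sketch, assuming a v4l2 webcam at <code>/dev/video0</code>):</p>
<pre class="highlight"><code class="language-bash"># First output goes to a file; the second is muxed to stdout and piped
# into ffplay, which shows it on screen
ffmpeg -f v4l2 -i /dev/video0 \
    -c:v libx264 recording.mkv \
    -f matroska - | ffplay -i -
</code></pre>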
<h3>Closing note</h3>
<p>So yeah, I was ignorant about <code>ffmpeg</code> and I still am.</p>
<p>But at least I learned a couple of the arguments and learned how to deal with
all my cameras and screens at the same time.</p>
<p>Good enough.</p>
<p>I mean, it works, right? And that’s the most important thing<sup id="fnref:2"><a class="footnote-ref" href="#fn:2">2</a></sup>.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>Yes, it’s hard to know where’s the input and where’s the output. <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:2">
<p>It’s not. The most important thing is to be happy, do what you like,
enjoy your life and feel appreciated and valued. If your software works it’s
like… Uugh… Congratulations I guess? <a class="footnote-backref" href="#fnref:2" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
</ol>
</div>Hiding the complexity2019-10-27T00:00:00+03:002019-10-27T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2019-10-27:/hiding-the-complexity.html<p>Inspired by Scheme, I share my thoughts about complexity, how it’s
being hidden by our tools, and how that affects us as creators and engineers.</p><p>I’ve been recently playing with Scheme, reading R⁷RS and so on, and I found
something really interesting: Even with its high level of abstraction, it
doesn’t hide the complexity and makes you pay attention to it.</p>
<p>That’s extremely powerful and interesting.</p>
<p>It’s even more interesting if you think about the fact that Scheme can be
implemented from scratch in an acceptable amount of time by a couple of hands.
You don’t need to be a big corporation or a big group of developers coding for
years to implement it.</p>
<p>It’s <strong>simple</strong> but it doesn’t hide the complexity of the implementation.
That’s a really powerful balance.</p>
<p>But both points are too much to leave here without playing with them
separately, so let’s try to understand why each of them is<sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup>
fundamental.</p>
<h2><del>Hidden</del> Latent complexity</h2>
<p>You can create the best programming language in the world, but the
complexity of programming can’t be eliminated, because the user of the
language will, actually, be programming. So you can take two approaches here:</p>
<ul>
<li>Expose the <strong>intrinsic</strong> complexity of programming.</li>
<li>Make it look as if complexity doesn’t exist, which means hiding the complexity
as much as you can under layers of abstraction.</li>
</ul>
<p>Most modern programming languages go for the second option. But not only
programming languages: also operating systems, computers themselves and many
more areas. Which is not especially bad in general, but it’s dangerous when you
need control.</p>
<p>Most of the time, when complexity is hidden by design, it’s just <em>latent
complexity</em>. It’s harder for the user to reach, so the scope of things they
can do is reduced (and with it their ability to decide with a high level
of detail), but the complexity is still there, happening without being noticed,
under the surface, and being impossible to correct if something goes wrong.</p>
<p>Think about your cellphone. You can’t open it, change the battery, change its
software, change… <em>anything</em>. Because it’s hard to do and “<em>people don’t need
to know about that</em>“. But in the end, what you have is a phone that is impossible
to repair if something goes wrong, or impossible to change if you want it to do
something that is not the default behaviour.</p>
<p>It is a problem (some people are trying to fix it, by the way) but it’s not a
problem for <em>everyone</em>, because not <em>everyone</em> needs to have that level of
complexity exposed. But they should have the right to see it if they wanted to.</p>
<h2>Exposed complexity</h2>
<p>As engineers working on technology, we should be demanding that the complexity
of things be exposed, rather than running away from it.</p>
<p>As engineers we are supposed to want to know how stuff works!</p>
<p>In the case of programming languages, I want to control what the program does,
be aware of what I’m doing, and know which decisions I’m taking.</p>
<p>When complexity is exposed you are reminded of the importance of every choice
you make. It’s not something that happens: you have to think about it.</p>
<blockquote>
<p>In Scheme: lists vs. vectors. Which one is better? Why have both? Why not
use one all the time?</p>
</blockquote>
<p>It’s reminding you what you have under the hood, even if you aren’t
implementing it yourself. That way you don’t forget about <em>your job</em>.</p>
<h2>Simplicity</h2>
<p>Simplicity means that there’s no unneeded complexity. It doesn’t mean that
complexity is hidden. We tend to confuse both terms too often.</p>
<p>Scheme is simple because its core is small and it’s based on a few concepts.
Being simple means that concepts are clear and consistent and have few or no exceptions.</p>
<p>Other programming languages use the same concepts that Scheme does, but they are
not clearly stated, so you can’t rely on them for your understanding of the
language. Scope in JavaScript (for instance) is often explained as something
related to the position of the curly brackets, hiding the fact that it’s
lexical scope. Watching engineers prefer a silly trick over an academic fact is
unsatisfying<sup id="fnref:2"><a class="footnote-ref" href="#fn:2">2</a></sup>.</p>
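<p>A tiny example makes the academic fact concrete: the variable a function sees is decided by where the function is <em>written</em>, not by where it is called from (the snippet is mine, not from any of the mentioned sources):</p>
<pre class="highlight"><code class="language-clike">var x = 'outer';
function makeGetter() {
    var x = 'lexical';         // the x the inner function sees is bound here,
    return function getter() { // where the function is written...
        return x;
    };
}
function caller() {
    var x = 'dynamic';         // ...not here, where it happens to be called
    return makeGetter()();
}
// caller() returns 'lexical': scope follows the source text, not the call stack
</code></pre>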
<p>In many platforms, abstraction layers are added until the internals are hidden
or blurred. In this case, complexity is directly hidden by the implementation, more
than by users themselves running away from it.</p>
<p>Some would ask: “Who cares about the details?”<sup id="fnref:3"><a class="footnote-ref" href="#fn:3">3</a></sup> And it’s perfectly fine to
think that at some point, but when it comes to choosing the right tool for the
right problem, performance and fine tuning, you’d really like to know how things
are implemented, because implementation is what makes some operations faster
or more accurate than others. And, probably more important than that, being
aware of how stuff is implemented makes us independent enough to change the
implementation if we want, <strong>which is the base of free software</strong>.</p>
<p>When your tools hide reality from you for long enough, you start to forget that
reality still exists even when you are not watching it, and you start
acting like it wasn’t there.</p>
<h2>Assembly then… Right?</h2>
<p>Don’t get me wrong. I’m not against simplification or making our job easier.
Scheme is a really high-level language. <strong>Abstraction is good</strong>.</p>
<p>Accidental self-lobotomy is not that good.</p>
<blockquote>
<p><span class="caps">NOTE</span>:<br>
This blogpost was triggered by this talk where a musician talks about
chiptune music and says how making chiptune music made him a better
guitarist. It has some good points about constraints and complexity.</p>
<p><a href="https://youtu.be/_7k25pwNbj8">https://youtu.be/_7k25pwNbj8</a></p>
</blockquote>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>In my opinion, of course. This is <em>my</em> blog. <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:2">
<p>And insulting, I’d say. <a class="footnote-backref" href="#fnref:2" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:3">
<p><span class="caps">TLDR</span>: <em>You, as an engineer, should</em>. <a class="footnote-backref" href="#fnref:3" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
</ol>
</div>TUI Slang: Speak like the natives2019-06-15T00:00:00+03:002019-06-15T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2019-06-15:/clopher04.html<p>Interfacing between Clojure, Java and Native code.</p><p>The previous post introduced <code>termios</code> as a native interface to configure the
terminal input processing. With termios we managed to make our C programs get
input character by character, processing it as it came with no buffering, but
we didn’t integrate that with our Clojure code. Now it’s time to do it.</p>
<h3>Run before it’s too late</h3>
<p>Before we dig into the unknown, I have to tell you there are other alternatives
for the terminal configuration. The simplest one I can imagine is using
<code>stty</code><sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup> as an external command. I learned this from <a href="http://salza.dk/">Liquid</a>, a
really interesting project I had as a reference. If you want to see this at work,
check the <code>adapters/tty.clj</code> file in the <code>src</code> directory of the project.<sup id="fnref:2"><a class="footnote-ref" href="#fn:2">2</a></sup></p>
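<p>The general pattern with <code>stty</code> is just a pair of shell calls around the program (a rough sketch of mine; Liquid’s actual invocation may differ):</p>
<pre class="highlight"><code class="language-bash"># Disable line buffering and echo so keypresses arrive one by one
stty -icanon -echo min 1

# ... run the editor loop, reading single characters ...

# Restore a sane terminal state on the way out
stty sane
</code></pre>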
<p>Of course, it has some drawbacks. <code>stty</code> is part of the <span class="caps">GNU</span> Coreutils project
and you have to be sure your target has it installed if you want to rely on
it. I’m not sure if it’s supported in non-<span class="caps">GNU</span> operating systems<sup id="fnref:3"><a class="footnote-ref" href="#fn:3">3</a></sup>.</p>
<p>In my case, I decided to stay with the <code>termios</code> interface to deal with all this,
because I didn’t really want to rely on external commands and it’s supposed to
be implemented in any <span class="caps">POSIX</span> <span class="caps">OS</span>. The good (bad?) thing is it made me deal with
native libraries from Clojure, and I had to learn how to do it.</p>
<h3>The floor is Java</h3>
<p>When dealing with low-level stuff we have to remember Clojure is just Java, and
most of the utilities we need to use come from it. This means the question we
have to answer is not really <em>“how to call native code from Clojure?”</em> because
if we are able to call native code from Java, we will be able to do it from
Clojure too (if we spread some magic on top).</p>
<h4>So, how to call native code from Java?</h4>
<p>First I checked the <a href="https://en.wikipedia.org/wiki/Java_Native_Interface">Java Native Interface (aka <span class="caps">JNI</span>)</a>, but I thought it
was too much for me and I decided to look further. Remember there are only a
couple of calls to make to <code>termios</code> from our code, so we don’t really want to
mess with a lot of boilerplate code, compilations and so on.</p>
<p>My research led me to the <a href="https://en.wikipedia.org/wiki/Java_Native_Access">Java Native Access (aka <span class="caps">JNA</span>)</a> library. If you
check the link you’ll find that Wikipedia<sup id="fnref:4"><a class="footnote-ref" href="#fn:4">4</a></sup> describes it as:</p>
<blockquote>
<p><span class="caps">JNA</span>’s design aims to provide native access in a natural way with a minimum of
effort. No boilerplate or generated glue code is required.</p>
</blockquote>
<p>Sounds right to me. Doesn’t it?</p>
<p>I encourage you to check the full Wikipedia entry and, if you have some free
time at the office or something, to check the implementation, because it’s
really interesting. But I’ll leave that to you.</p>
<h5>A lantern in the dark</h5>
<p><span class="caps">JNA</span> is quite easy to use in the case of Clopher, even easier if you realize
that <a href="https://github.com/mabe02/lanterna">lanterna</a>, the <span class="caps">TUI</span> library, is out there using it internally,
so you can <em>steal</em><sup id="fnref:5"><a class="footnote-ref" href="#fn:5">5</a></sup> the implementation from it. Lanterna is a great piece
of software I took as a reference for many parts of the project. Digging into the
internals of large libraries is a great exercise and you can learn a lot from it.</p>
<p>First of all, like in many Java projects, the amount of abstraction it has is
crazy. It takes some time to find the actual implementation of what we want.
It isn’t like this for no reason: they need to create this
amount of abstraction because the part of the library that handles the widgets
can work on top of many different terminal implementations, including a
<a href="https://en.wikipedia.org/wiki/Swing_(Java)">Swing</a>-based one that comes with Lanterna itself.</p>
<p>Clopher only targets <span class="caps">POSIX</span> compatible operating systems, so we can go directly
to what we want and read the termios part, discarding all the other
compatibility code. This code is quite easy to find if you look at the directory
tree of Lanterna: there’s a <code>native-integration</code> folder in the root directory.
If you follow it you’ll arrive at <a href="https://github.com/mabe02/lanterna/blob/master/native-integration/src/main/java/com/googlecode/lanterna/terminal/PosixLibC.java"><code>PosixLibC.java</code></a>, which
uses <span class="caps">JNA</span> to interact with termios.</p>
<p>The implementation provided by Lanterna is quite complete: they declare a
library with the functions they need and the data structure introduced in the
previous chapter. Once the library interface and the necessary data structures
are defined from Java, they can be called with <span class="caps">JNA</span>, like they do in the file:
<a href="https://github.com/mabe02/lanterna/blob/263f013a2ee1d522eb86b8f1d315423fb1f79711/native-integration/src/main/java/com/googlecode/lanterna/terminal/NativeGNULinuxTerminal.java#L123"><code>NativeGNULinuxTerminal.java</code></a>.</p>
<h4>How to call <span class="caps">JNA</span> from Clojure, then?</h4>
<p>Calling Java code from Clojure is quite simple because Clojure has been
designed with that in mind, but that’s not all. Thanks to the Internet,
there’s a great <a href="https://nakkaya.com/2009/11/16/java-native-access-from-clojure/">blogpost by Nurullah Akkaya</a> describing a simple way
to use <span class="caps">JNA</span> from Clojure. From there, we can move to our specific case.</p>
<p><code>termios</code> has its own data structure, so we need to define it so that <span class="caps">JNA</span> knows
how to interact with it. The problem is that Clojure doesn’t have enough <span class="caps">OOP</span>
tools to do it directly, so we need to write it in plain Java. The good thing is
that we don’t really need to create anything else.</p>
<p>If we remove some unneeded code from Lanterna’s termios structure
implementation it will look like the implementation I made at
<code>src/java/clopher/Termios.java</code>:</p>
<pre class="highlight"><code class="language-clike">package clopher.termios;

import com.sun.jna.Structure;

import java.util.Arrays;
import java.util.List;

/**
 * Interface to Posix libc
 */
public class Termios extends Structure {
    private int NCCS = 32;

    public int c_iflag;   // input mode flags
    public int c_oflag;   // output mode flags
    public int c_cflag;   // control mode flags
    public int c_lflag;   // local mode flags
    public byte c_line;   // line discipline
    public byte c_cc[];   // control characters
    public int c_ispeed;  // input speed
    public int c_ospeed;  // output speed

    public Termios() {
        c_cc = new byte[NCCS];
    }

    // This function is important for JNA, because it needs to know the
    // order of the fields of the struct in order to make a correct Java
    // class to C struct translation
    protected List<String> getFieldOrder() {
        return Arrays.asList(
            "c_iflag",
            "c_oflag",
            "c_cflag",
            "c_lflag",
            "c_line",
            "c_cc",
            "c_ispeed",
            "c_ospeed"
        );
    }
}
</code></pre>
<p>Once the struct is defined, it’s time to use it from Clojure. <code>clopher.term</code>
namespace has the code to solve this. Summarized here:</p>
<pre class="highlight"><code class="language-clojure">(ns clopher.term
  (:import [clopher.termios Termios]
           [com.sun.jna Function]))

(def ^:private ICANON 02)
(def ^:private ECHO   010)
(def ^:private ISIG   01)
(def ^:private ECHONL 0100)
(def ^:private IEXTEN 0100000)
(def ^:private VTIME  5)
(def ^:private VMIN   6)

; The macro we saw in the blogpost by Nurullah Akkaya
(defmacro jna-call [lib func ret & args]
  `(let [library#  (name ~lib)
         function# (Function/getFunction library# ~func)]
     (.invoke function# ~ret (to-array [~@args]))))

; Wrapper for the tcgetattr function
(defn get-config!
  []
  (let [term-conf (Termios.)]
    (if (= 0 (jna-call :c "tcgetattr" Integer 0 term-conf))
      term-conf
      (throw (UnsupportedOperationException.
               "Impossible to get terminal configuration")))))

; Wrapper for the tcsetattr function
(defn set-config!
  [term-conf]
  (when (not= 0 (jna-call :c "tcsetattr" Integer 0 0 term-conf))
    (throw (UnsupportedOperationException.
             "Impossible to set terminal configuration"))))

; Example that sets the non-canonical mode using the flags at the top
; of the file.
; Yeah, binary operations.
(defn set-non-canonical!
  ([]
   (set-non-canonical! true))
  ([blocking]
   (let [term-conf (get-config!)]
     (set! (.-c_lflag term-conf)
           (bit-and (.-c_lflag term-conf)
                    (bit-not (bit-or ICANON ECHO ISIG ECHONL IEXTEN))))
     (aset-byte (.-c_cc term-conf) VMIN (if blocking 1 0))
     (aset-byte (.-c_cc term-conf) VTIME 0)
     (set-config! term-conf))))
</code></pre>
<p>Pay attention to all the mutable code here!
The <code>aset-byte</code> function helps a lot when dealing with all that.</p>
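<p>If the octal flags and the bit masking feel like magic, the arithmetic can be
checked in isolation. Here is the same clearing operation written in plain
shell arithmetic, with a made-up starting <code>c_lflag</code> value that has every flag set:</p>

```shell
# Same octal constants as the Clojure code above
ICANON=02; ECHO=010; ISIG=01; ECHONL=0100; IEXTEN=0100000
# Hypothetical c_lflag with all five flags enabled
lflag=$(( ICANON | ECHO | ISIG | ECHONL | IEXTEN ))
# Clear them: lflag AND (NOT (flags OR-ed together)), like bit-and/bit-not
cleared=$(( lflag & ~(ICANON | ECHO | ISIG | ECHONL | IEXTEN) ))
printf 'before: %o  after: %o\n' "$lflag" "$cleared"   # before: 100113  after: 0
```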
<p>Also be sure to check termios’ documentation, because the calls act in a very
C-like way, returning a non-zero value when they fail.</p>
<p>We need one extra point in our code to solve the Java-Clojure interoperability:
we have to tell our project manager that we included some Java code in there.
If our project manager is Leiningen, we can just tell it where we store our
Java code. Be careful, because Leiningen <a href="https://github.com/technomancy/leiningen/blob/master/sample.project.clj#L302">doesn’t like it if you mix Java and
Clojure in the same folder</a>.</p>
<pre class="highlight"><code class="language-clojure">(defproject
  ; There's more blablabla in here but these are the keys I want you to
  ; take into account
  :source-paths      ["src/clojure"]
  :java-source-paths ["src/java"]
  :javac-options     ["-Xlint:unchecked"])
</code></pre>
<h3>Look back!</h3>
<p>Now you can configure your terminal to act non-canonically and serve you the
characters one by one as they come. It’s cool, but you’ll see there are some
problems still to come in the next chapters. Don’t worry! They’ll come.</p>
<blockquote>
<p>This is like a heroic novel where the character (in this case you) fights
monsters one by one, leaving their dead corpses on the dungeon floor. Looking
back will let you remember how many monsters you slaughtered on your way to
the depths where the treasure awaits. Remember to take a rest and sharpen your
sword. This is a long journey.</p>
<p>Prepare yourself for the next monster. Let the voice of the narrator guide you
to the unknown.</p>
</blockquote>
<p>Why don’t you mix what you learned in the previous chapter with what you
learned from this one and try to make an interactive terminal program yourself?</p>
<p>I’ll solve that in the next chapter, but there’s some code of that part already
implemented in the repository. You can check it while I keep writing and
coding. Here’s the link to the project:</p>
<p><a href="https://gitlab.com/ekaitz-zarraga/clopher">https://gitlab.com/ekaitz-zarraga/clopher</a></p>
<p>See you in the next episode!</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>Use the man pages, seriously: <a href="https://linux.die.net/man/1/stty"><code>man stty</code></a> <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:2">
<p>I’ve also been in contact with Mogens, the author of the project, who is
a really good guy and gave me a lot of good information. <a class="footnote-backref" href="#fnref:2" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:3">
<p>But who cares about them anyway? <a class="footnote-backref" href="#fnref:3" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:4">
<p><em>the free encyclopedia</em> <a class="footnote-backref" href="#fnref:4" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
<li id="fn:5">
<p>If it’s free software it’s not stealing and it’s exactly what you are
supposed to do with it. <a class="footnote-backref" href="#fnref:5" title="Jump back to footnote 5 in the text">↩</a></p>
</li>
</ol>
</div>TUI: A look to the deep2019-05-30T00:00:00+03:002019-05-30T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2019-05-30:/clopher03.html<p>Research and work on the Terminal based User Interface of Clopher, the
gopher client.</p><p>This software has been introduced as a Gopher client but, as you can probably
deduce from the previous post, the Gopher part is probably the simplest one.
The complexity comes with user interaction. <em>People are hard</em>. That’s why we
are going to delay that as much as possible, trying to cover all the points in
the middle before we <em>jump to the unknown</em>.</p>
<p>Just joking. In fact, we have to shave many yaks before thinking about user
interaction anyway. This text talks about them.</p>
<h3>Are you talking to me?</h3>
<p>Let’s remember we can classify programs into two different categories like this:</p>
<ul>
<li>
<p><strong>Non-interactive programs</strong>, often called <em>scripts</em>, are programs that take
an input and return an output. There’s no interaction with the user in
between. An example of this could be the command <code>ls</code>.</p>
</li>
<li>
<p><strong>Interactive programs</strong> receive user input and respond to it while they are
running. An example of this could be the machine that
sells you the tickets for the subway: it asks you where you are going, then
tells you the price, takes your money and so on. All of this with the program
constantly running.</p>
</li>
</ul>
<p>Remember what we said about Gopher: it’s a <em>stateless</em> protocol. There’s
no <em>state</em> stored in the server, so all the queries <em>must</em> contain all the info
related to them. <em>Queries are independent</em>.</p>
<p>This somehow leaves the door open to two possible implementations of Clopher.
The <em>non-interactive</em> one would work like <code>curl</code>: getting the <span class="caps">IP</span>, port,
selector string and an optional search string as input, it would open the
connection, retrieve the result and return it.<sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup></p>
<p>But Clopher is designed as an <strong>interactive program</strong>. More like <code>lynx</code>, where
you interactively ask for the pages and have a <em>local state</em> that records your
history and other things. This is a decision, it’s not imposed by the protocol.</p>
<h3>Shell<em>f boycott</em></h3>
<p>There are some different ways to handle user interaction in <span class="caps">TUI</span> based programs.
The simplest one is to read by line, waiting until the user hits <code>ENTER</code> to
read the result. That’s the behaviour of the classic <code>scanf</code> function of C and
many others like <code>input</code> in Python, etc.</p>
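<p>A tiny shell sketch shows that line-buffered behaviour. The piped <code>hello</code> input
simulates a user typing and pressing <code>ENTER</code>; nothing reaches <code>read</code> until the
whole line is available:</p>

```shell
# Nothing reaches `read` until a full line, ENTER included, is available
printf 'hello\n' | {
    read -r line                      # blocks until the whole line arrives
    printf 'you typed: %s\n' "$line"  # only then do we see the input
}
```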
<p>In programs like Clopher, where the design is similar to <code>lynx</code> or <code>vi</code>, this
kind of input makes no sense at all. The program needs to be able to capture
every key pressed by the user and perform actions in response to them. For
instance, in <code>vi</code>, when the user hits <code>i</code> in normal mode it needs to change to
insert mode, and when the user presses <code>i</code> in insert mode it needs to change the
contents of the buffer.</p>
<p>The design of this kind of program is simple to understand: it’s an infinite
loop<sup id="fnref:2"><a class="footnote-ref" href="#fn:2">2</a></sup> where key presses are captured and change the <em>state</em> of the
program. When the user hits the key combination that halts the program, the
loop is broken.</p>
<p>In simple C code the program would look like this:</p>
<pre class="highlight"><code class="language-clike">#include <stdio.h>

int
main(int argc, char * argv[]){
    char c;
    // Create some state
    while(1){
        c = getchar();
        if( c == 'q' ){ // Exit if user pressed `q`
            return 0;
        }
        // Update state here
        putchar(c); // Show the character for debugging
    }
}
</code></pre>
<p>Or the simplified Clojure equivalent:</p>
<pre class="highlight"><code class="language-clojure">(loop [c     (char (.read *in*))
       state (->state)]              ; Create some state
  (when (not= c \q)                  ; Exit if user pressed `q`
    (print c)                        ; Show the character for debugging
    (recur (char (.read *in*))
           (update-state c state)))) ; Update state
</code></pre>
<p>Looks simple, right?</p>
<p>Wait a second, there’s a lot of stuff going on under the hood here. If you run
the code in any <span class="caps">POSIX</span> compatible operating system (I didn’t test on others,
and I won’t) you’ll find the code might not be doing what we expected:
the <code>getchar</code> (or <code>.read</code>) calls will wait until <code>ENTER</code> is pressed and only
then get the characters from the input buffer one by one. But we want to get
them as they come!</p>
<h4>Saints and demons — canonical mode</h4>
<p>In <span class="caps">POSIX</span> operating systems, the input is buffered by default. But that behavior
can be configured following the <span class="caps">POSIX</span> terminal interface, under the names
<strong>canonical</strong> mode and <strong>non-canonical</strong> mode. The mode we are looking for is
the non-canonical mode. You can read more about it in <a href="https://en.wikipedia.org/wiki/POSIX_terminal_interface#Input_processing">the Wikipedia</a>.</p>
<p>The non-canonical mode has some extra options: one controls the minimum number
of characters that must be in the buffer for a <code>read</code> operation to return, and
the other defines the number of tenths of a second to wait for that input<sup id="fnref:3"><a class="footnote-ref" href="#fn:3">3</a></sup>.
Choosing the right value for those fields (<code>c_cc[MIN]</code> and <code>c_cc[TIME]</code>)
depends on the kind of interaction we are looking for.</p>
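<p>You can experiment with these two fields from the shell before writing any
code: <code>stty</code> exposes them as <code>min</code> and <code>time</code>. This is just a sketch (the
<code>dd</code> trick is one of several ways to read a single byte) and it only does
something interesting when run in a real interactive terminal, so it guards
against a non-terminal stdin:</p>

```shell
# Only meaningful on a real terminal; guard so it degrades gracefully
if [ -t 0 ]; then
    saved="$(stty -g)"                    # save current settings for later
    stty -icanon min 1 time 0             # non-canonical: MIN=1, TIME=0
    key="$(dd bs=1 count=1 2>/dev/null)"  # one keystroke, no ENTER needed
    stty "$saved"                         # always restore the terminal!
    printf 'you pressed: %s\n' "$key"
else
    echo 'stdin is not a terminal; run this interactively'
fi
```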
<h4>Make Dikembe smile — blocking</h4>
<p>Setting the <code>c_cc[TIME]</code> field to <code>0</code> means the <code>read</code> operation will wait
indefinitely until the minimum amount of characters defined with <code>c_cc[MIN]</code>
is waiting in the buffer. Together with that, <code>c_cc[MIN]</code> can be <code>0</code>, which
means the read operations will wait until there are <code>0</code> characters in the
buffer or, in other words, they won’t wait at all.</p>
<p>Be aware that both fields can make the read operations on the input buffer
non-blocking, and that will cause them to return
with no value.</p>
<p>In the case of Clopher, I decided to set the <code>c_cc[MIN]</code> to <code>1</code> so the read
operations block until there’s at least one character in the buffer (that means
they will always return something) and the <code>c_cc[TIME]</code> to <code>0</code> so the read
operations have no timeout and will block until a character arrives.</p>
<p>Depending on the application you are developing, you might choose another kind
of blocking configuration. For instance, setting a timeout can let you process
other parts of the system and wait for the input in the same thread.</p>
<h4>We’re talking about practice? — termios</h4>
<p>Now that we know where to find this theoretical configuration, it’s time to put it
into practice. In <span class="caps">POSIX</span> the standard way to access this is via <code>termios</code><sup id="fnref:4"><a class="footnote-ref" href="#fn:4">4</a></sup>.
It has some details that are not specified and depend on the implementation, so
it might have some differences from Linux to <span class="caps">BSD</span> or whatever.</p>
<p><code>tcsetattr</code> and <code>tcgetattr</code> calls can be used to set and read the terminal
configuration via termios. Check this example, compile it and compare it with
the C code of the previous example:</p>
<pre class="highlight"><code class="language-clike">#include <stdio.h>
#include <termios.h>

int
main(int argc, char* argv[]){
    // Get interface configuration to reset it later
    struct termios term_old;
    tcgetattr(0, &term_old);

    // Get interface configuration to edit
    struct termios term;
    tcgetattr(0, &term);

    // Set the new configuration
    term.c_lflag &= ~(ECHO | ECHONL | ICANON | IEXTEN | ISIG);
    term.c_cc[VMIN] = 1;  // Wait until 1 character is in buffer
    term.c_cc[VTIME] = 0; // Wait indefinitely

    // TCSANOW makes the change occur immediately
    tcsetattr(0, TCSANOW, &term);

    char ch = 0; // Initialize so the first comparison is well defined
    while(1){
        if(ch == 'q'){
            // Set the old configuration again and exit.
            // If it's not set back the normal configuration of the
            // terminal will be broken later!
            tcsetattr(0, TCSANOW, &term_old);
            return 0;
        }
        ch = getchar();
        putchar(ch);
    }
}
</code></pre>
<p>All the code has enough comments to be understood, but there are some weird
flags that are better checked in the termios documentation.<sup id="fnref2:4"><a class="footnote-ref" href="#fn:4">4</a></sup></p>
<h3>But this is C code and Clopher is written in Clojure!</h3>
<p>I know, but this is becoming long and boring. Why not wait until I get some
spare time and write the next chapter? You have tons of information to check
until I write it so you won’t be bored if you don’t want to.</p>
<p>See you next.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>In fact, you can navigate the Gopherverse like this with <code>curl</code>. <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:2">
<p>Unsurprisingly called <em>main loop</em>. Programmers are very creative. <a class="footnote-backref" href="#fnref:2" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:3">
<p>That <code>read</code> operation is what <code>getchar</code> is doing under the hood. <a class="footnote-backref" href="#fnref:3" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:4">
<p><code>man termios</code> or visit <a href="https://linux.die.net/man/3/termios">online <code>man</code>
pages</a> <a class="footnote-backref" href="#fnref:4" title="Jump back to footnote 4 in the text">↩</a><a class="footnote-backref" href="#fnref2:4" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
</ol>
</div>Down the rabbit gopher hole2019-05-07T00:00:00+03:002019-05-07T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2019-05-07:/clopher02.html<p>Working on the Gopher protocol implementation and opening the door to
the future problems.</p><p>As the project’s goal was to create a Gopher client, it was time to understand
something about the protocol and read the <a href="https://en.wikipedia.org/wiki/Gopher_%28protocol%29"><span class="caps">RFC</span></a>. No need for you to
know the protocol to understand what I’m going to say here. I think I already
did the difficult part for you.</p>
<h3>Understand some Gopher</h3>
<p>Gopher is a really simple protocol (this doesn’t mean I implemented it
correctly anyway). It’s assumed to work on top of <span class="caps">TCP</span> (the default port is 70) and
it’s as simple as creating a socket, sending the <em>selector string</em> to it
followed by a line terminator<sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup>, and reading everything from it until it
closes. That’s how it works in most cases.</p>
<p>It has two major ways to access the data:</p>
<ol>
<li>
<p><strong>Text mode</strong>, which is used in most of the queries, needs the client to
read from the socket until a line with a single dot (<code>.</code>) appears. Then
the connection is closed.</p>
</li>
<li>
<p><strong>Binary mode</strong>, expects the client to read from the socket until the server
closes it.</p>
</li>
</ol>
<p>Easy-peasy.</p>
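<p>The text-mode rule is easy to try without a network. This sketch fakes a
text-mode response (the content is made up) and stops reading at the lone-dot
line; the <code>tr</code> is only there to keep the <code>sed</code> pattern simple:</p>

```shell
# Simulated text-mode response: two lines of content, the "." terminator,
# and some bytes that should never be read
printf 'hello\r\nworld\r\n.\r\nignored\r\n' |
    tr -d '\r' |        # normalize CRLF line endings to plain newlines
    sed '/^\.$/q'       # print up to (and including) the lone dot, then quit
```

A real client would of course keep the <span class="caps">CRLF</span> handling instead of throwing the carriage returns away.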
<p>Gopher is a stateless protocol and that helps a lot during the creation of the
client. There’s no need to retain data or anything related.</p>
<p><em>Selector strings</em> are what the client wants to see. In order to know which
selections are possible, Gopher defines a specific text format that works as a
menu, and it’s called, unsurprisingly, <em>Menu</em>.</p>
<p>Each element of a Menu has a description of the content, the address or hostname, the port, the
selector string, and a character (usually a number) that indicates its type, all
separated by a <span class="caps">TAB</span> (aka <code>\t</code>) character. Each element goes in one line<sup id="fnref2:1"><a class="footnote-ref" href="#fn:1">1</a></sup>.</p>
<p>Pay attention to the fact that each menu entry contains an address and a port,
that means it can be pointing to a different server!</p>
<p>The <em>type</em>, besides making the client choose between <em>binary</em> and <em>text</em>
mode, also gives the client information about what kind of response it’s going
to get: a menu, an image, an audio file… It also says if the
element is a <em>search endpoint</em><sup id="fnref:2"><a class="footnote-ref" href="#fn:2">2</a></sup>.</p>
<p>Yes, Gopher supports searches!</p>
<p>Well, Gopher supports tons of things because the only rule is that all the
logic is on the server side. You can do whatever you want, if you do it on the server.</p>
<p>Searching is as simple as asking for a text document, but it also adds the
search query to the equation. During a search, the client needs to send the
<em>selector string</em> to select the endpoint and then the <em>search string</em>,
separated by a <code>TAB</code> character.</p>
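<p>In bytes, a search request looks like this. The selector and the search string
here are invented; a real selector comes from a search entry in some server’s menu:</p>

```shell
selector='/search'   # hypothetical search endpoint selector
search='clojure'     # what we are looking for
# selector, TAB, search string, and the usual CRLF terminator
request="$(printf '%s\t%s' "$selector" "$search")"
printf '%s\r\n' "$request" | od -c   # inspect the exact bytes on the wire
```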
<p>There are some more points, but this is more than enough for the moment.</p>
<p>Let’s make something work.</p>
<h3>Make Gopher queries</h3>
<p>Before jumping to Clojure, let’s make sure we understood how this works
with some simple text queries. In a <span class="caps">UNIX</span>-like terminal you can do the following
to navigate the <em>Gopherverse</em>:</p>
<pre class="highlight"><code class="language-bash">exec 5<>/dev/tcp/tilde.team/70
echo -e "~giacomo\r\n" >&5
cat <&5
</code></pre>
<p>This code opens a <span class="caps">TCP</span> socket to <code>tilde.team</code> at port <code>70</code>, sends the selector
string <code>~giacomo</code> followed by the line terminator (<code>\r\n</code>) and prints the
answer. Simple.</p>
<p>You can do some telnet magic instead, which is easier but not as cool as the
other<sup id="fnref:3"><a class="footnote-ref" href="#fn:3">3</a></sup>:</p>
<pre class="highlight"><code>telnet tilde.team 70
~giacomo
</code></pre>
<p>If you run the code you’ll see you can understand the response with your bare
eyes with no parser involved. Isn’t that great?</p>
<p>Notice that in our examples our selector string is <code>~giacomo</code>. Gopher supports
empty strings as selector strings that, in most cases, return a Menu where we
can see which selector strings are valid. Why don’t you try it yourself?</p>
<h3>Move to Clojure</h3>
<p>Now that we understand what’s happening under the hood, it’s time to move to Clojure.</p>
<p>A simple text request can be understood through this piece of Clojure code
(which involves more Java than I’d like):</p>
<pre class="highlight"><code class="language-clojure">; Define the function to make the queries
(defn send-text-request
  [host port body]
  (with-open [sock     (java.net.Socket. host port)
              writer   (clojure.java.io/writer sock)
              reader   (clojure.java.io/reader sock)
              response (java.io.StringWriter.)]
    (.append writer body)
    (.flush writer)
    (clojure.java.io/copy reader response)
    (str response)))

; Make a query and print the result
(println (send-text-request "tilde.team" 70 (str "~giacomo" "\r\n")))
</code></pre>
<p>As you see, it’s not waiting for the dot at the end of the file and it’s not
doing any kind of parsing, error checking or timeout handling, but it works.
This is a minimal (and ugly, clean the namespaces!) implementation for you to be
able to run it in the <span class="caps">REPL</span>.</p>
<h3>Binary or not?</h3>
<p>The binary mode is almost the same, but the output must be handled in a different
way. As Clopher is a terminal based application, I made it store the answer in a file.</p>
<p>There’s a simple and beautiful way to handle temporary files in Java that you
can access from Clojure. As I wasn’t a Java user before I didn’t know this:</p>
<pre class="highlight"><code class="language-clojure">(defn- ->tempfile
  "Creates a temporary file with the provided extension. If extension is
  nil it adds `.tmp`."
  [extension]
  (doto
    (. java.io.File createTempFile "clopher" extension)
    .deleteOnExit))
</code></pre>
<p>With this function it’s really simple to create a temporary file and copy the
download there. It’s also easy to ask the user if they want to store the file
as a temporary file or in a specific path. With the code below, calling
<code>download-file-to</code> works like we described. If <code>destpath</code> is <code>nil</code> a temporary
file is created. Cool.<sup id="fnref:4"><a class="footnote-ref" href="#fn:4">4</a></sup></p>
<pre class="highlight"><code class="language-clojure">(defn download-file-to
  [host port srcpath destpath]
  (with-open [sock   (->socket host port)
              writer (io/writer sock)
              reader (io/reader sock)]
    (.append writer (str srcpath defs/CR-LF))
    (.flush writer)
    (io/copy reader
             (io/output-stream
               (or (io/file destpath)
                   (->tempfile (get-extension srcpath)))))))
</code></pre>
<h3><code>doto</code>, make Java interop less painful</h3>
<p>You probably know what <code>doto</code> does, but it’s interesting enough to talk about it
here. It evaluates the first form and returns its result after applying all the rest
of the forms to it, inserting the first form’s result as their first argument and
discarding the results of those operations. This sounds weird at the beginning, but
in cases like this one, where you are working with mutation, it’s really handy:</p>
<p>We are creating a <code>File</code> instance and returning it after calling
<code>.deleteOnExit</code> on it. Take into consideration that <code>.deleteOnExit</code> returns
nothing, so discarding its return value is great. We want to return the <code>File</code>,
not the result of the <code>.deleteOnExit</code> operation.</p>
<p>Once we know how to deal with <code>doto</code> we can improve the caller with this
function, which creates sockets with a timeout applied that connect automatically:</p>
<pre class="highlight"><code class="language-clojure">(defn- ->socket
  ([host port]
   (->socket host port 10000))
  ([host port timeout]
   (doto (java.net.Socket.)
     (.setSoTimeout timeout)
     (.connect (java.net.InetSocketAddress. host port) timeout))))
</code></pre>
<p>Replacing <code>java.net.Socket</code> from the example above with a call to this function
will make the call handle timeouts, configuring the socket on its creation.</p>
<p>Whatever, right? Better check the code for that. Beware that it may change as I
keep going with the development. Maybe not, it depends on the time I spend on this.</p>
<p>Here’s the link to the code. Relevant part can be found in
<code>src/clojure/clopher</code> in a file called <code>net</code> or similar:</p>
<p><a href="https://gitlab.com/ekaitz-zarraga/clopher">Link to the repository</a></p>
<p>It’s time to move on because this is taking longer than it should. We are just
warming up; let’s keep it simple at the beginning, there will be a chance to
make this complex in the near future.</p>
<p>Hope you enjoyed this post.</p>
<h3>Hey! But what about the Menus?</h3>
<p>Menus are queried like any other text document, so they can be fetched with
this little piece of code. The parsing, processing and so on are only needed for user
interaction, so we’ll deal with that later. Don’t worry. We all have to learn to
be patient.</p>
<p>See you in the next step.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>Line terminator is <span class="caps">CRLF</span> (carriage-return + line-feed), aka <code>\r\n</code>. <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a><a class="footnote-backref" href="#fnref2:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:2">
<p>Don’t be afraid of the types because they are just a few of them. <a class="footnote-backref" href="#fnref:2" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:3">
<p>Remember to jump line after you enter the selector string. <a class="footnote-backref" href="#fnref:3" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
<li id="fn:4">
<p>You have to implement <code>get-extension</code> yourself but you know how to do it. <a class="footnote-backref" href="#fnref:4" title="Jump back to footnote 4 in the text">↩</a></p>
</li>
</ol>
</div>Introducing Clopher2019-05-06T00:00:00+03:002019-05-06T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2019-05-06:/clopher01.html<p>Introducing Clopher, the terminal based Gopher client I’m making.</p><p>When you do a hack (or even a dirty hack) you do it for some reason. You do it
because you understand the complexity of the problem and you see it’s a
complex problem to solve that needs a good enough solution for your case.</p>
<p>You are facing the complexity. You are seeing it. You are seeing the deepness
of the abyss.</p>
<p>This project started a little bit like an exercise to do that. Take a simple
problem: make a Gopher client, and try to solve it in a decent way collecting
information during the process.</p>
<p>It’s just a learning project, but it went wild.</p>
<p>The initial idea was to force myself to use Clojure’s network <span class="caps">API</span>, which is
Java’s one, because I had never used it in the past and I wanted to learn about it
and the possible problems it can have. In order to do that I decided to write a
<a href="https://en.wikipedia.org/wiki/Gopher_%28protocol%29">Gopher</a> client, because that way I’d also have to read the
<a href="https://tools.ietf.org/html/rfc1436"><span class="caps">RFC</span></a> and some more resources.</p>
<p>I sketched the Gopher protocol exchange without many problems, because it’s
quite simple and the <span class="caps">RFC</span> is really well explained. The wild part came with the
rest of the project, which still is under heavy development and <em>it doesn’t
work yet</em> (this sentence may be edited in the future, I hope it will).</p>
<p>I wanted to make a terminal based client, and I had a cool library for this,
called <code>clojure-lanterna</code>, which is just an interface to <code>lanterna</code>, a Java
library for <span class="caps">TUI</span> (Terminal User Interfaces). When I wanted to use
<code>clojure-lanterna</code> I realized the project was kind of abandoned and it didn’t
cover the <span class="caps">UI</span> elements, only the basic screen interface, so I decided to make
that part by myself.</p>
<p>Further than that, I thought that if I focused on only <span class="caps">POSIX</span> compatible
operating systems I wouldn’t need to use <code>lanterna</code> either. So I decided to
implement everything by myself.</p>
<p>That took me to some thoughts I’ve been having these days: When software has
few dependencies or no dependencies at all you have more control over the
process of making it. People who code in popular programming languages have
even more libraries than we need and it’s really hard to stop the temptation to
use them (this explains some recent events with <span class="caps">NPM</span> repositories, for
instance). This is not only about security –possible security breaches in
libraries we use– or control –the fact that we included some software we don’t
know– it’s also about remembering that libraries can’t be software you just
import: they should be read, analysed and, often, thrown away in favor of an
ad-hoc solution. Many times ad-hoc solutions reduce the codebase size and they
solve the problem more accurately, as they are specifically designed to solve
<em>our</em> problem.<sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup></p>
<p>Also, it’s good to tell yourself you can code everything from scratch and try
to prove it true.</p>
<p>In summary, I wanted a project that covered these points:</p>
<ul>
<li>Be a simple Gopher client.</li>
<li>Written in Clojure.</li>
<li>Terminal User Interface (<span class="caps">TUI</span>).</li>
<li>No dependencies if possible.</li>
</ul>
<p>And all of them made some sense, at least in my mind, at the very beginning of
the project.</p>
<h3>So, here we are</h3>
<p>As I said before, the goal is not to create good software. It’s not even to
create something that works. The idea is to learn during the process, and this
post series is a way to put what I learned in an ordered way.</p>
<p>If you follow this post series, you’ll follow me through my research and hacks. We
are going to dive into all those weird concepts that will appear. I’ll try to be
as technically correct as I can, but I’m not an expert and this is not a class.
I’m just sharing my experiences.</p>
<p>I’m looking into the abyss and telling you what I see from this viewpoint,
pointing out the interesting things I spot.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>As a note, while I was writing this, I experienced some issues with
nested dependencies in a different piece of software I was using.
Dependencies can be understood as a tree, with your project at the root. The
deeper the tree is, the longer changes take to travel from the leaves to the
root, because they must be accepted in every node of the affected branch and
developers are busy. This can become a problem, as in the case I experienced:
a bug in a leaf of the tree was fixed, but the root remained broken and
couldn’t solve the issue because it needed an intermediate node to update its
version of the leaf. This <em>hurts</em>.<br>
<em>(They should’ve never added the change in the first place, but when
dependencies go deep it’s more difficult to detect bugs)</em> <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
</ol>
</div>Let’s document2019-02-01T00:00:00+02:002019-02-01T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2019-02-01:/templates-released.html<p>ElenQ Technology document templates released, with a long writeup
about the workflow we use with them.</p><p>At <a href="https://elenq.tech/">ElenQ Technology</a> I just released my documentation templates tool.
You can find it in the link below:</p>
<p><a href="https://gitlab.com/ElenQ/templates">https://gitlab.com/ElenQ/templates</a></p>
<p>I think that project, even if it’s quite simple, is a really good reason to
talk about document editing and my workflow.</p>
<h2>The story</h2>
<p>As part of my job at ElenQ Technology I document a lot of things: I have to
make reports, proposals, documentation for projects, notes for courses…</p>
<p>I have to write <strong>a lot</strong>.</p>
<p>If I could decide, I’d share all my documentation and files in plain text. But
I don’t decide, so I <strong>need</strong> to send <span class="caps">PDF</span> files, and they need to look nice
so the clients understand I take care of my stuff. I also like to pay
attention to the aesthetics of what I do, so I really like to keep everything in order.</p>
<p>That’s really difficult to do. Even more so if you work with tools like
LibreOffice, which have tons of options and menus and are sometimes difficult to
understand or hard to make do exactly what you want. I have nothing
against LibreOffice, but some years ago I realized it’s not a tool for me.
<span class="caps">WYSIWYG</span><sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup> tools like that have some characteristics that don’t fit my
workflow well. Let me break them down:</p>
<ul>
<li>
<p>They are designed to work with a mouse, and I avoid using the mouse because
it makes my wrist and arm hurt. That’s why I often work with my Wacom tablet
in mouse-intensive tasks like <span class="caps">PCB</span> routing and use the laptop’s touchpad in
everyday tasks.</p>
</li>
<li>
<p>They have tons of menus where you can get lost, while most of the
documents you write don’t have that kind of complexity. Often, those
options just make the documents complex and hard to maintain.</p>
</li>
<li>
<p>They don’t have a clear separation between the content and the view. When I
write I like to focus on the content and avoid getting distracted by how it
looks on the screen. I hate “Oh my god, the picture moved and now the whole
layout is broken”-like errors.<sup id="fnref:2"><a class="footnote-ref" href="#fn:2">2</a></sup></p>
</li>
<li>
<p>Their file formats are difficult to operate on, even if they are open
standards. Mixing data with something that comes from a different process is
really complex, which forces the user to write everything by hand.<br>
As an example of this: in the previous version of the <a href="https://gitlab.com/ElenQ/documentation-templates">ElenQ Documentation
Templates</a>, there was a tool to get all the git tags of the
project and insert them as a document changelog. This is really difficult to
make in LibreOffice. (This version doesn’t support that <em>yet</em>).</p>
</li>
</ul>
<p>Trying to solve all those issues, I spent some time with LaTeX as my main
tool, but it also has a really thin separation between the content and the view,
and its learning curve is crazy steep.</p>
<h2>Enter pandoc</h2>
<p>Some day, while working on her PhD, my sister discovered <strong>Pandoc</strong> and our
life changed.</p>
<p><a href="https://pandoc.org/">Pandoc</a> is a great tool which is able to convert between a lot of
different document formats. That opens a world of possibilities where you can
write in a format you like and then convert it to different output formats.
It’s huge. The main power of Pandoc comes from the number of output
formats it can handle. It is possible to write all the content of a document in
a common language like Markdown, <span class="caps">RST</span> or AsciiDoc and then convert it to
different output formats like <span class="caps">PDF</span>, ePub or a simple static website.</p>
<p>All this tooling also lets you write files that are easy to write and read,
like Markdown is, without needing to play with tons of tags and weird commands
as <span class="caps">HTML</span> or LaTeX require.</p>
<p>Pandoc is a really powerful tool with tons of options, which can be quite
overwhelming. It even lets you add filters that transform the <span class="caps">AST</span> it creates
during the conversion!</p>
<p>At the moment we discovered Pandoc, I was really obsessed with productivity and
the chronic pain my hands, wrists and arms were suffering, and I didn’t care
about anything else. If a tool could help me reduce my use of the mouse and
my keystroke count, it was worth the time learning it.</p>
<p>I was so crazy at that time that I made a Vim plugin called
<a href="https://www.vim.org/scripts/script.php?script_id=5374">droWMark</a> for posting in WordPress. Taking advantage of Pandoc
filters I also made it able to <a href="https://github.com/ekaitz-zarraga/droWMark/issues/2">upload images</a> linked from the
MarkDown file. It was fun.</p>
<h2>Choose the tools</h2>
<p>Some time later I founded ElenQ Technology and I decided we needed to integrate
Pandoc in our tooling. That’s why with my sister’s help we created the first
version of the <a href="https://gitlab.com/ElenQ/documentation-templates">documentation templates</a>.</p>
<p>I wanted any person working with the documents to be able to use the editor
they like the most. And I only wanted to care about the aspect of the document
once: during the template creation.</p>
<p>It worked. I spent almost 2 years working with the old version of the templates
and they served me well. The only problem they had was that they needed many
files to work and they added some complexity to the folder where the
documents were edited.</p>
<h2>Choose the tools: remastered</h2>
<p>This new version eliminates that complexity. We needed to sacrifice a couple of
features but now there’s no need to add any extra file in the directory where
the document is. We removed the Makefiles and embedded the <span class="caps">SVG</span> logo of the
company inside the templates using TikZ. Now the tool is just a couple of
Pandoc LaTeX templates: <code>elenq-book</code> template for long documents and
<code>elenq-article</code> for short documents.</p>
<p>Like in the previous version, both templates are designed to create output
LaTeX files that can be compiled to <span class="caps">PDF</span> using XeLaTeX (or let Pandoc do the
conversion for you). The input formats are not defined, the only limitation is
on the metadata they need (you can read the documentation included with the
project for that).</p>
<p>All of this is installed <em>automagically</em> using <a href="https://www.gnu.org/software/stow/manual/stow.html">Stow</a>.</p>
<p>The project also explains in the <code>README.md</code> file how to create a couple of
command line aliases to simplify the calls to Pandoc. You really want to use
them because Pandoc needs <em>a lot</em> of input arguments. Using the aliases, the
conversion is as simple as running a command in the terminal:</p>
<pre class="highlight"><code class="language-bash">elenqdoc-book document.md -o book.pdf # For books
elenqdoc-article document.md -o article.pdf # For articles
</code></pre>
<p>With the new template system, the documents are just Markdown files and they
are easy to maintain under version control. Note that the same input file can
be used to create either an article or a book; the input file doesn’t constrain
the output of the process.</p>
<p>We decided to use Markdown for some extra reasons too. Markdown is simple but
has everything any simple document needs, and it’s easy to read in plain
text even for people who don’t know it. But not only that: Markdown is a widely
used format (<a href="https://gitlab.com/ekaitz-zarraga/personal_blog/raw/master/content/posts/templates-released.md">this blog</a> is written in Markdown too!) and it’s really
extensible, letting the user insert <span class="caps">HTML</span> or LaTeX pieces to cover specific
cases like formulas or complex formatting.</p>
<h2>Choose the tools: future chapters</h2>
<p>The next step is the creation of an invoice control system integrated with
the Pandoc templates. The template integration is really easy: we only need to
inject some variables into the templates, and Pandoc already has a tool for that,
the metadata system. From that side the problem is solved; now we need to build
all the rest.</p>
<p>On the other hand, as said before, if the conversion process ever needs extra
complexity, we’ll just need to add some Pandoc filters to provide it.</p>
<h2>Wrapping up</h2>
<p>In summary, we can say that the tool we made is just a consequence of the
workflow we follow. This is probably not for everyone, but anyone used to
working with the terminal and software is a potential user of this kind of tool.</p>
<p>It’s powerful, simple and straight to the point. I think it fits our
workflow really well.</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p><span class="caps">WYSIWYG</span>: What You See Is What You Get <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:2">
<p>Obligatory xkcd reference: <a href="https://xkcd.com/2109/">https://xkcd.com/2109/</a> <a class="footnote-backref" href="#fnref:2" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
</ol>
</div>Call me maybe2019-01-09T00:00:00+02:002019-01-09T00:00:00+02:00Ekaitz Zárragatag:ekaitz.elenq.tech,2019-01-09:/call-me-maybe.html<p>Recursion, stacks and optimizations.</p><p>Do you remember what happens when you call a function in your program?</p>
<p>What happens when you make too many nested calls?<sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup></p>
<p>When you call your functions, there is some stuff going on in memory (some
variables, the program counter and all that) that must be stored somewhere to
be able to come back, when the function ends, to the place it was called
from. Right?</p>
<p>The place where all that is stored is the stack. You already know all this.
When you call many nested functions, the stack keeps pushing more and more data
with no chance to pop it, so it overflows.</p>
<p>This can happen at any time, but the risk is higher when you call functions
recursively, because by definition they call themselves many times. It can
happen in a non-recursive program too, but most devices can handle deep
nesting, so it’s less likely (on small devices like microcontrollers
you have to take care of this too).</p>
<p>This doesn’t mean recursive functions will always result in a stack overflow.
That only happens when the nesting depth is bigger than the stack size.</p>
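<p>You can see this limit in practice with a quick illustrative sketch (my own,
not from any reference): CPython caps the nesting depth itself and raises a
<code>RecursionError</code> before the real stack overflows, so we can measure
roughly how deep the calls can go:</p>
<pre class="highlight"><code class="language-python">import sys

def depth(n=0):
    # Every call pushes a new frame. CPython refuses to go deeper than
    # sys.getrecursionlimit() frames and raises RecursionError instead
    # of overflowing the real stack.
    try:
        return depth(n + 1)
    except RecursionError:
        return n

print(depth())  # a bit under sys.getrecursionlimit()
</code></pre>
<p>Raising the limit with <code>sys.setrecursionlimit</code> just moves the problem:
go high enough and the interpreter can crash with a genuine stack overflow.</p>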
<blockquote>
<p>You are so stupid the recursive function that calculates your stupidity
causes a stack overflow.<br>
— Heard in a computer science class</p>
</blockquote>
<p>But this is not always true. There are some optimizations that can change this
behaviour and allow you to create stack-safe recursions. Let’s talk about
<strong>tail-call optimization</strong>.</p>
<p>Some programming languages implement tail-call optimization, that, if used
correctly, avoids stack overflows in recursive calls and increase performance.
First of all, in order to be able to make a tail-call optimization, the
function <strong>must</strong> have a call as its last action (tail-call). This means <strong>it
requires to be ordered in an specific way</strong>. Let’s see it with an
(oversimplified) example (in Python, but don’t pay attention to the language):</p>
<pre class="highlight"><code class="language-python">def factorial(a):
    """ This function does not provide a tail call, because the last
    thing executed in it is a multiplication, not a call. """
    if a == 1:
        return 1
    return a * factorial(a - 1)

def factorial_tail(a, acc=1):
    """ This function provides a tail call: the last thing happening
    in it is a function call. """
    if a == 1:
        return acc
    return factorial_tail(a - 1, acc=acc * a)
</code></pre>
<p>As the comments say, the first function is not performing a tail call, but
the second is. But what’s the difference?</p>
<p>The main point is that the first function, <code>factorial</code>, needs to go back up the call
stack to retrieve the previous step’s <code>a</code> value, while the second function doesn’t.
That’s why the second can be optimized and the first cannot.</p>
<p>The optimization exploits this behaviour in a really clever way to avoid the
stack overflows I told you about before. Tail-call optimization just changes the
input parameters of the function and calls it again, replacing the original
call with a new call with different input arguments. This can be done because
the function is written in a way that doesn’t need anything from the previous step.</p>
<p>Imagine that we pass a <code>3</code> to the first and the second function, and let’s
compare the executions. Let’s check <code>factorial</code> first:</p>
<ul>
<li>Call <code>factorial(3)</code><ul>
<li>Call <code>factorial(2)</code><ul>
<li>Call <code>factorial(1)</code></li>
<li>Return <code>1</code></li>
</ul>
</li>
<li>Return <code>2 * 1</code></li>
</ul>
</li>
<li>Return <code>3 * 2</code></li>
</ul>
<p>Now with the <code>factorial_tail</code> function, but without any optimization:</p>
<ul>
<li>Call <code>factorial_tail(3)</code><ul>
<li>Call <code>factorial_tail(2, acc=3)</code><ul>
<li>Call <code>factorial_tail(1, acc=6)</code></li>
<li>Return 6</li>
</ul>
</li>
<li>Return 6</li>
</ul>
</li>
<li>Return 6</li>
</ul>
<p>See the difference?</p>
<p>The <code>factorial_tail</code> call doesn’t need anything from the previous step: the
last <code>factorial_tail(1, acc=6)</code> function call’s result is the same as the
result of the <code>factorial_tail(3)</code> call. That changes everything!</p>
<p>What tail-call optimization does is just change the call arguments and keep
running the same code. There’s no need to store anything on the stack; it just
replaces the function call with the tail call.</p>
<p>Let’s optimize the second call now:</p>
<ul>
<li>Call <code>factorial_tail(3)</code></li>
<li>Replace the call with <code>factorial_tail(2, acc=3)</code></li>
<li>Replace the call with <code>factorial_tail(1, acc=6)</code></li>
<li>Return 6</li>
</ul>
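<p>In effect, the optimized execution behaves like a loop that rebinds the
arguments on each step. Here is an illustrative sketch (my own, not what any
particular compiler actually emits) of what tail-call optimization effectively
turns <code>factorial_tail</code> into:</p>
<pre class="highlight"><code class="language-python">def factorial_tail_optimized(a, acc=1):
    # Instead of calling factorial_tail again, rebind the arguments
    # and jump back to the top: no new stack frame is needed.
    while a != 1:
        a, acc = a - 1, acc * a
    return acc

print(factorial_tail_optimized(3))  # 6
</code></pre>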
<p>This can be stretched even further! It can involve different functions! Anywhere
a tail call is made, even if the called function is a different function,
this kind of optimization can be applied, reducing the stack usage and increasing
the performance.</p>
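<p>Python doesn’t do any of this by itself, but the idea behind optimized mutual
tail calls can be emulated by hand with a <em>trampoline</em>: every tail call
returns a thunk (a zero-argument function) instead of actually calling, and a
driver loop invokes the thunks one by one, so the stack never grows. A minimal
sketch (all the names here are my own invention):</p>
<pre class="highlight"><code class="language-python">def trampoline(f, *args):
    # Keep invoking returned thunks until a non-callable value appears.
    result = f(*args)
    while callable(result):
        result = result()
    return result

def is_even(n):
    # The mutual tail call is returned as a thunk, not performed.
    return True if n == 0 else (lambda: is_odd(n - 1))

def is_odd(n):
    return False if n == 0 else (lambda: is_even(n - 1))

print(trampoline(is_even, 100000))  # True, and no stack overflow
</code></pre>
<p>Calling <code>is_even(100000)</code> directly in Python would blow past the
recursion limit; the trampoline runs the same logic in constant stack space.</p>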
<p>If you want to read more about this, there’s <a href="https://en.wikipedia.org/wiki/Tail_call">a great Wikipedia page on the
subject</a> and there’s <a href="https://www.lua.org/pil/6.3.html">a really good explanation in the book
Programming in Lua</a>.</p>
<p>But how is all this handled by programming languages? you may ask.</p>
<p>The answer is that there’s no single answer: each of them has its own style of
dealing with this. Let me give you some examples.</p>
<p><strong>Python</strong>, just to point out that the language I chose for the example is not the
best example of this, has no tail-recursion elimination. Guido and the
Pythonists<sup id="fnref:2"><a class="footnote-ref" href="#fn:2">2</a></sup> argue that tail-call optimization alters the stack traces (which
is true) and that they don’t like recursion as a basis for programming, so
they try to avoid it. CPython has no tail-call optimization, but they
don’t forbid (they can’t!) any other Python implementation from implementing that
particular optimization. There’s a really <a href="https://neopythonic.blogspot.com/2009/04/tail-recursion-elimination.html">interesting post by Guido van Rossum
about this</a>.</p>
<p><strong>Lua</strong>, as you’ve seen in the <a href="https://www.lua.org/pil/6.3.html">previous link</a>, implements proper tail
calls (as they are called there) and there’s nothing the programmer needs to do
to make sure they are optimized. The only requirement is to write the tail calls correctly.</p>
<p><strong>Scala</strong> implements tail-recursion optimization at compile time, so the
compiler transforms the recursive call into a loop during compilation. That’s
interesting because there’s a compile-time check too. There’s an annotation
called <code>@tailrec</code> that can be used to make sure that your function is going to
be optimized. If the compiler is not able to optimize a function that carries the
<code>@tailrec</code> annotation, it will throw an error. If the function doesn’t have the
annotation, the compiler will simply emit a standard recursion. The <a href="https://docs.scala-lang.org/tour/annotations.html">annotations tour of the Scala language</a> has
some words about <code>@tailrec</code>.</p>
<p><strong>Clojure</strong> is a really interesting case too. Clojure doesn’t implement
tail-call optimization, but it has one (and only one) special form for
non-stack-consuming looping: <code>recur</code>. This special form rebinds the recursion
point’s bindings or arguments and jumps back to the recursion point. The
<em>recursion point</em> can be a <code>loop</code> special form or a function definition. So
it’s just an explicit call to the tail-recursion optimization. The tail call must
be written correctly too: <code>recur</code> is only allowed in tail position, and the compiler
checks that it’s located in the correct place. Also, it has some specific rules
that must be taken into consideration (multiple-arity functions and so on), which
are better <a href="https://clojure.org/reference/special_forms#recur">read in the documentation</a>.</p>
<blockquote>
<p><em>Edited 2019-01-25</em>: Thanks to a discussion in the fediverse about the topic,
I found the moment when <strong>Emacs Lisp</strong> got its tail-call optimization,
with everything explained <a href="https://chrismgray.github.io/posts/emacs-tco/">in the author’s blog</a>. It’s really interesting.</p>
</blockquote>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>Some help: what’s the name of the website you check when you don’t know
how to solve your programming problem? <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:2">
<p>My next music band name. <a class="footnote-backref" href="#fnref:2" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
</ol>
</div>My first time2018-06-23T00:00:00+03:002018-06-23T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2018-06-23:/First-Time.html<p>Thoughts about my first contribution to free software</p><p>The other day I remembered a very important day on my life, one of those early
beginnings that started to change my mind: <strong>The first time I contributed to
free software</strong>.</p>
<p>My first contribution was in 2014, more specifically the 22nd of May of 2014.</p>
<p>That’s only 4 years ago. But, at the same time, have 4 years really passed
since then? <span class="caps">OMG</span>.</p>
<p>You get the feeling, right?</p>
<p>You may think I started coding when I was 10 or something like that. I didn’t.
I learned programming at university, and not as deeply as a computer scientist
would, because I studied Telecommunication Engineering, where computers are just a third
of the studies while the other two thirds are electronics and signal-related things.</p>
<p>I’m not a young hacker or a genius. My parents don’t like computers. I didn’t
live with a computer at home since I was a toddler. That didn’t happen.</p>
<p>Today I want to tell you my story. Not because it’s awesome and you’ll love it.
I want to tell you my story because it’s <strong>really</strong> standard. I want you to see
that you can also contribute to Free Software. Anyone can.</p>
<p>So, how did it all start?</p>
<p>I started my university studies in 2009. The first year we had one semester of
C and the next one of C++. Not real programming classes, just introductory
stuff about the languages and computers. A couple of years later we had a
networking subject where I used Linux for the first time. The computers had
<em>Kubuntu</em> installed. Around that time my laptop started to give me some trouble, so
I installed <em>Kubuntu</em> in a dual boot and tested it. It was nice.</p>
<p>A little later the <em>Windows</em> partition failed again, and I was comfortable
enough in <em>Kubuntu</em> to delete <em>Windows</em> and use only <em>Kubuntu</em>. It was easy.</p>
<p>The second semester that year another subject had some focus on Linux, because
it was a networks and tools subject, and I really needed it. We learned to use a
terminal, some <span class="caps">SQL</span> and many things like that. Simple tools, but they turned out to
be useful in the future. I was really surprised by the power of the terminal,
and I studied so much in my free time that I finished the subject with honours, just
because I was really interested in it. As I said, I’m not a genius; I was interested.</p>
<p>We had a subject about <em>Minix</em>, following Andrew Tanenbaum’s <em>Operating
Systems: Design and Implementation</em> book and <em>Minix</em> version 1, which gave us
the initial knowledge we needed about operating systems at that time. That started
to teach me about the ethical side of free software and also
sparked more interest.</p>
<p>The next year I had a couple of Operating Systems subjects (the theoretical one and
the practical one). The teacher was part of <em><span class="caps">KDE</span> Spain</em>, and he talked about
free software in class. I was quite into it at that time. The practical part of
the subject was real software: we covered the contents of the book called
<em>Advanced Linux Programming</em><sup id="fnref:1"><a class="footnote-ref" href="#fn:1">1</a></sup>. That was pure C development, and we didn’t
have a lot of knowledge of that. We had just touched some C/C++ during the first
year and some assembly in a couple of subjects. It was really hard, but it was
really cool.</p>
<p>We made a small shell. It was great!</p>
<p>Final year<sup id="fnref:2"><a class="footnote-ref" href="#fn:2">2</a></sup> of the university: I had to make the final project.</p>
<p>I didn’t know what to do, so I contacted the teacher who was part of <em><span class="caps">KDE</span> Spain</em>
and he mentored me. I installed an <span class="caps">IRC</span> client and started talking with the
people of the <em>kde-telepathy</em> project. I wasn’t used to that kind of collaborative
development. Heck, I wasn’t used to any kind of development! But it all went
well, mostly thanks to the great people in the project (David, Diane, George,
Martin… <em>You</em> are awesome!).</p>
<p>The project itself was a <em><span class="caps">KDE</span></em> application, <em><span class="caps">KDE</span>-Telepathy</em>, a big one. Thank
heavens, my part of the project was quite separate, so I could focus on my
piece. That taught me to search a big codebase and focus on my part. Then I
had to code in C++ like in real life, not like the designed problems I had
worked on at university, and I also had to read tons of documentation about
<em>Qt</em>, <em><span class="caps">KDE</span></em> and everything else.</p>
<p>I started with the contribution that opened this post and I went on until I had
renewed the whole interface. It wasn’t great, but the code was finally merged
into the application some time later.</p>
<p>Since then, I could say I code almost every day and I’ve been studying many more
languages, but at that time I was relatively new to programming and computers.</p>
<p>With all this I mean:</p>
<blockquote>
<p>If you are interested, try. Everything is going to be fine. You don’t need to
be a genius<sup id="fnref:3"><a class="footnote-ref" href="#fn:3">3</a></sup>.</p>
</blockquote>
<p><a href="https://git.reviewboard.kde.org/r/118256/diff/2#index_header">You can check the contribution
here</a>.</p>
<p>Love.</p>
<p>Ekaitz</p>
<div class="footnote">
<hr>
<ol>
<li id="fn:1">
<p>It’s a great book, by the way. You can find it
<a href="https://mentorembedded.github.io/advancedlinuxprogramming/">online</a>. <a class="footnote-backref" href="#fnref:1" title="Jump back to footnote 1 in the text">↩</a></p>
</li>
<li id="fn:2">
<p>When I studied, right before the <a href="https://en.wikipedia.org/wiki/Bologna_Process">Bologna
Process</a>, university was 5
years long for a Master’s Degree and 3 for a Bachelor’s Degree. <a class="footnote-backref" href="#fnref:2" title="Jump back to footnote 2 in the text">↩</a></p>
</li>
<li id="fn:3">
<p>But congratulations if you are, that way you’ll learn faster and probably
have more reach if you want to. <a class="footnote-backref" href="#fnref:3" title="Jump back to footnote 3 in the text">↩</a></p>
</li>
</ol>
</div>Genesis2018-04-15T00:00:00+03:002018-04-15T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2018-04-15:/Genesis.html<p>About this blog, ElenQ Technology, and myself</p><p>As a first post in this <em>official-but-not-very-official</em> blog I just want to
introduce myself and <em>ElenQ Technology</em>.</p>
<p>First of all, my name is Ekaitz Zárraga and I was born in 1991. I describe my
job as R&D Engineer, but actually I studied Telecommunications Engineering and I
<em>only</em> do my research and development in that area. I’m mostly focused on
programming and computer-related activities, but I can also do some electronics
and other kinds of things. That’s my formal introduction. On the informal side,
I’d say I’ve always been a really curious person, and that made me try other
disciplines like art in its different forms. This last point drives most of
what I’ll write about later in this text. That’s all from me; I’ll write
down an informal resume in the future.</p>
<p><em>ElenQ Technology</em> is a name: a name for the way I am and the
interests I have. That said, it’s also the independent R&D project I’m running.
It’s a different kind of company which aims to raise awareness about ethical
technology, or <a href="https://elenq.tech/en/about.html#ethical-innovation">Ethical Innovation</a>, by example, demonstrating that ethical
companies can be profitable. It’s not simply the way I make my living doing
engineering; it’s also a performance. Like an art piece.</p>
<blockquote>
<p><em>ElenQ Technology</em> is an art piece which tells you that a different model is
possible. It tells you that you have a choice and you don’t need to work in a
corporation and be governed by its rules.</p>
</blockquote>
<p><em>ElenQ Technology</em> is the result of many things I felt while working for other
companies, and it’s also the result of a deep analysis of the state of
technology in my context, which I think can be generalized globally with
decent accuracy.</p>
<p>First, in my immediate context most of the <span class="caps">IT</span> companies have a similar business
model based on <em>body shopping</em>, and they pay really low salaries. Other sectors
are not in a much better position, but the <span class="caps">IT</span> one is outrageous.</p>
<p>The jobs are not 9-to-5 jobs here. Working 10 hours per day is becoming the
norm. The famous <em>Economic Crisis™</em> mixed with deep corruption made people
pray for jobs, and the companies are well aware of that.</p>
<p>That said, you can easily imagine how the tech world works here. <span class="caps">IT</span>
corporations get a ridiculous amount of money while they respect neither their
workers nor their clients. They make proprietary solutions because they don’t
want to lose the projects and let the client be independent. They don’t want
you to be free in any case, because they need to maintain their rotten business model.</p>
<p>That is the general state of the <span class="caps">IT</span> world in my surroundings, but I’m sure, as
I said, it can be generalized; maybe not totally, but there are many points that
can be, mostly because the corporations I mention are present in other countries.</p>
<p>Personally, I had the luck to work in what I thought was a better place.
The working conditions were not as bad as I described, or at least that’s what I
thought. It was an R&D Engineer position in a not very big corporation. I
worked in a small department with fewer than ten co-workers. We made new stuff
for the company. It was fun.</p>
<p>After some time there I realized how it really worked. It wasn’t that different from
other jobs. There were a lot of things I don’t want to share here, but I started
to feel bad there, and my personal situation didn’t help at all. I’ve always
been a really curious person, I love learning new things, and the job simply
wasn’t giving me that as it did at the beginning. I started to need to fill that
gap by spending more time after work doing tech-related stuff in the little free
time I had. The mix of the boring job and the organizational problems we
had made it really depressing.</p>
<p>While I was immersed in that depressing environment, our company wanted more
money and started looking for new businesses with the resources it had.
Our team, as the R&D team, was responsible for the development of the first
<em>proof-of-concept</em> of the new technology. We were asked to analyze the data
that the company had. Literally, we were asked to track people. The company
didn’t care if they were our users or not; we were asked to track <em>everyone</em>.</p>
<p>That was the straw that broke the camel’s back. I left the job because my
ethics are not compatible with tracking people. I don’t like it and I don’t
want to be part of it.</p>
<p>I always wanted to change the way the technology is created and I always
thought it was a great idea to make it by myself and encourage others to do so,
but I never had the courage to do it. What happened gave me the courage I needed.</p>
<p>But, why not simply move to another job?</p>
<p>I think it was the moment to try. I had been refining an idea of what
Ethical Technology is for a while, and I always wanted to apply that idea in my
field: R&D. Also, taking into account that most of the companies where I could
work have the same structural problems, I decided to change it from the root. I
decided to try a different model. Did I really have other options?</p>
<p>Is living in a depressing environment, making the world a worse place to
live in, really an option? Is it?</p>
<p>Think about it.</p>
<p>Now, <em>ElenQ Technology</em> breaks the business model of the companies
around me. It makes ethical technology, and it makes it in an ethical way.
That’s good for me because it lets me work in the fields I like and make the
world a better place to live in, which selfishly will improve the future I
leave for my children if I ever have them. But it’s also good for the clients
<em>ElenQ Technology</em> has, because all the projects are handled in a way
that <em>they</em> own them, following the principles of Free Software and
Hardware and the <a href="https://2017.ind.ie/ethical-design/">Ethical Design manifesto</a>,
with some extra ideas I added to be more specific to the innovation field (you
can read more <a href="https://elenq.tech/en/about.html#ethical-innovation">here</a>).</p>
<p>I want to change the world. Don’t you?</p>
<p>The thing is, I can’t do it alone, so <em>ElenQ Technology</em> wants to
push other people to take the same decision I took (or was forced to take),
and that’s its main goal.</p>
<p>Let’s change the world together.</p>
<h3>About this website</h3>
<p>You have probably already noticed this is not the official <em>ElenQ
Technology</em> website. This is <em>my</em> <em>official-but-not-very-official</em>
blog as part (or head, but I don’t like that word) of <em>ElenQ Technology</em>.</p>
<p>Here I’ll write about <em>ElenQ Technology</em>’s philosophy, goals,
achievements and that kind of <em>official</em> thing, but mostly I’ll write
<strong>about the things we make</strong>. That’s what interests me the most.</p>
<p>This is going to be a really technical place where I’ll try to explain advanced
concepts in a simple way to let you learn stuff with me. I want to share what I
do with you all.</p>
<h3>About languages</h3>
<p>I’ll write all posts in English, but some entries are going to be
translated. Since the blog supports translations, I prefer to keep that option
always enabled so I can add posts translated by the community or, as in other
cases, by myself.</p>[EU] Genesis2018-04-15T00:00:00+03:002018-04-15T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2018-04-15:/Genesis-eu.html<p>Blog <em>ofizial-baina-ez-oso-ofizial</em> honen lehenengo post bezala, nor naizen eta
<em>ElenQ Technology</em> zer den aurkeztu nahi dut.</p>
<p>Hasteko, Ekaitz Zárraga naiz eta 1991. urtean jaio nintzen. Nire burua I+G
ingeniari bezala deskribatzen dudan arren, Telekomunikazio Ingeniaritza ikasi
nuen eta arlo horretan <em>bakarrik</em> egiten dut lan. Bereziki ordenagailuekin
lotutako gauzak egiten …</p><p>Blog <em>ofizial-baina-ez-oso-ofizial</em> honen lehenengo post bezala, nor naizen eta
<em>ElenQ Technology</em> zer den aurkeztu nahi dut.</p>
<p>Hasteko, Ekaitz Zárraga naiz eta 1991. urtean jaio nintzen. Nire burua I+G
ingeniari bezala deskribatzen dudan arren, Telekomunikazio Ingeniaritza ikasi
nuen eta arlo horretan <em>bakarrik</em> egiten dut lan. Bereziki ordenagailuekin
lotutako gauzak egiten ditut, programazioa eta horrelakoak, baina elektronika
eta bestelako gauzak jorratzeko ere gaitasuna daukat. Hori da nire sarrera
formala. Sarrera informalean jakin-min handia dudala esango nuke eta horrek
beste diziplina batzuetan aritzeko aukera eman didala, haien artean artea.
Azkeneko puntu honek garrantzi handia dauka testu honetan idatzitakoarekin
erlazio zuzena izango duelako. Hauxe da nire aurkezpena, etorkizunean kurrikulum
informal bat egingo dut.</p>
<p><em>ElenQ Technology</em> izen bat da, nire interesak eta nire izaera adierazteko izen
bat baino ez. Hori esanda, aldi berean nire I+G proiektu independentea da.
Adibidearen bitartez, enpresa etikoak errentagarriak izan daitezkeela
erakutsiz, teknologia etikoari buruz kontzientzia eratzeko helburua duen
enpresa bat da. Ez da nire lana bakarrik, <em>performance</em> artistiko bat da.
Artelan baten antzera.</p>
<blockquote>
<p><em>ElenQ Technology</em> modelo ezberdin bat egin daitekeela esaten dizun artelan
bat da. Aukerak badaudela eta ez zaudela korporazio baten arauekin lan
egitera behartuta esaten dizu.</p>
</blockquote>
<p><em>ElenQ Technology</em> beste enpresetan lan egitearen eta nire ingurunean
teknologiaren egoeraren analisi bat egitearen emaitza da, baina uste dut nahiko
modu zehatzean orokortu daitekeela.</p>
<p>Hasteko, nire inguruko <span class="caps">IT</span> enpresek <em>body-shopping</em>-ean oinarritutako negozio
eredu antzekoa daukate. Askotan ilegalak diren praktikak egiteaz gain soldata
baxuak ordaintzen dituzte. Beste sektoreak ez dira askoz hobeak baina <span class="caps">IT</span>
munduaren kasua guztiz ankerra da.</p>
<p>Ordutegiak luzatu egin dira azken urteotan. Egunean 10 ordu lan egitea normala
bihurtzen hasi da. <em>Krisi Ekonomiko™</em> famatuaren ondorioz, prekaritatea
normalizatu egin da, gehien bat, enpresek argi daukatelako jendeak lanaren
behar larria daukala.</p>
<p>Hori esanda, erraz ulertu dezakezu teknologiaren mundua nola dagoen. <span class="caps">IT</span>
korporazioek haien bezeroak eta langileak errespetatu gabe dirutza egiten dute.
Bezeroak lotuta izateko produktu propietarioak saltzen dituzte. Haien helburu
nagusia daukaten negozio eredu ustelaren etorkizuna bermatzea da.</p>
<p>Hau da bizi dugun egoera. Lehen esan bezala, erraz orokortu daiteke, aipatutako
enpresa gehienak atzerrikoak direlako eta beste herrialdeetan ere kokatuta daudelako.</p>
<p>Nire kasuan, leku hobe batean lan egiten nuela uste nuen. Lan baldintzak
hobeak ziren bertan. I+G ingeniari postu bat nuen enpresa moderno batean.
Hamar bat pertsonaz osatutako departamendu txiki isolatu bat zen. Enpresaren
jostailu berriak egiten genituen. Nahiko dibertigarria zen.</p>
<p>Denbora pasa ahala, lan baldintzak hain onak ez zirela konturatzen hasi
nintzen. Aipatu nahi ez ditudan gauza asko gertatu zirenez, bertan txarto
sentitzen hasi nintzen eta, gainera, nire egoera pertsonalak ez zuen batere
lagundu. Oso pertsona kuriosoa naiz eta lanak nire jakin-mina asetzeko ematen
zidan aukera desagertzen hasi zen. Nire denbora librean gauza berriak ikasten
eta aztertzen hasi nintzen. Lan aspergarria barneko antolaketa arazoekin batu
zen eta nire bizitza kudeatzea oso zaila egin zitzaidan.</p>
<p>Depresioan murgilduta, enpresak, burtsara ateratzea helburu zuela, bere
negozioa handitu nahi zuen. Gure departamenduak, enpresaren I+G-aren
erantzulea zenez, enpresaren kontzeptu-proba berrien garapena egin behar
zuen. Enpresak zituen datuak aztertzea eskatu ziguten. Pertsonen kokapena
jarraitzea eskatu ziguten, mundu osotik, edozein momentuan. Ez zuten bezero eta
ez-bezeroen arteko ezberdintasunik egin nahi. <em>Guztiak</em> jarraitzeko eskatu ziguten.</p>
<p>Hori gehiegi zen niretzat. Arrazoi etikoengatik utzi nuen lana. Ez dut horretan
parte hartu nahi.</p>
<p>Teknologia eratzen den modua aldatu nahi izan dut beti. Nire kabuz teknologia
egitea eta besteak gauza bera egitera bultzatzea beti egon da nire buruan baina
orain arte ez dut salto hori emateko ausardiarik izan. Gertatutakoak falta
zitzaidan bultzada eman zidan.</p>
<p>Baina, zergatik ez mugitu beste lan batera?</p>
<p>Momentua zela uste dut. Denbora luzez ibili naiz teknologia etikoari buruz
pentsatzen eta nire esparruan, Ikerkuntza eta Garapenean, aplikatu nahi nuen.
Gainera, nire inguruko enpresetan izango nituen arazoak ikusita, zuzenean
sustraira joatea erabaki nuen. Beste modelo bat saiatzea erabaki nuen. Beste
aukerarik al nuen?</p>
<p>Mundua txarrerantz aldatzen giro deprimagarri batean lan egitea benetako aukera
bat al da?</p>
<p>Pentsatu ondo.</p>
<p>Orduan, <em>ElenQ Technology</em>-k nire inguruko konpainien negozio modeloa apurtzen
du. Teknologia Etikoa garatzen du, modu etiko batean. Hori ona da niretzat,
zuzenean, niri gustatzen zaizkidan gauzetan lan egiteko aukera ematen didalako,
etorkizunean eduki ditzakedan umeentzat mundua hobetzen dudan bitartean. Eta
bezeroentzat, proiektuak garatutako teknologia <em>bezeroarena</em> izateko moduan
kudeatzen ditugulako, Software eta Hardware Librearen printzipioak eta
<a href="https://2017.ind.ie/ethical-design/">Diseinu Etikoaren Manifestoa</a> (gure
esparrura moldatuta) jarraituz (gehiago irakurri dezakezu <a href="https://elenq.tech/eu/about.html#ethical-innovation">hemen</a>).</p>
<p>Mundua aldatu nahi dut, zuk ez?</p>
<p>Nik bakarrik ezin dudala mundua aldatu konturatu naiz, beraz, <em>ElenQ
Technology</em>-k besteak nik (behartuta edo ez) hartu nuen erabakia hartzera
bultzatzea du helburu.</p>
<p>Aldatu dezagun mundua guztiok batera.</p>
<h3>Blog honi buruz</h3>
<p>Jada konturatu zara blog hau <em>ElenQ Technology</em>-ren blog ofiziala ez dela.
<em>Nire</em> blog <em>ofizial-baina-ez-oso-ofiziala</em> da, <em>ElenQ Technology</em>-ren parte
(edo buru, baina ez dut hitz hori gustoko) bezala.</p>
<p>Hemen <em>ElenQ Technology</em>-ri buruz idatziko dut, bere filosofia, helburu,
arrakasta, etab.-ei buruz. Baina gehien bat <strong>egiten dugunari buruz</strong> idatzi
nahi dut, hori baita niretzat interesgarriena. Prozesuaren parte egin nahi zaituztet.</p>
<h3>Hizkuntzei buruz</h3>
<p>Blog hau ingelesez idatziko da, baina aukera dago testuak (hau bezala) beste
hizkuntzetara itzultzeko. Blogak baimentzen duen bitartean nahiago dut aukera
prest uztea komunitateak itzulitako testuak gehitzeko edota, kasu honen moduan,
nik egindako itzulpenak igo ahal izateko.</p>[ES] Génesis2018-04-15T00:00:00+03:002018-04-15T00:00:00+03:00Ekaitz Zárragatag:ekaitz.elenq.tech,2018-04-15:/Genesis-es.html<p>Como primer post en este blog <em>oficial-pero-no-muy-oficial</em> sólo quiero
presentarme y presentar <em>ElenQ Technology</em>.</p>
<p>Me llamo Ekaitz Zárraga y nací en 1991. Suelo describir mi trabajo como
ingeniero de I+D pero en realidad estudié Ingeniería de Telecomunicaciones y
<em>sólo</em> trabajo en ese área. Mayormente me centro en actividades relacionadas …</p><p>Como primer post en este blog <em>oficial-pero-no-muy-oficial</em> sólo quiero
presentarme y presentar <em>ElenQ Technology</em>.</p>
<p>Me llamo Ekaitz Zárraga y nací en 1991. Suelo describir mi trabajo como
ingeniero de I+D pero en realidad estudié Ingeniería de Telecomunicaciones y
<em>sólo</em> trabajo en ese área. Mayormente me centro en actividades relacionadas
con los ordenadores como, por ejemplo, programar pero también puedo hacer
electrónica y otras cosas. Esa sería mi presentación formal. En la informal
diría que soy una persona bastante curiosa, lo que me ha hecho investigar y
profundizar en otras disciplinas como el arte en sus diferentes formas. Este
último punto explica mucho de lo que vendrá después en este texto. Y eso es
todo en lo que a mí respecta, ya escribiré un currículum vitae informal en el futuro.</p>
<p><em>ElenQ Technology</em> es un nombre, una forma de llamar a como soy y a los
intereses que tengo. Además, también es un proyecto de I+D que estoy
desarrollando. Es una empresa distinta a las demás, en la que pretendo generar
conciencia acerca de la tecnología ética mediante el ejemplo, demostrando que
las compañías de tecnología ética pueden ser rentables. No es sólo mi trabajo
en el que hago ingeniería, también es una <em>performance</em> artística. Es como una
obra de arte.</p>
<blockquote>
<p><em>ElenQ Technology</em> es un proyecto artístico que te dice que otro modelo es
posible. Te recuerda que tienes elección y que no tienes que trabajar en una
corporación y seguir sus reglas.</p>
</blockquote>
<p><em>ElenQ Technology</em> es, simplemente, el resultado de todas las cosas que he
sentido trabajando para otras compañías y el resultado de un análisis profundo
del estado de la tecnología en mi contexto cercano que, creo, puede ser
extrapolado al resto de lugares con una precisión aceptable.</p>
<p>Para contextualizar, las grandes empresas del mundo <span class="caps">IT</span> en mi zona cercana
tienen un modelo de negocio similar, basado en el <em>body shopping</em> (muchas de
ellas haciendo cesiones ilegales en subcontratas). Pagan unos salarios
bajísimos y las condiciones laborales son lamentables. El resto de sectores
tampoco están mucho mejor, pero el caso de las empresas del mundo <span class="caps">IT</span> es escalofriante.</p>
<p>Los trabajos rara vez son de 8 horas diarias, las jornadas se están alargando
cada vez más y, en muchos, es normal trabajar 10 horas al día. La famosa
<em>Crisis Económica™</em> mezclada con una profunda corrupción ha sido el caldo de
cultivo perfecto para que las grandes empresas se aprovechen de los trabajadores.</p>
<p>Dicho esto, es muy fácil entender cómo funciona el mundo de la tecnología por
aquí. Grandes empresas ganando insultantes cantidades de dinero mientras que no
respetan a sus trabajadores o clientes, soluciones tecnológicas privativas para
atar a los clientes e impedirles ser independientes, etc. Todo para mantener su
modelo de negocio podrido y corrupto hasta la médula.</p>
<p>Ese es el estado de las empresas de tecnología en mi entorno, el de las
grandes. Seguro que puede extrapolarse a otros lugares porque muchas de ellas
operan también en el extranjero.</p>
<p>En mi caso tuve la suerte de acabar en una empresa que parecía un lugar mejor.
Las condiciones eran ligeramente mejores que las que he descrito, o al menos
así lo creía yo. Era un trabajo de Ingeniero de I+D en una empresa no demasiado
grande. Trabajaba en un departamento aislado de menos de 10 personas. Hacíamos
los juguetes nuevos de la empresa. Era divertido.</p>
<p>Después de algún tiempo allí me di cuenta de cómo funcionaba. No era tan
diferente al resto. Hubo muchas cosas que no quiero compartir aquí pero empecé
a sentirme bastante mal y mi situación personal tampoco ayudó mucho. Siempre he
sido una persona curiosa a la que le gusta aprender cosas nuevas y ese trabajo
dejó de aportarme eso como lo hacía al principio. Empecé a necesitar llenar ese
hueco trabajando en mis proyectos personales en el poco tiempo que me quedaba
al día. La suma de un entorno de trabajo aburrido y deprimente más los
problemas organizativos que teníamos era difícil de gestionar.</p>
<p>Sumergido en ese entorno deprimente, la empresa, con intención de salir a bolsa
próximamente, quiso exprimir al máximo sus recursos y plantear nuevos negocios.
Nuestro departamento, como encargado del I+D de la empresa, era el responsable
de plantear las nuevas <em>pruebas de concepto</em>. Nos pidieron que analizásemos los
datos de la compañía. Literalmente, nos pidieron que siguiésemos a la gente,
que los localizásemos. No les importaba que fuesen nuestros clientes o no.
Querían que localizásemos a <em>todos</em>.</p>
<p>Eso fue la gota que colmó el vaso. Tenía que dejarlo porque eso superaba con
creces el límite de mi ética personal. No me gustan esas prácticas y no podía
ser parte de eso.</p>
<p>Llevaba tiempo pensando en la forma en la que hacemos tecnología y siempre me
había apetecido probarlo por mi cuenta. Eso me dio el valor que me faltaba
para hacerlo.</p>
<p>¿Por qué no simplemente cambiar de trabajo?</p>
<p>Creo que era el momento para intentarlo. Como entusiasta del software y
hardware libre, siempre me ha interesado definir lo que es la tecnología ética
y llevaba tiempo con ganas de aplicarlo en mi campo: el I+D. Además, teniendo
en cuenta el estado de las empresas para las que podía trabajar, decidí cambiar
las cosas de raíz. Decidí intentar un modelo distinto. ¿Tenía alguna otra
alternativa en realidad?</p>
<p>¿Es una alternativa real trabajar en un entorno deprimente que hace del mundo
un lugar peor? ¿Seguro?</p>
<p>Piensa en ello.</p>
<p><em>ElenQ Technology</em> rompe entonces con ese modelo de negocio y hace tecnología
ética de una forma ética. Eso es bueno para mí porque me permite trabajar en
los campos que me gustan y hacer del mundo un lugar mejor lo que, egoístamente,
mejorará el futuro que le deje a mis hijos, si algún día los tengo. Al mismo
tiempo esto es bueno para los clientes de <em>ElenQ Technology</em> porque los
proyectos se gestionan de forma que <em>ellos</em> son los dueños de la tecnología que
se crea. Para esto último se siguen los principios del Software y el Hardware
Libre y el <a href="https://2017.ind.ie/ethical-design/">Manifiesto del Diseño Ético</a>
junto con algunas ideas adicionales más específicas del campo al que me dedico
(puedes leer más <a href="https://elenq.tech/es/about.html#ethical-innovation">aquí</a>).</p>
<p>Quiero cambiar el mundo. ¿Tú no?</p>
<p>El problema es que yo no puedo hacerlo solo así que <em>ElenQ Technology</em> es una
forma de hacer que otros tomen la misma decisión que yo tomé (o fui forzado a
tomar) y ese es su objetivo principal.</p>
<p>Cambiemos el mundo juntos.</p>
<h3>Sobre este blog</h3>
<p>Ya te has dado cuenta de que este blog no es el blog oficial de <em>ElenQ
Technology</em>. Esto es <em>mi</em> blog <em>oficial-pero-no-muy-oficial</em> como parte (o
“persona al frente”, pero decirlo así no me gusta) de <em>ElenQ Technology</em>.</p>
<p>Aquí escribiré sobre <em>ElenQ Technology</em>, sobre su filosofía, objetivos, logros
y ese tipo de temas <em>oficiales</em> que me parezcan relevantes pero sobre todo
tengo la intención de escribir sobre las <strong>cosas que hacemos</strong>. Eso es lo que
más me interesa.</p>
<p>Este sitio será un lugar muy técnico en el que trataré de explicar conceptos
avanzados de forma sencilla para que aprendáis conmigo. Quiero compartir lo que
haga con vosotros.</p>
<h3>Sobre los idiomas</h3>
<p>Este blog se escribe en inglés, pero algunas de las entradas (como esta misma)
podrán traducirse a otros idiomas. Como el blog soporta traducciones, prefiero
mantener la opción activa para, si es necesario, añadir traducciones
proporcionadas por la comunidad o hechas por mí mismo, como en este caso.</p>