From 4880f641ade11e60f18f907fdf0660ca349714a7 Mon Sep 17 00:00:00 2001
From: Prefetch
Date: Tue, 15 Nov 2022 22:13:52 +0100
Subject: Publish "Website adventures" part 1 about static site generators
---
.../2022/website-adventures-generators/index.md | 474 +++++++++++++++++++++
1 file changed, 474 insertions(+)
create mode 100644 source/blog/2022/website-adventures-generators/index.md
(limited to 'source/blog/2022/website-adventures-generators')
diff --git a/source/blog/2022/website-adventures-generators/index.md b/source/blog/2022/website-adventures-generators/index.md
new file mode 100644
index 0000000..0aba183
--- /dev/null
+++ b/source/blog/2022/website-adventures-generators/index.md
@@ -0,0 +1,474 @@
+---
+title: "Adventures in making this website:
static site generation"
+date: 2022-11-15
+layout: "blog"
+toc: true
+---
+
+Making and managing this personal website has been an adventure.
+In this series, I go over the technical challenges I've encountered
+and philosophical decisions I've made,
+and I review some of the tools I've used along the way.
+This is part 1, with more posts coming soon.
+
+
+
+If you've ever taken a good look at HTML,
+it should be clear that it isn't fun to write it all by hand.
+Therefore, we invented the Static Site Generator (SSG):
+a program that takes an HTML template and some text
+in a more human-friendly form (usually Markdown),
+and mashes them together, yielding a servable HTML file.
+Of course, it's pretty clear what the best approach to do this is,
+so we only need one SSG, right?
+
+
+
+
+
+Yeah... no, there are [a lot of them](https://jamstack.org/generators/).
+Frustratingly many, in fact.
+Although some of them are highly specialized or advanced,
+most are very similar in scope.
+The three SSGs I'm about to discuss
+are all intended for personal blogs,
+and for many people, there isn't much reason to prefer one over the other.
+My SSG-hopping was due to my niche requirements and perfectionism.
+
+
+
+## Zola
+
+The first SSG I used was [Zola](https://www.getzola.org/),
+which was started in late 2016 (making it the youngest SSG here),
+and advertizes itself as a *"one-stop static site engine"*
+that *"gets out of your way so you can focus on your content"*.
+Written in Rust, it's fast and portable,
+and has convenient features like code syntax highlighting,
+support for multilingual sites, and an optional theme library.
+Similar to other SSGs, you can run `zola serve` to start a local HTTP server
+to test a webpage, which reloads itself each time you make changes;
+this makes development a breeze.
+
+
+
+### Demonstration
+
+An example can say more than a thousand words,
+so, to give an idea of what it's like to use Zola (other SSGs are similar),
+consider this HTML template,
+powered by [Tera](https://tera.netlify.app/):
+
+```html
+{% raw %}
+
+ Example
+
+
+ {% block content %}
+ {{ section.content | safe }}
+ {% endblock content %}
+
+{% endraw %}
+```
+
+This is a standard HTML document, except for the parts in curly braces:
+`{% raw %}{% %}{% endraw %}` is for statements,
+and `{% raw %}{{ }}{% endraw %}` inserts the value of an expression into the output.
+We've created a block `content`, and given it the default definition
+`{% raw %}{{ section.content | safe }}{% endraw %}`,
+where `section.content` is the content of this "section"
+after Markdown-to-HTML conversion
+(Zola distinguishes *sections* and *pages* based on folder structure and file name).
+Then `safe` tells Tera that the string has been fully converted to HTML,
+so no more escaping is necessary.
+
+The point of defining such blocks in the template
+is that you can override them.
+We thus create a page template which inherits the above structure
+(`{% raw %}{% extends "default.html" %}{% endraw %}`)
+and redefines the `content` block.
+This example also highlights Zola's neat table-of-contents generation:
+
+```html
+{% raw %}{% extends "default.html" %}
+{% block content %}
+ {{ page.title }}
+
+
{% for h1 in page.toc %}
+ - {{ h1.title }}
+ {% if h1.children %}
+
+ {% endif %}
+ {% endfor %}
+
+ {{ page.content | safe }}
+{% endblock content %}{% endraw %}
+```
+
+Okay, enough templates, what if we actually want to write something?
+Well, then we just create a Markdown file in the appropriate place
+(I'm being vague, since this isn't intended to be a tutorial)
+with contents like:
+
+```markdown
++++
+title = "Example page"
++++
+
+# Heading 1
+
+Lorem ipsum dolor sit amet...
+```
+
+The part surrounded by `+++` is called the *front matter*,
+and is a common feature in SSGs, allowing you to set parameters
+that can be used by templates or the SSG itself.
+And that's it! If we now run `zola build`,
+our Markdown will be converted to HTML and inserted into the template.
+
+
+
+### Review
+
+Zola's opinionated design pays off:
+it's quite simple, fast and as flexible as your templates allow.
+Speaking of which, of the SSGs I've tried,
+Tera is in my opinion the cleanest and most powerful template engine.
+Zola's main weakness is its obscurity:
+if you need help or want to do complex things,
+there aren't many resources available other than the official documentation.
+But I also think its design is good enough
+that you can probably figure it out.
+
+So, why did I stop using Zola? Due to a giant misunderstanding, that's why.
+In late 2020 I started my [knowledge base](/know/),
+for which I needed the ability to show maths formulas.
+There exist several solutions to put LaTeX maths on websites,
+but back then I didn't understand how they worked and what their prerequisites were,
+so I mistakenly concluded that it wasn't possible with Zola.
+Indeed, Zola doesn't have built-in support for it,
+but it turns out I didn't need that!
+And I only figured that out while writing this post...
+
+I'll go over my maths rendering journey in a future article.
+In the end, my misguided decision to migrate from Zola did have some benefits;
+otherwise my current offline solution wouldn't have been possible.
+But in hindsight, I wish I'd spent more time with Zola,
+and I think it would be my first recommendation to a beginner.
+
+
+
+## Hugo
+
+I migrated to [Hugo](https://gohugo.io/), an SSG written in Go,
+which also markets itself as being fast and flexible,
+and is pretty much feature-equivalent to Zola.
+Hugo is much more popular than Zola;
+in fact, it seems to be one of the most-used SSGs out there.
+
+The reason I chose it is that it allows page contents to be written in one
+of [several formats](https://gohugo.io/content-management/formats/#list-of-content-formats),
+one of which is [pandoc's](https://pandoc.org/) Markdown dialect
+that has built-in support for LaTeX maths.
+So when Hugo is building the site, it passes my `.pdc` file to the external `pandoc` program,
+which converts it to HTML to insert into the template.
+With how Hugo sets this up, the result is that all maths
+get wrapped in HTML `` tags with certain classes,
+which I (wrongly) thought was necessary.
+
+
+
+### Demonstration
+
+The previous page template example would look like this in Hugo,
+after a few small tweaks like adding a navigation bar at the top of the page:
+
+```html
+{% raw %}
+
+ {{ .Title }} | {{ .Site.Title }}
+
+
+ {{ partial "navbar.html" . }}
+ {{ .Title }}
+ {{ .TableOfContents }}
+ {{ .Content }}
+
+{% endraw %}
+```
+
+On the one hand, Hugo provides the convenient `.TableOfContents` variable
+at the cost of control, unlike Zola, where we had to do it manually.
+On the other hand, Hugo doesn't use block-based templates
+and hence lacks a nice inheritance system like Zola's.
+If you want to reuse a snippet, you put it in a file e.g. `navbar.html`
+and include it as `{% raw %}{{ partial "navbar.html" . }}{% endraw %}`.
+Note the dot at the end... I'll get to that.
+Overall, Hugo's approach to organizing templates feels "dumber" than Zola's,
+although, to be fair, my website had become a lot more complicated by then.
+
+Like most SSGs, Hugo uses an external template engine,
+namely the [`text/template`](https://pkg.go.dev/text/template) Go package.
+Disclaimer: I'm not a Go developer, so I don't know how nice this library is for various use-cases;
+here I'm just reviewing whether it's good for HTML templates in Hugo.
+
+Okay, here comes my review: it's awful. No, it isn't a good fit here:
+the whole point of an SSG is to have a pretty sophisticated template language
+to avoid needing to program a custom solution.
+This package is just too primitive,
+so Hugo has shoehorned all sorts of extensions into it to make it usable;
+the result is an opaque mess with still too little flexibility.
+For simple stuff like inserting the site's title somewhere it's fine,
+but I need some pretty non-trivial templates.
+
+To give you a taste, consider the following snippet I wrote,
+which lists the 10 most recent articles from newest to oldest,
+as seen on my knowledge base' [front page](/know/):
+```html
+{% raw %}Most recent articles:
+
+ {{ $concepts := where .RegularPagesRecursive "File.Dir" ">" "know/concept/" }}
+ {{ range first 10 $concepts.ByPublishDate.Reverse }}
+ - {{ .Title | title }}
+ {{ end }}
+
+{% endraw %}
+```
+
+This example captures the messiness of Hugo templates.
+But on a more serious note,
+there are some questionable design decisions here,
+in no particular order:
+
+* `.Title` is a variable, whose value is implicitly set by the surrounding `range` loop.
+ And what if you want to insert *this* page's title?
+ You also write `.Title`, just not inside a loop.
+ So you don't know what some variables contain,
+ unless you pay close attention to the context:
+ you need to be aware that this is happening inside a loop over an array of pages...
+ and not, of course, an array of something else,
+ so don't forget to type-check the loop.
+
+ To be fair, what's going on here is clearly explained
+ [in the docs](https://gohugo.io/templates/introduction/#the-dot):
+ the initial dot refers to the current context,
+ of which `Title` is an attribute.
+ The keywords `range` and `with` swap out the context from under your feet,
+ which I insist is bad design.
+ If you need the global context, it can always be accessed explicitly using `$.`,
+ so at least there's that?
+
+* Meanwhile, `.ByPublishDate` is a function that sorts a list of pages;
+ specifically, it's a *method* in the object-oriented sense.
+ Okay, this works, but why did they add this feature,
+ instead of just telling us to use the more general
+ [`sort`](https://gohugo.io/functions/sort/) function?
+
+ `.Reverse` is another method, which flips the order of an array.
+ I have the same question: why this, instead of a general `reverse` function?
+ As of writing, no such function exists,
+ so if you want to reverse an array... [have fun](https://discourse.gohugo.io/t/reverse-array/28753).
+
+ This awkward split between methods and "normal" functions
+ makes it unclear how things can be composed.
+ Clearly, you can chain methods as `$xs.ByPublishDate.Reverse`,
+ or pipeline functions as `$x | title | sha256`,
+ but how do you mix the two?
+ And what if we need to pass additional arguments,
+ e.g. for [`.Param`](https://gohugo.io/functions/param/)
+ or [`hmac`](https://gohugo.io/functions/hmac/) (WTF, this SSG can do crypto)?
+ I'm sure there exist answers to these questions;
+ my point is that it's unnecessarily confusing.
+
+* Nitpick: for looping over an array,
+ why did the Go developers choose the keyword `range`,
+ instead of `for` like almost all other computer languages?
+ To me, `range` is a function (like in Python),
+ and the order of `range first 10 xs` hence looks weird (coming from Haskell).
+
+ And why do [Hugo's docs](https://gohugo.io/functions/range/)
+ claim that `range` is indeed a function,
+ while the Go [package docs](https://pkg.go.dev/text/template#hdr-Actions)
+ clearly say that it's an *action*?
+ Documentation should be precise!
+
+Maybe, if you're a Go/Hugo veteran, this all just makes sense to you
+(then I'm both jealous and scared of you).
+I just want you to keep my comments in mind
+when I compare the same piece of templating code in the next SSG.
+
+
+
+### Review
+
+I don't mean to shame Hugo's developers with my complaints.
+I suspect it started as a private experiment
+(which would explain their choice of Go's template package for quick development)
+that became too popular too fast.
+Looking at [Jamstack's ranking](https://jamstack.org/generators/) of SSGs,
+we see that Hugo is the only compiled program anywhere near the top,
+and I think that explains its popularity:
+there are a lot of performance-focused programmers out there,
+who don't want an SSG in an interpreted language like JS or Python.
+So when Hugo was released, it quickly attracted many users,
+and also many contributors wanting various features.
+The result is its current messy feel.
+
+But for all my complaining, Hugo's popularity is a big advantage,
+with many tutorials available online in addition
+to the extensive official documentation.
+And, as it claims, Hugo is indeed fast and flexible.
+In practice, you only really interact with an SSG when writing your templates,
+and after that most of your time is spent on actual content,
+so although Hugo may sometimes get in your way during development,
+it might not be a problem for you.
+
+Towards the end of 2022, my maths rendering solution required
+fetching 450KB of compressed JS on page load,
+leading to a visible delay, which I wasn't happy with.
+I'd discovered it was possible to render maths offline,
+so no JS would need to be loaded,
+but this couldn't reasonably be done with Hugo,
+unless I forked it like [this guy](https://graemephi.github.io/posts/server-side-katex-with-hugo-part-2/)
+did... No thanks.
+It was time to change.
+
+
+
+## Jekyll
+
+Compared to Zola and Hugo,
+[Jekyll's](https://jekyllrb.com/) feature set is fairly basic:
+things like an automatic table of contents or multilingual pages
+need to be added using plugins.
+But on the upside it supports plugins at all,
+thanks to being written in Ruby, an interpreted programming language,
+unlike its compiled competitors.
+It's quite mature compared to other popular SSGs,
+and is what I'm using right now,
+since I was able to get statically-rendered maths thanks to a plugin.
+
+
+
+### Demonstration
+
+At first sight, Jekyll's [Liquid](https://shopify.github.io/liquid/) templates
+look very similar to Hugo's, due to its inclusion system:
+
+```html
+{% raw %}
+
+ {{ page.title }} | {{ site.title }}
+
+
+ {% include navbar.html %}
+ {{ content }}
+
+{% endraw %}
+```
+
+But where did our table of contents go? For that, we need
+the [`jekyll-toc`](https://github.com/toshimaru/jekyll-toc) extension
+and pipe all our content through the `toc` filter.
+Compared to Hugo, Jekyll has a trick up its sleeve here:
+you can nest templates inside the `{% raw %}{{ content }}{% endraw %}` object.
+So, let the above `default.html` be a skeleton for the actual page template,
+which we then define as:
+
+```html
+{% raw %}---
+layout: "default"
+---
+
+{{ page.title }}
+
+{{ content | toc }}{% endraw %}
+```
+
+Another template could be nested in `{% raw %}{{ content }}{% endraw %}`, and so on,
+until the innermost block is filled with the converted Markdown,
+which, unlike for Zola and Hugo, can also contain template code.
+This approach is brilliant in my opinion: it's more flexible than Hugo
+and needs less boilerplate than Zola.
+Also, Jekyll doesn't try to guess which template should be used for what.
+
+A standard feature of Markdown is that you can write HTML directly,
+which gets copied verbatim while the rest is converted to HTML too.
+A minor annoyance is that [kramdown](https://kramdown.gettalong.org/),
+Jekyll's converter, insists on parsing the HTML I write,
+but isn't very good at it:
+for the image on [this page](/blog/2022/email-server-revisited/#motivation-1),
+why did it generate a closing `` tag?
+That isn't valid! Fortunately, modern browsers are accustomed to this,
+so it gets interpreted correctly anyway.
+
+And what about my snippet to list the 10 most recent articles?
+In Jekyll it looks [like this](/code/prefetch-jekyll/tree/source/know/index.md):
+
+```html
+{% raw %}{% assign concepts = site.pages | where_exp: "item", "item.layout == 'concept'" %}
+{% assign newest = concepts | sort: "date" | reverse | slice: 0, 10 %}
+Most recent articles:
+
+ {% for item in newest %}
+ - {{ item.title }}
+ {% endfor %}
+
+{% endraw %}
+```
+
+There, isn't that much better?
+Yes, it's a bit more verbose than Hugo,
+but at least it's pretty clear what's going on;
+with the documentation's help, you could write this yourself.
+Now it's bearable for me to write complex templates...
+although sometimes I maybe take it
+[a bit too far](/code/prefetch-jekyll/tree/source/_includes/image.html).
+
+
+
+### Review
+
+Jekyll has been consistently popular for many years, and I can see why.
+Its templating system is elegantly organized and flexible enough for most use-cases,
+it's infinitely extensible with plugins
+and plenty fast despite being written in Ruby.
+Although 5 years older than Hugo, it doesn't feel nearly as messy.
+
+Jekyll deserves some criticism for how it wants you to organize your files by default:
+in both Zola and Hugo, you put your markdown into a dedicated `content` folder,
+but in Jekyll you just dump it all directly into the root,
+meaning that configuration and content aren't clearly separated, which I don't like.
+Fortunately, you can fix this in `_config.yml` by setting `source`.
+
+I'm happy with my choice to move to Jekyll,
+and I don't want to do another migration ever again
+(~200 files would need to be converted).
+My site feels organized now,
+and it builds in seconds, even including the process
+of rendering all LaTeX maths into HTML.
+In fact, I'm so satisfied that I've even published my
+[source code](/code/prefetch-jekyll/tree/)!
+
+
+
+## Conclusion
+
+Making a functioning website isn't as hard as it seems,
+but making a *good* one does take time.
+Most of that time is spent writing HTML templates and especially CSS,
+and realistically, both will need to be tweaked again and again over time.
+But, in my humble opinion, it's worth the effort:
+it's a constructive activity with tangible results.
+I like seeing my work appear in a browser.
+
+Do you want to build your own website using an SSG?
+I recommend that you look at Zola first, with Jekyll a close second,
+if you can deal with its higher complexity.
+Personally, compared to those two, I wouldn't recommend Hugo,
+but for some people (probably Go developers)
+it just might be what they're looking for.
+
--
cgit v1.2.3