Clang in CI fails to deduce it, so let's help the compiler a bit.
Implement a crude HTML-to-text scraper function to extract plain text from HTML messages, so that the text can be used for indexing.
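
For illustration only, a crude scraper of this kind might look roughly like the sketch below. It is not the function from this change: the name htmlToText, the std::string interface and the small entity table are all assumptions. It drops tags and comments, skips the bodies of script and style elements, decodes a handful of common entities and collapses whitespace. It is not a real HTML parser.

```cpp
#include <algorithm>
#include <cctype>
#include <string>
#include <utility>

// Hypothetical helper (name and signature are assumptions, not the
// project's API): crude HTML-to-plain-text conversion for indexing.
std::string htmlToText(const std::string &html)
{
    // Lowercased copy, used only for case-insensitive tag matching.
    // Offsets are identical to those in the original string.
    std::string lower = html;
    std::transform(lower.begin(), lower.end(), lower.begin(),
                   [](unsigned char ch) { return static_cast<char>(std::tolower(ch)); });

    // Small, deliberately incomplete entity table.
    static const std::pair<std::string, char> entities[] = {
        {"&amp;", '&'}, {"&lt;", '<'}, {"&gt;", '>'},
        {"&quot;", '"'}, {"&#39;", '\''}, {"&nbsp;", ' '},
    };

    std::string out;
    bool pendingSpace = false;
    // Appends one character, collapsing whitespace runs into a single
    // space and never emitting leading or trailing whitespace.
    auto append = [&](char ch) {
        if (std::isspace(static_cast<unsigned char>(ch))) {
            pendingSpace = !out.empty();
            return;
        }
        if (pendingSpace) {
            out += ' ';
            pendingSpace = false;
        }
        out += ch;
    };

    std::size_t i = 0;
    while (i < html.size()) {
        if (html.compare(i, 4, "<!--") == 0) {
            // Skip the whole comment; give up if it is unterminated.
            const std::size_t end = html.find("-->", i + 4);
            if (end == std::string::npos) break;
            i = end + 3;
            continue;
        }
        if (html[i] == '<') {
            const std::size_t end = html.find('>', i);
            if (end == std::string::npos) break; // unterminated tag: drop the rest
            std::string name;
            for (std::size_t j = i + 1;
                 j < end && std::isalnum(static_cast<unsigned char>(lower[j])); ++j) {
                name += lower[j];
            }
            i = end + 1;
            if (name == "script" || name == "style") {
                // Jump to the matching closing tag; its body is not text.
                const std::size_t close = lower.find("</" + name, i);
                if (close == std::string::npos) break;
                i = close;
            }
            append(' '); // every tag acts as a word separator
            continue;
        }
        if (html[i] == '&') {
            bool matched = false;
            for (const auto &entity : entities) {
                if (html.compare(i, entity.first.size(), entity.first) == 0) {
                    append(entity.second);
                    i += entity.first.size();
                    matched = true;
                    break;
                }
            }
            if (matched) continue;
        }
        append(html[i]);
        ++i;
    }
    return out;
}
```

Treating every tag as a word separator is a trade-off that favours indexing: words in adjacent block elements never get glued together, at the cost of occasionally splitting a word that contains inline markup.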