Many tilde.club members have placed robots.txt files in their
~/public_html/ directories, unaware that they have no effect, because
crawlers consult robots.txt only at the domain root[1]. Therefore, I
have written a script that consolidates the individual users'
robots.txt files into a unified server-level robots.txt.
This script is now installed and regularly executed on the tilde.club
machine, and everyone is welcome to verify whether their personal
robots.txt is properly incorporated:
<http://tilde.club/robots.txt>
My latest version of the script is always available at:
<http://tilde.club/~ant/pub/botcons>
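
For the impatient, here is a minimal Python sketch of the general
approach. This is NOT the botcons script itself; the /home/*/public_html
layout, the output path, and the /~user/ path prefixing are illustrative
assumptions only:

    #!/usr/bin/env python3
    """Sketch: merge per-user robots.txt files into one server-level file.

    Assumptions (not taken from botcons): home directories live under
    /home, web roots are ~/public_html, and each user's URL prefix is
    /~user/.
    """
    import glob
    import os

    OUTPUT = "/var/www/html/robots.txt"   # assumed location of served file
    PATTERN = "/home/*/public_html/robots.txt"

    def rewrite_rules(user, text):
        """Prefix each Allow/Disallow path with the user's /~user namespace."""
        out = []
        for line in text.splitlines():
            stripped = line.strip()
            key, _, value = stripped.partition(":")
            if key.lower() in ("allow", "disallow") and value.strip():
                path = value.strip().lstrip("/")
                out.append(f"{key.capitalize()}: /~{user}/{path}")
            else:
                # User-agent lines, comments, blank lines, empty Disallow:
                out.append(stripped)
        return "\n".join(out)

    def main():
        sections = []
        for path in sorted(glob.glob(PATTERN)):
            user = path.split(os.sep)[2]  # /home/<user>/public_html/robots.txt
            with open(path, encoding="utf-8", errors="replace") as f:
                sections.append(f"# --- rules from ~{user} ---\n"
                                + rewrite_rules(user, f.read()))
        with open(OUTPUT, "w", encoding="utf-8") as f:
            f.write("\n\n".join(sections) + "\n")

    if __name__ == "__main__":
        main()

The real script may differ in how it locates web roots and validates
each file; treat the above only as an outline of the idea.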
Bug reports and feature requests are welcome in the form of diff
patches or English prose (or verse, for that matter), sent to my e-mail
(ant [at] tilde.club) or in the #club channel on Newnet IRC[2].
1. <https://www.rfc-editor.org/rfc/rfc9309.html#section-2.3>:
The rules MUST be accessible in a file named "/robots.txt" (all
lowercase) in the top-level path of the service.
2. <https://newnet.net/>
--- Synchronet 3.20a-Linux NewsLink 1.2