General hint: very often a copy of a site is on the Wayback Machine, and as long as the snapshot is not ancient it can be used as a substitute (not really suitable for news sites or highly dynamic content, but technical resources tend to be rather static in nature).
The catch is that any single snapshot may be incomplete, so you may need to try a few to get everything, possibly piecing it together from different snapshots: https://web.archive.org/web/20250000000000*/http://rickfreyconsulting.com/explanation-of-mikrotik-layer-2-firewall-pattern-matchers
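If you do this often, building the Wayback URLs by hand gets tedious. Here is a minimal Python sketch that only constructs the URLs, with no network access. The function names are my own, the wildcard form mirrors the link above, and the availability endpoint is the Internet Archive's public lookup as I understand it, so check it against their documentation before relying on it.

```python
from urllib.parse import urlencode

# Base addresses. The availability endpoint is an assumption to verify
# against the Internet Archive's own documentation.
WAYBACK = "https://web.archive.org/web"
AVAILABILITY_API = "https://archive.org/wayback/available"


def calendar_url(page_url: str, year: int) -> str:
    """Listing of all captures of page_url, opened at the given year."""
    # Four-digit year padded with ten zeros gives the 14-digit stamp,
    # followed by '*' as in the link above.
    return f"{WAYBACK}/{year}0000000000*/{page_url}"


def snapshot_url(page_url: str, timestamp: str) -> str:
    """Direct link to one capture; timestamp is YYYYMMDDhhmmss (or a prefix)."""
    return f"{WAYBACK}/{timestamp}/{page_url}"


def availability_query(page_url: str, timestamp: str) -> str:
    """Query URL asking for the capture closest to the given timestamp."""
    query = urlencode({"url": page_url, "timestamp": timestamp})
    return f"{AVAILABILITY_API}?{query}"


if __name__ == "__main__":
    page = ("http://rickfreyconsulting.com/"
            "explanation-of-mikrotik-layer-2-firewall-pattern-matchers")
    print(calendar_url(page, 2025))
    print(availability_query(page, "20250101"))
```

When one snapshot is missing images or pages, open the listing from `calendar_url` and try neighbouring captures with `snapshot_url`.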
While the linked article is well written, in this context it’s mostly useless. Yes, you can filter PPPoE packets in all the usual L2 ways (MAC address, port, etc.), but for PPPoE (especially if you’re a client) those filters are pretty much all-or-nothing.
So I wouldn’t feel the need to filter PPPoE packets as such. After decapsulation the packets are visible in the normal IP firewall, and that is where filtering is advised; I would go so far as to say mandatory.