How else to notify readers? RSS is quite good.
Corporations do not like it, though, because they want to control the spread of information. They prefer that you re-enter the page, get targeted with more ads, or get spoon-fed by their on-site news algorithm.
However, RSS only works if someone knows about your page. YouTube is quite good with RSS, though: you can subscribe to a channel's RSS feed and be notified about new videos, while channels still remain discoverable.
Also, a shameless plug for my own database of RSS feeds:
https://github.com/rumca-js/awesome-database-feeds
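On the YouTube point: every channel exposes an Atom feed at a predictable URL. A minimal sketch using feedparser; the channel ID below is a placeholder, not a real channel:

```python
import feedparser

# Any YouTube channel exposes an Atom feed at this endpoint.
# The channel ID is a placeholder for illustration only.
CHANNEL_ID = "UCxxxxxxxxxxxxxxxxxxxxxx"
FEED_URL = f"https://www.youtube.com/feeds/videos.xml?channel_id={CHANNEL_ID}"

feed = feedparser.parse(FEED_URL)
for entry in feed.entries[:5]:
    print(entry.published, entry.title, entry.link)
```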
It’s not that they don’t like it. It’s more like they don’t care, or it’s not on their radar.
Both ideas can be true. RSS isn't on their radar because, despite its popularity with consumers, they can't find a business purpose for it that aligns with their self-interest; it doesn't yield the user information they require. If I'm running a free podcast, in contrast, I might just be happy that anyone is bothering to visit and listen to what I have to say, regardless of who they are or whether I can assign a monetary value to their attention. Spending money on something without a clear, intentional, measurable ROI is anathema to the predominant modus operandi in business.
Same experience. RSS is by far the most requested resource in my server logs; sometimes up to 70% of total traffic comes from RSS. That said, I wonder how much of that traffic is biological and how much is bots.
I also wonder how much is dead traffic. Dead as in people who add a ton of stuff to their RSS readers but don't actually read it.
RSS is a bit of a black box when it comes to this but maybe that's a good thing.
There's a ton of selection bias going on here. Terence writes mostly about open web standards and topics that would be of significant interest to folks with an RSS feed reader.
Also, RSS readers are generally automated. I know I've had them around for years pulling in articles that I never read. Just as a podcast "listen" is often just an automated download, RSS traffic does not necessarily mean anyone actually read the article, whereas search traffic is generally high intent and at least results in eyeballs on the site, if not actual readers.
> There's a ton of selection bias going on here.
The data doesn't purport to cover anything more than the one website. It's not like any generalisations about other websites are derived from the data. It's just "this is where my hits come from."
> RSS readers are generally automated. I know I've had them around for years pulling in articles that I never read
They try to address that
> I added RSS and Newsletter tracking. These data are very lossy. If someone is subscribed to my RSS feed and opens a post and their client downloads a lazy-loaded image at the end of the post, I get a hit.
Oh yeah, your feed is definitely in my reader. When I click through to your site, I pass ?rss_ref=mydomain in the URL. This way you - and other domains I visit - can know where the RSS visitor is coming from (not foolproof, of course, because your average spammer can also put their domain in the URL).
More importantly, when you see my traffic, the post has definitely been read on your website itself (or at least opened to read). There's no background stuff going on in my reader pulling in feeds without reading them.
My way of reading feeds is more Tinder-style: the latest post from a random feed at a time, then another one, and so on. I feel icky even thinking about a firehose of content coming my way.
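On the site-owner side, that ?rss_ref= convention is easy to pick out of ordinary access logs. A minimal sketch, assuming a common-log-format file and a hypothetical log path, not anyone's actual setup:

```python
import re
from collections import Counter
from urllib.parse import urlparse, parse_qs

# Count rss_ref referrers in a common-format access log.
# The log path and field layout are assumptions for illustration.
def count_rss_refs(log_path="access.log"):
    refs = Counter()
    request_re = re.compile(r'"(?:GET|HEAD) (\S+) HTTP')
    with open(log_path) as fh:
        for line in fh:
            match = request_re.search(line)
            if not match:
                continue
            query = parse_qs(urlparse(match.group(1)).query)
            for ref in query.get("rss_ref", []):
                refs[ref] += 1
    return refs

if __name__ == "__main__":
    for domain, hits in count_rss_refs().most_common():
        print(f"{domain}: {hits}")
```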
I've been toying with LLM-based agents for staying on top of feeds lately. Staying on top of the firehose was always the challenge with RSS readers: a lot of interesting low-frequency stuff from people you follow gets buried under the non-stop feed from high-volume news sources that are actively trying to drown everyone out with many updates per day. Back in the day I solved it by simply not using RSS feeds for high-volume stuff. But I was following a lot of interesting people. I miss that.
This is what led to algorithm-based filtering. Hacker News uses a simplistic algorithm, but it is definitely using one, and it works well enough. It's why I come here. We all collectively vote things up, and what remains is nominally interesting enough to skim from the front page. With a bit of editorializing.
Social networks tried to game the algorithms for ad revenue, which is why they are a lot less popular these days. Sites like Medium, Substack, Tumblr, etc. took over from simple blogs and immediately started raising walled gardens around them to become discovery platforms, add recommendations, and so on.
But at least they support RSS. A lot of websites still do. If you run any kind of website publishing regular news or article content and you don't support a feed, you are being an idiot. It's easy, doesn't really cost anything, and you might actually get people using your feed once in a while. Your site might even have one without you realizing it. Most newspapers have feeds. They are everywhere. The main issue isn't finding them but sifting through them. It always was.
With agent-based approaches, you control the algorithm. That wasn't possible in the past. LLMs can summarize, aggregate, categorize, group, filter, etc.
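As a rough illustration of what "controlling the algorithm" could look like, here is a minimal sketch of LLM-assisted triage over a feed. feedparser is real; call_llm(), the interests list, and the 0-10 scoring scheme are stand-ins for whatever model and prompt you actually use:

```python
import feedparser

# Placeholder for whatever LLM API you use (hosted or local).
# It takes a prompt string and returns the model's text response.
def call_llm(prompt: str) -> str:
    raise NotImplementedError("wire this up to your model of choice")

INTERESTS = "self-hosting, open web standards, RSS tooling"  # example interests

def triage_feed(feed_url: str, threshold: int = 7):
    """Score each entry 0-10 for relevance to INTERESTS and keep the good ones."""
    feed = feedparser.parse(feed_url)
    keepers = []
    for entry in feed.entries:
        prompt = (
            f"My interests: {INTERESTS}\n"
            f"Title: {entry.title}\n"
            f"Summary: {entry.get('summary', '')}\n"
            "Reply with a single integer 0-10 rating how relevant this is."
        )
        try:
            score = int(call_llm(prompt).strip())
        except ValueError:
            score = 0  # unparseable model output: treat as not relevant
        if score >= threshold:
            keepers.append((score, entry.title, entry.link))
    return sorted(keepers, reverse=True)
```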
> a lot of interesting low-frequency stuff from people you follow gets buried under the non-stop feed from high-volume news sources that are actively trying to drown everyone out with many updates per day.
Every major RSS reader supports folders. Your problem is that you engage with RSS as if it were a social media feed, with its single monolithic reverse-chronological feed.
Just don't do that. Stick all the high-volume news feeds in a folder; you can skim the headlines and hit "mark all as read" once you're done, or whenever you don't want to look at the news anymore.
Stick the low-volume things you care about in their own folder, and those will remain unread, in their own ordering, for you to read at your own leisure.
Even for sites that don't offer granular feeds, every major feed reader offers filtering options, and a lot of them offer fairly complex regex filtering.
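And for readers without built-in filters, the same idea takes only a few lines locally. A minimal sketch; the mute patterns and feed URL are made up for illustration:

```python
import re
import feedparser

# Titles matching any of these patterns get skipped; tune to taste.
MUTE = re.compile(r"\b(live ?blog|breaking|sponsored)\b", re.IGNORECASE)

def unmuted_entries(feed_url: str):
    """Yield (title, link) for entries whose titles don't match the mute list."""
    for entry in feedparser.parse(feed_url).entries:
        if MUTE.search(entry.title):
            continue
        yield entry.title, entry.link

for title, link in unmuted_entries("https://example.com/feed.xml"):
    print(title, link)
```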
> This is what led to algorithm-based filtering.
Feed aggregators (and most social media) exist because of discoverability: finding new stuff from people you hadn't heard of before.
> With agent-based approaches, you control the algorithm. That wasn't possible in the past. LLMs can summarize, aggregate, categorize, group, filter, etc.
You'd be spending tens of dollars of compute on something that every major RSS client was doing back in 2006 with less than a single penny's worth of current-day compute.
I'm building Subweb.net (not ready yet; it's just a few test feeds, with the LLM pipeline not turned on yet) to LLM-tag RSS feed items with topic, relevance/interest, location, and translations, and present them as feeds. I'm thinking I could maybe let users specify their own custom prompts and ranking params or similar, though the standard prompt is already fine.
I think the open web needs to come back, but in a fair way for everyone, giving readers control over their feeds while also sending traffic and comments back to the original sources. Not quite sure how to do that yet.
Just a suggestion, but I've been thinking in terms of clustering. If I can figure out that five different posts from five different feeds are about one story, I can collapse them into a single UI element.
I did exactly this with Cohere language embeddings
https://aws.amazon.com/blogs/machine-learning/use-language-e...
I toyed around with using language embeddings as a way to categorize my RSS feeds.
It works pretty well. But importantly, it's so cheap that I have never really seen it on my bill. An earlier prototype used OpenAI embeddings; I loaded $5 in API credits, and after a year the credits expired.
https://aws.amazon.com/blogs/machine-learning/use-language-e...
https://github.com/aws-samples/rss-aggregator-using-cohere-e...
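A minimal sketch of that clustering idea, with a hypothetical embed() helper standing in for whichever embedding API (Cohere, OpenAI, a local model) is actually used; this is a greedy cosine-similarity grouping, not the pipeline from the linked posts:

```python
import math

# Placeholder: swap in your embedding provider of choice.
def embed(texts: list[str]) -> list[list[float]]:
    raise NotImplementedError("call your embedding API here")

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def cluster_titles(titles: list[str], threshold: float = 0.85):
    """Greedily group titles whose embeddings are similar enough to be 'one story'."""
    vectors = embed(titles)
    clusters: list[list[int]] = []  # each cluster is a list of title indices
    for i, vec in enumerate(vectors):
        for cluster in clusters:
            if cosine(vec, vectors[cluster[0]]) >= threshold:
                cluster.append(i)
                break
        else:
            clusters.append([i])
    return [[titles[i] for i in cluster] for cluster in clusters]
```

Posts that land in the same cluster can then collapse into a single UI element, as suggested above.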
Is it possible that the users who used to find us through Google are now satisfied with AI chat summaries and no longer feel the need to click through to the actual page?
Meanwhile, the long-time users who subscribed via RSS are still showing up like they always have. If this is the case, it’s a bit of a sad reality for content creators.
This is probably the case for most personal websites with considerable cachet in their niche.
+ With a strong enough social network you probably don't have to care about SEO as much
You can title your post about bad customer service practices in a unique way without a second thought [0], and your more traditionally titled posts can still make the first page of a Google search with a reasonable query [1].
+ Depending on your niche, your target audience is likely already tapped in well enough not to have to rely on search engines for content catering to their interests.
I feel like search engine use trends along the curve in that meme with the "fool" on one end, the "normie" in the middle, and the "Jedi" on the other end doing the same thing as the fool. Except in this case, "Jedis" only search for what's not already present in their feeds (which don't have to be RSS feeds), and fools can eventually cultivate their own feeds for their interests, reserving search engine use for mundane purposes that essentially fill the role of a pop-culture almanac, phonebook, and portal to Wikipedia.
[0]: https://shkspr.mobi/blog/2026/03/bored-of-eating-your-own-do...
[1]: https://shkspr.mobi/blog/2026/04/does-mythos-mean-you-need-t... — I Googled "mythos and open source". Interestingly, a forum discussion about this post came before it: https://itsfoss.community/t/does-mythos-mean-you-need-to-shu...
From my analytics, around 85% of visits to my sites are now from bots. It's hard to say if these are old-school bots or people just browsing the web via ChatGPT now, but the reality is that very few people actually visit the sites I have anymore.
If you have any data from the past 5 years, I would love to read a blog post about this. Seeing the numbers would be impactful for me.
I think blogs should have a universal comment function. I'd love to engage with the author, but many blogs don't bother providing a basic textarea.
The universal contact function is usually there, and it's email. I've tried many times: it works. Authors respond. Only a very few people seem to want to keep it hidden.
I'm dumbfounded by the number of times I see comments of the form "if the author is reading this..." on a third-party comment site, with a link posted by somebody else, on a forum the author is likely never going to watch, followed by an actually useful comment that you could have _ensured_ the author reads by, you know, just contacting them.
Forum comments are just a recipe for instant spam, and have been for the last 10+ years. If you want to make them useful, they need to be actively policed (not to mention that in several countries you can now be held responsible for the content posted). As an author, all the hassle around them is only worth it if you're trying to build an audience around your blog.
They typically do but are disabled due to the volume of spam and moderation overhead. Send an email.
I think policing all the spam isn't worth the trouble.
Send the author an e-mail.
Ranked yesterday, although not much discussion (18 points, 2 comments) https://news.ycombinator.com/item?id=48022560
Make RSS Great Again. Jokes aside, RSS is a great founding pillar for getting out of today's algorithmic internet.
I'd much prefer a correct sitemap.xml to an RSS feed. Please let me keep up to date with your site/blog/homepage without scraping it!
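For what it's worth, a sitemap with accurate <lastmod> entries is cheap to generate too. A minimal sketch; the page list is a made-up stand-in for however a site actually tracks its content:

```python
from datetime import date
from xml.sax.saxutils import escape

# (url, last_modified) pairs; in practice these come from your site's build step.
PAGES = [
    ("https://example.com/", date(2025, 1, 10)),
    ("https://example.com/blog/hello-world/", date(2025, 1, 2)),
]

def render_sitemap(pages) -> str:
    """Render a minimal sitemap.xml with <lastmod> so readers can poll for updates."""
    entries = "\n".join(
        f"  <url><loc>{escape(url)}</loc><lastmod>{modified.isoformat()}</lastmod></url>"
        for url, modified in pages
    )
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">\n'
        f"{entries}\n"
        "</urlset>\n"
    )

if __name__ == "__main__":
    print(render_sitemap(PAGES))
```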
Shameless low-volume RSS feed drop:
https://brynet.ca/feed.xml
Also, the embedded Mastodon feed on my site uses RSS.
Interesting to see RSS is still relevant, thanks for sharing.