Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (No Skin)
  • No Skin
Collapse
Code Project
  1. Home
  2. The Lounge
  3. RSS experiment...

RSS experiment...

Scheduled Pinned Locked Moved The Lounge
xmlhelpquestionannouncement
12 Posts 5 Posters 0 Views 1 Watching
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • D Offline
    D Offline
    dandy72
    wrote on last edited by
    #1

    I'm pretty sure I brought this up a few months ago, but nothing really came out of it. I'm subscribed to the BBC's news RSS feed. Old school. It's at https://feeds.bbci.co.uk/news/rss.xml I use that feed with Thunderbird. Since a few months ago, every time they "re-publish" an article to make corrections - no matter how minor...it's my understanding that they re-publish it with a different internal ID, so even though it has the same title (and publication timestamp), because the ID is not re-used, republished items are essentially duplicated in the article list. And sometimes triplicated, and quadruplated, and [N+1]plated and so-on. Right now I'm looking at a screenful of titles, and the majority is repeated 2, 3, 4 or even more times. It's annoying, and inflates the number of unread items by hundreds, in a single day. I've settled on Thunderbird as my RSS reader for years, and it's only within the last few months that the BBC link - and only this one - has started showing this. I've even contacted what I think is probably the most technical people at the BBC over the matter, and they didn't even acknowledge receiving anything. Can someone with Thunderbird try subscribing to that feed for a few days, and report back whether they're seeing the same thing? Or someone with a different reader confirm it's okay with theirs? I'm not bringing this up to solicit suggestions for alternative readers. I might be willing to switch readers if I knew for sure the problem didn't manifest itself. But really, and please don't take this the wrong way, I honestly don't want to hear about everybody's favorite RSS reader if you don't know whether the same problem is there or not.

    R J T R 4 Replies Last reply
    0
    • D dandy72

      I'm pretty sure I brought this up a few months ago, but nothing really came out of it. I'm subscribed to the BBC's news RSS feed. Old school. It's at https://feeds.bbci.co.uk/news/rss.xml I use that feed with Thunderbird. Since a few months ago, every time they "re-publish" an article to make corrections - no matter how minor...it's my understanding that they re-publish it with a different internal ID, so even though it has the same title (and publication timestamp), because the ID is not re-used, republished items are essentially duplicated in the article list. And sometimes triplicated, and quadruplated, and [N+1]plated and so-on. Right now I'm looking at a screenful of titles, and the majority is repeated 2, 3, 4 or even more times. It's annoying, and inflates the number of unread items by hundreds, in a single day. I've settled on Thunderbird as my RSS reader for years, and it's only within the last few months that the BBC link - and only this one - has started showing this. I've even contacted what I think is probably the most technical people at the BBC over the matter, and they didn't even acknowledge receiving anything. Can someone with Thunderbird try subscribing to that feed for a few days, and report back whether they're seeing the same thing? Or someone with a different reader confirm it's okay with theirs? I'm not bringing this up to solicit suggestions for alternative readers. I might be willing to switch readers if I knew for sure the problem didn't manifest itself. But really, and please don't take this the wrong way, I honestly don't want to hear about everybody's favorite RSS reader if you don't know whether the same problem is there or not.

      R Offline
      R Offline
      raddevus
      wrote on last edited by
      #2

      My RSS reader (I wrote this long ago using Angular !! :-\ ) and publicly available shows the items one time. try it out at: FreedReadR[^] I added your RSS link to the top edit box (RSS link) and clicked the button [Load (from textbox)] and I saw titles only one time. Does that help at all?

      D 1 Reply Last reply
      0
      • R raddevus

        My RSS reader (I wrote this long ago using Angular !! :-\ ) and publicly available shows the items one time. try it out at: FreedReadR[^] I added your RSS link to the top edit box (RSS link) and clicked the button [Load (from textbox)] and I saw titles only one time. Does that help at all?

        D Offline
        D Offline
        dandy72
        wrote on last edited by
        #3

        You have to let it re-fetch. If an article is updated 3 days after the original publication, that's when it starts to show up as a duplicate. I do appreciate the feedback. [Edit] Given the way your implementation (probably) works, I don't think you'll ever encounter what I'm describing. You have to have previous results stored locally, and then a re-fetch should try to merge the new stuff with what it already had.

        1 Reply Last reply
        0
        • D dandy72

          I'm pretty sure I brought this up a few months ago, but nothing really came out of it. I'm subscribed to the BBC's news RSS feed. Old school. It's at https://feeds.bbci.co.uk/news/rss.xml I use that feed with Thunderbird. Since a few months ago, every time they "re-publish" an article to make corrections - no matter how minor...it's my understanding that they re-publish it with a different internal ID, so even though it has the same title (and publication timestamp), because the ID is not re-used, republished items are essentially duplicated in the article list. And sometimes triplicated, and quadruplated, and [N+1]plated and so-on. Right now I'm looking at a screenful of titles, and the majority is repeated 2, 3, 4 or even more times. It's annoying, and inflates the number of unread items by hundreds, in a single day. I've settled on Thunderbird as my RSS reader for years, and it's only within the last few months that the BBC link - and only this one - has started showing this. I've even contacted what I think is probably the most technical people at the BBC over the matter, and they didn't even acknowledge receiving anything. Can someone with Thunderbird try subscribing to that feed for a few days, and report back whether they're seeing the same thing? Or someone with a different reader confirm it's okay with theirs? I'm not bringing this up to solicit suggestions for alternative readers. I might be willing to switch readers if I knew for sure the problem didn't manifest itself. But really, and please don't take this the wrong way, I honestly don't want to hear about everybody's favorite RSS reader if you don't know whether the same problem is there or not.

          J Offline
          J Offline
          Jeremy Falcon
          wrote on last edited by
          #4

          If it's that important to you, you can spin up a Node.js application as a proxy to actually consume the RSS feeds with something like [rss-parser](https://www.npmjs.com/package/rss-parser) and then re-export them to you however you want via [rss](https://www.npmjs.com/package/rss) for your client to get them. You'd need a way to serialize them, given the fact you're parsing RSS into JSON, then NoSQL is the perfect choice as you can practically serialize the JSON directly.

          Jeremy Falcon

          D 1 Reply Last reply
          0
          • D dandy72

            I'm pretty sure I brought this up a few months ago, but nothing really came out of it. I'm subscribed to the BBC's news RSS feed. Old school. It's at https://feeds.bbci.co.uk/news/rss.xml I use that feed with Thunderbird. Since a few months ago, every time they "re-publish" an article to make corrections - no matter how minor...it's my understanding that they re-publish it with a different internal ID, so even though it has the same title (and publication timestamp), because the ID is not re-used, republished items are essentially duplicated in the article list. And sometimes triplicated, and quadruplated, and [N+1]plated and so-on. Right now I'm looking at a screenful of titles, and the majority is repeated 2, 3, 4 or even more times. It's annoying, and inflates the number of unread items by hundreds, in a single day. I've settled on Thunderbird as my RSS reader for years, and it's only within the last few months that the BBC link - and only this one - has started showing this. I've even contacted what I think is probably the most technical people at the BBC over the matter, and they didn't even acknowledge receiving anything. Can someone with Thunderbird try subscribing to that feed for a few days, and report back whether they're seeing the same thing? Or someone with a different reader confirm it's okay with theirs? I'm not bringing this up to solicit suggestions for alternative readers. I might be willing to switch readers if I knew for sure the problem didn't manifest itself. But really, and please don't take this the wrong way, I honestly don't want to hear about everybody's favorite RSS reader if you don't know whether the same problem is there or not.

            T Offline
            T Offline
            trønderen
            wrote on last edited by
            #5

            Years ago, a couple of Norwegian newspapers claimed to be open to readers' comments without any censorship. But whenever someone posted a comment they didn't like, they made a small 'edit' of the article, giving it a new article number, a new URL ... and a new, initially empty, list of user comments. If you read the original URL, while still available in your history list, you would see it with the first user comments, up to and including the one that offended the editors. If you selected the same article from the teasers on the front page, you would see only the new comments, submitted after the offending one. There was no way to access the original version (and comments) unless you happened to have saved the article number/URL. The free text search function only returned the most recent version, with no back link to the older versions. In those days, I did write a few comments, and I certainly didn't always agree with the editors. So I got used to articles being 'edited' after I added my comment. Therefore, I got several chances to save a copy of the article before I submitted my comment, so that I could compare it to the subsequent 'edited' version with a new comment chain. In every case I checked, there was no edit at all. For the one newspaper - one that I even subscribed to the paper edition - I made a complaint to the newspaper. They were so rude that they returned to me the URL of the old, original article, with the early comments up to mine, pointing: Look there, your comment _is_there - we are not doing any censorship! But anyone who opened the article from the front page would find it without my comment. That pi**ed me off so much that I canceled my subscription to the paper edition and stopped reading the web edition.

            Religious freedom is the freedom to say that two plus two make five.

            1 Reply Last reply
            0
            • D dandy72

              I'm pretty sure I brought this up a few months ago, but nothing really came out of it. I'm subscribed to the BBC's news RSS feed. Old school. It's at https://feeds.bbci.co.uk/news/rss.xml I use that feed with Thunderbird. Since a few months ago, every time they "re-publish" an article to make corrections - no matter how minor...it's my understanding that they re-publish it with a different internal ID, so even though it has the same title (and publication timestamp), because the ID is not re-used, republished items are essentially duplicated in the article list. And sometimes triplicated, and quadruplated, and [N+1]plated and so-on. Right now I'm looking at a screenful of titles, and the majority is repeated 2, 3, 4 or even more times. It's annoying, and inflates the number of unread items by hundreds, in a single day. I've settled on Thunderbird as my RSS reader for years, and it's only within the last few months that the BBC link - and only this one - has started showing this. I've even contacted what I think is probably the most technical people at the BBC over the matter, and they didn't even acknowledge receiving anything. Can someone with Thunderbird try subscribing to that feed for a few days, and report back whether they're seeing the same thing? Or someone with a different reader confirm it's okay with theirs? I'm not bringing this up to solicit suggestions for alternative readers. I might be willing to switch readers if I knew for sure the problem didn't manifest itself. But really, and please don't take this the wrong way, I honestly don't want to hear about everybody's favorite RSS reader if you don't know whether the same problem is there or not.

              R Offline
              R Offline
              RedDk
              wrote on last edited by
              #6

              I always thought a Thunderbird was some sort of American Indian symbol for a diety. I have access to RSS BBC however and the list of current 49 headliners (beginning with the word "Tributes" and ending with "sweeps in" show exactly zero duplicates. I'll keep an eye on the feed as you request; given that the weekend is upon us ... and let you know how things are going ... shall we say ... next week.

              D 1 Reply Last reply
              0
              • J Jeremy Falcon

                If it's that important to you, you can spin up a Node.js application as a proxy to actually consume the RSS feeds with something like [rss-parser](https://www.npmjs.com/package/rss-parser) and then re-export them to you however you want via [rss](https://www.npmjs.com/package/rss) for your client to get them. You'd need a way to serialize them, given the fact you're parsing RSS into JSON, then NoSQL is the perfect choice as you can practically serialize the JSON directly.

                Jeremy Falcon

                D Offline
                D Offline
                dandy72
                wrote on last edited by
                #7

                Jeremy Falcon wrote:

                If it's that important to you,

                ...not to the extent that I want to start writing my own middleware to fix another program's failures. :-)

                1 Reply Last reply
                0
                • R RedDk

                  I always thought a Thunderbird was some sort of American Indian symbol for a diety. I have access to RSS BBC however and the list of current 49 headliners (beginning with the word "Tributes" and ending with "sweeps in" show exactly zero duplicates. I'll keep an eye on the feed as you request; given that the weekend is upon us ... and let you know how things are going ... shall we say ... next week.

                  D Offline
                  D Offline
                  dandy72
                  wrote on last edited by
                  #8

                  RedDk wrote:

                  I have access to RSS BBC

                  I hope you're using the feed I provided. There's no telling whether another feed will show the same problem or not. But then, if an alternate feed doesn't have the problem, then by all means, I'm willing to switch feeds. :-)

                  R 2 Replies Last reply
                  0
                  • D dandy72

                    RedDk wrote:

                    I have access to RSS BBC

                    I hope you're using the feed I provided. There's no telling whether another feed will show the same problem or not. But then, if an alternate feed doesn't have the problem, then by all means, I'm willing to switch feeds. :-)

                    R Offline
                    R Offline
                    RedDk
                    wrote on last edited by
                    #9

                    If you hadn't posted the feed I have access to I'd have downvoted your Lounge lizard and suggested you get a hold of jan_hus over in the LINUX forum to ask him how his mauve thing fares ;)

                    1 Reply Last reply
                    0
                    • D dandy72

                      RedDk wrote:

                      I have access to RSS BBC

                      I hope you're using the feed I provided. There's no telling whether another feed will show the same problem or not. But then, if an alternate feed doesn't have the problem, then by all means, I'm willing to switch feeds. :-)

                      R Offline
                      R Offline
                      RedDk
                      wrote on last edited by
                      #10

                      Refreshed BBC News - News Front Page ... (Last refreshed @ 10:26:47 on 08/29/2024): 46 items none of which were duplicated in the list "Outdoor ... to ... car crash" :)

                      D 1 Reply Last reply
                      0
                      • R RedDk

                        Refreshed BBC News - News Front Page ... (Last refreshed @ 10:26:47 on 08/29/2024): 46 items none of which were duplicated in the list "Outdoor ... to ... car crash" :)

                        D Offline
                        D Offline
                        dandy72
                        wrote on last edited by
                        #11

                        Thanks for the follow-up. You mentioned refreshing...for what it's worth, I don't tell Thunderbird to refresh, it just does so automatically on its own every couple of hours. It's running 24/7 and mostly just sitting in the background.

                        R 1 Reply Last reply
                        0
                        • D dandy72

                          Thanks for the follow-up. You mentioned refreshing...for what it's worth, I don't tell Thunderbird to refresh, it just does so automatically on its own every couple of hours. It's running 24/7 and mostly just sitting in the background.

                          R Offline
                          R Offline
                          RedDk
                          wrote on last edited by
                          #12

                          Yeah, my interface is a typical Windows application standing alone purchased from a well known purveyor of a proprietary language. And their thing, as such being a language, is probably to provide inroads to automaticity of just such a process as a TSR. The fact that I don't complain about having to depress a button on a Windows form shouldn't indicate that I'm probably not going to like something else I'd run into down that road that'll cause lightning bolts to fly from my rearend, it just means I don't know everything about their language yet. But I keep a fire extinguisher handy.

                          1 Reply Last reply
                          0
                          Reply
                          • Reply as topic
                          Log in to reply
                          • Oldest to Newest
                          • Newest to Oldest
                          • Most Votes


                          • Login

                          • Don't have an account? Register

                          • Login or register to search.
                          • First post
                            Last post
                          0
                          • Categories
                          • Recent
                          • Tags
                          • Popular
                          • World
                          • Users
                          • Groups