Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Horrible reporting.

The cost to the BBC to keep those pages on its servers is more than $3.99. Just because one guy downloaded all of the pages, compressed them, and then seeded them on bittorrent for a cost of $3.99 to him doesn't mean jack.

Notice that he's not hosting the content in any easily readable form. No, he decided to put that burden on everyone by putting it up on bittorrent. Why isn't he hosting the content? Because hosting a heavily trafficked site ain't cheap.



The fact that it'd cost the BBC millions to maintain those pages means jack.

People were going to lose access to that information, and now they aren't. That's really all that matters. With torrents, the burden is on "everyone" only if they really want to keep it up.

edit: Reasoning for downvotes would be appreciated. The BBC has a long history of recklessly losing valuable data. See kgtm's link to previous yc post on this topic.

edit #2: Oh, okay, I see what's going on. When I said "means jack", I didn't mean to imply that the BBC should spend inordinate amounts to keep the data up. What I meant was: Yes, it might have cost them to do it, but look, like what this guy did, there are other ways to keep the data archived. Hell, I'm sure some people would even settle for a CD archive or something.


It does not mean 'jack', it means money that comes from license fee payers would be spent, I am not justifying their removal, but it's overly simplistic to reduce it to a mere discussion of storage costs, the BBC has a responsibility to maintain and support their offerings, it's not rocket science to see that doing that will cost money. Nice that someone has archived that data, I truly appreciate that, moving forward though I think it would be more useful to look at where money is spent and discuss that real issue, not get sidelined with quips and digs at a truly unique organisation that has long set standards in media of all varieties.

As far as losing valuable data, well okay, they maybe blanked a few copies of Doctor Who here and there, but it was not considered valuable at the time, and as always cost was a factor then as it is now, it's easy to pass judgement with hindsight.


> The BBC has a long history of recklessly losing valuable data

This is quite a fascinating topic in itself. http://en.wikipedia.org/wiki/Category:Lost_BBC_episodes http://en.wikipedia.org/wiki/Doctor_Who_missing_episodes


Erasing master tapes of actual shows is one thing; not bothering to preserve promotional web pages for eternity isn't even in the same ballpark.


In fact I see it more as them picking up after themselves.


I agree that BBC should have done something to preserve the content in some form, not necessarily as easily accessible and readable. Heck, they could've packed it up and put it out as a torrent which they would seed themselves and it would've been better than just dumping it.

I upvoted you because I agree about the content, so I can only speculate about downvotes. If I had to guess, I'd say it's probably because you either missed or ignored the point you replied to, which is that the reporting was, indeed, terrible.


Actually he's probably being downvoted because he's taken a simplistic view to what is a complicated issue, and he's being pretty free with my tax pounds :)


You're saying that archiving the data, or packaging it up in a torrent rather than just throwing it away is a waste of money?


"means jack"

Actually it doesn't mean jack. that's my money they'd be using, and from a quick scan of those websites, i'm happy for them to disappear.

For some reason, many people seem to have decided that all data is important and there's an almost fetishistic devotion to saving everything that can be saved. Not all information needs to be preserved for all time.


> Not all information needs to be preserved for all time.

I disagree, and strongly. Future historians will probably disagree with you too. Even the most inane TV shows can prove to be extremely valuable when trying to decipher the culture of various countries 300 or 3000 years in the past. Imagine reading a trashy romance novel from 1000 BC. You'd learn a lot about the people who lived in that time.

You might counter this by saying that with all the information out there now, we only need to keep the "good stuff". But who decides what's good, right now? How do you know that what you decide is good will always be seen as good? Your personal opinions on the content disappearing are totally irrelevant when perhaps millions of people have seen it or have been affected by it.

All information that is created should be saved, especially since we're able to. Imagine the ancient Romans deciding to burn a bunch of books because it cost too much to hire guards for the library. Ouch.


I agree with you, but practically, not all information can be kept.

There are teams of people right now deciding what is 'good' (i.e. what is kept) and what is shredded. They're called archivists, and, using their judgement in conjunction with government-created records retention schedules, they're culling both government and private records. This culled information is removed for a variety of reasons, space being one of the most prominent. By such means is the pool of available primary sources that will form the basis for future histories created by underpaid and underfunded but well-meaning bureaucrats.

Sounds like the same situation obtains not only in the dead-tree archives but also in the digital ones. Sigh. I wish people would prioritize our cultural legacy first instead of last when it comes to funding.


In your analogy, instead of burning the books they could just dump them in big trash heaps in Egypt, or in jars in Palestine, which correspond to bittorrenting quite nicely.


Mind, I don't think future historians will weep too much about losing some promotional websites for such actual content.


Is the downvote above because someone disagrees, or because the downvoter is just completely unaware of the nature of the sites being taken down?

Most of these sites are promo fluff for BBC TV shows. Some are cancelled, some are for news show with little more "content" than "This show next comes on --- on BBC -".

Most of these sites would be dismissed as "marketing" if the BBC were a for-profit company. Marketing for defunct products. The rest is material that's being legitimately archived elsewhere.


(OT: I didn't downvote you above; in fact, I can't, since you replied to my comment I think. However, note that many of us frequently accidentally downvote instead of upvote and can't undo it, especially on mobile touch-browsers if we're not zoomed-in enough for a big arrow)


I've occasionally done that myself, but I then post a note that I meant to upvote.


We don't have the ability to preserve all information for all time. Storage media decays, formats obsolete, storage imposes cost, space and time considerations that are non-zero and non-trivial.

If these sites were a special instance of our culture, then fine. But they're not. We don't need every single episode of Eastenders archived for all time to understand culture. A good sample is way better.

These are different times to the classical era. We have a glut of information, most of which, once archived, will never be looked at again, and will only cost money until finally abandoned.


"We don't need every single episode of Eastenders archived for all time to understand culture."

Speak for yourself!


"We don't need every single episode of Eastenders archived"

I'm sympathetic to preserving the shows themselves.

Their promotional pages on the BBC site when they're no longer in first-run? Feh.


I went to a museum once where they just kept everything, I learnt 2 things, one, that the first radio batteries came in more varieties than we have today, and two, never go to a museum that try's to keep everything...


How much would it cost to re-host all these sites? If you hosted a snapshot it'd be completely static, a small cluster of Varnish servers would be trivial to set up and could take quite a beating (alternately you could leverage a CDN I guess? I don't know much about such things). Either way I doubt it'd be a huge investment to keep them online as static, archive sites (which is a very different commitment to keeping them actually live, which would obviously incur many more costs).


I'd not expect any of these sites to be heavily trafficed going forward, so my guess: $5/month.

Note that large portions of geocities are being hosted on the web by various parties, most of whom are apparently using pizza money to do it. Geocities was still in the top-100 websites or so when it closed, and they're probably getting decent amounts of traffic from people trying to find old sites.

Even if there is heavy traffic, you're hosting a archival copy, not a production site, so you can just degrade performance as needed. See often slooow web.archive.org :)


Completely agree. I think there'd be an initial spike, but anything after that would well be within the capacity of my server, which is ~£30 a month. Not a high price for preserving the information... but how long would I want to keep it online, especially once it was only seeing a handful of hits a day.


Well there is no reason why all of those files couldn't be made available by the same server that is currently acting as the master seedbox - and thus would still be for $3.99.

No, he decided to put that burden on everyone by putting it up on bittorrent.

Give a man a fish and he can feed his family for a day. Let the man catch his own fish and he can feed his family and community for a lifetime.


How many people every day you think would access a BBC archive sites containing personal memoires of WWII survivors? Maybe 10? 100? Let's call it 10000? You can serve 10.000 people a day with a very inexpensive setup, maybe more than $3.99 a month but definitely not high enough to justify this kind of cuts.


>Why isn't he hosting the content? Because hosting a heavily trafficked site ain't cheap.

Rehosting it isn't legal either, but then neither is torrenting it.


Generally, when the torrenting / duplicating of such data is for preservation purposes, everyone turns a blind-eye, and there's even legislation most places to support such efforts. Nobody would be making money from it while it doesn't exist, and it can be viewed as "historical" or "educational", both of which are usually protected uses of duplication.


>there's even legislation most places to support such efforts.

I've seen previews of suggestions for legislation but no actual legislation for this sort of scenario, can you point me to some.

Making money from something isn't a requirement for copyright infringement, nor even is it the test for commercial use of copyright works.

I've never heard of a "fair use" case on historical (preservation) grounds can you point to one? As for educational use in Europe one generally doesn't have the relatively liberal educational use allowances that one does in the USA, and from hazy memory I think that the USA legislation does tie down quite well what is educational use. Certainly unlicensed redistribution except on the most de minimis basis wouldn't get a pass as fair use for education.

IANA(IP)L and am a little out of touch wrt the latest legislative efforts.

---

OT rant:

Just recently it's interesting but all my posts in which I've tried to appraise people of copyright legislation or at least the potential threat of such have been modded down substantially. I have pretty strong views contrary to the current established laws myself, FWIW, but I feel that even if you're an anarchic dissident it's important to know the laws one is fighting against. Clearly others here disagree that reminders about demands of IP legislation are worthwhile for this type of forum.


I'll hunt around a bit, but I don't know of any off-hand. As to evidence of the attitude, I point to abandonware sites. Technically illegal many times, but companies getting uppity about things they don't sell appearing on abandonware sites is exceedingly rare. Similar to anime companies vs fansubbers prior to the anime being translated and sold in that particular area.

Since you seem like you may be interested: looked at http://www.groklaw.net before? They've had quite a bit of activity on IP laws, last time I checked (as has the rest of the internet, but still).


You can browse the entire archive at www.bbcattic.org




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: