pewresearch.org

Kissaki , to Technology in Online Content Is Disappearing
@Kissaki@beehaw.org avatar

54% of Wikipedia pages contain at least one link in their “References” section that points to a page that no longer exists.

My impression was/is that over the last years/decade Wikipedia made efforts to/switch to not linking directly but extending direct links with (dated) Web Archive links or using Web Archive links directly (dated as "sourced from this in this state; which protects against upstream edits too).

tiago ,
@tiago@beehaw.org avatar

Have not contributed enough to Wikipedia! I'd like to implement this on links I come across.

How does one get an Archive link for a specific date and website? I tried snooping around but am not familiar

Kissaki , (edited )
@Kissaki@beehaw.org avatar

Wikipedia has guidance for it as Citing sources. Regarding web links specifically section Handling links and Preventing and repairing dead links.

The Web Archive "Wayback Machine" is available at web.archive.org. It has a "Save Page Now" action too.

https://web.archive.org/web/*/beehaw.org gives you a history of archived versions of that URL.

The Web Archive "Wayback Machine" is a project from archive.org, which does much more in archiving and accessibility efforts. An alternative service for websites is https://archive.ph/.

zooi , to Technology in Online Content Is Disappearing

Donate to the internet archive!

avidamoeba ,
@avidamoeba@lemmy.ca avatar

This is important. I signed up a week ago.

K1nsey6 , to Technology in Online Content Is Disappearing
@K1nsey6@lemmy.world avatar

This allows the ruling class to write history as they see fit.

adespoton , to Technology in Online Content Is Disappearing

This isn’t helped by most websites reinventing themselves every couple of years so the old links 404 even though the content still exists.

brisk ,

If anyone is considering how to avoid this on their own site: https://indieweb.org/URL_design

ZeffSyde , to Technology in Online Content Is Disappearing

Oh, thank fuck. David Bowie's Area is still online.

ColeSloth , to Technology in Online Content Is Disappearing

If Ian's shoelace site dissapears, I'ma bounce too.

possiblylinux127 , to Technology in Online Content Is Disappearing
@possiblylinux127@lemmy.zip avatar

This is why we need the internet archive

blazeknave ,

You should see it in person. Just drove by it today. Support them!

possiblylinux127 ,
@possiblylinux127@lemmy.zip avatar

What do you mean

lauha ,

He means you should see the internet archive in person.

OhmsLawn ,

It's in San Francisco.

I've seen it, but hadn't realized that it was open to visitors.

lvxferre ,
@lvxferre@mander.xyz avatar

Yes. And wikis, too.
We (people in general) have a tendency to share stuff in forums, like Lemmy. That's fine in the short term, but in the long term this stuff should be sorted, organised, and preferably mirrored. Wikis are perfect for that, while the internet archive is more like "bulk" storage.

Telodzrum , (edited )

Wikis are not really a defense against this issue, they are by nature a secondary or (occasionally by policy) a tertiary source of information. Once the source they are recording dies so does the value of that page on the wiki. From the OP:

54% of Wikipedia pages contain at least one link in their “References” section that points to a page that no longer exists.

lvxferre ,
@lvxferre@mander.xyz avatar

There's nothing intrinsically non-primary in the format. At the end of the day they're collaborative writing projects, split into pages with internal and external links; it's just that the biggest one out there happens to be tertiary.

And I believe that they could help a lot with this issue if people migrated/copied meaningful info from forums (like Lemmy) to wikis. Forums are good for discussion, but they tend to accumulate a lot of trash; having the good content sieved and sorted in a wiki makes it more accessible for everyone.

Telodzrum ,

There’s nothing intrinsically non-primary in the format. At the end of the day they’re collaborative writing projects, split into pages with internal and external links; it’s just that the biggest one out there happens to be tertiary.

This is an accurate point. Thanks for the correction. I think what I should have said is that the biggest one has that policy and, as a result, there is a trend of others following suit.

LainTrain ,

This is why Discord is poison to our shared pool of knowledge, it's such a black hole for many games and software (especially ironically enough open source projects) in lieu of decent docs.

lvxferre ,
@lvxferre@mander.xyz avatar

Thanks for giving me another bone to pick against Discord ¬¬
Seriously. Fuck Discord.

Agrivar ,

Ugh!

The worst part is, after wasting a bunch of time tracking down the correct Discord server to ask a question about a piece of software, you generally get lambasted by the "regulars" of that server to "just use the search feature, that's what it's for!"

Yeah, no. I don't want to wade through a reverse chronology of a bunch of conflicting back-and-forth conversations - just gimme a FAQ or some actual documentation!!!

Suavevillain , to Technology in Online Content Is Disappearing
@Suavevillain@lemmy.world avatar

It is more important to archive things you like if you got the space. Even if you don't plan on using it for a long time.

some_guy , to Technology in Online Content Is Disappearing

And now with Google regurgitating a summary of the content they've crawled there will be no incentive to publish because no one will click through to get ad payments.

thecodemonk ,

We need to revive the days people write blog posts to help others instead of pushing ads to make money. The content was far better.

Facebones ,

I started a home server on an RPI running some services with a dinky HTML page on it, I need to start actually posting to it. I'm already not on fb or Twitter or anything, if you want to know what I'm doing go to my homepage loser! 🤣

toothpicks , to Technology in Online Content Is Disappearing

Yeah. Might need to start backing up stuff myself. I noticed a lot of old YouTube videos go missing as well.

N01R3 , to Technology in Online Content Is Disappearing

I remember a small RPG maker game that I no longer can find on the web, let alone anything that used to be hosted on FreewareFiles or Raymond.cc...

moon ,

I used to be on the rpg maker forums back in the day, can you loosely describe it to me?

N01R3 ,

I can give you its name too: End of the World, Part 1 and End of the World Part 2. It was a basically a Final Fantasy clone/attempt that I thought when I was younger was pretty good. Can't remember much about what made it unique though aside from a hidden stick figure fight right outside the castle.

morrowind , to Technology in Online Content Is Disappearing
@morrowind@lemmy.ml avatar

Certain types of tweets tend to go away more often than others. More than 40% of tweets written in Turkish or Arabic are no longer visible on the site within three months of being posted.

I've read this is a major problem in Facebook as well, they lack good moderation for these languages and especially the Arabic script and so just remove things heavy handedly to be safe.

Jode , to Technology in Online Content Is Disappearing

The biggest crime against shared knowledge ever committed is photobucket fucking off with the pictures in every "how to fix this car problem" forum post.

Zoidsberg ,
@Zoidsberg@lemmy.ca avatar

And all the "Thanks! Took two minutes to fix after seeing your post" comments just to rub it in.

PseudorandomNoise ,
@PseudorandomNoise@lemmy.world avatar

There’s some old Reddit posts like this too. Advice threads where the person who posted a solution went back and overwrote their comments during the boycott last year. I know why they did it but we still lost some information in the grand scheme of things.

infeeeee ,

Most of reddit was already archived before: https://the-eye.eu/redarcs/

Appoxo ,
@Appoxo@lemmy.dbzer0.com avatar

Also by the Archive Warrior project: https://tracker.archiveteam.org/reddit/#show-all

Appoxo ,
@Appoxo@lemmy.dbzer0.com avatar

And that is why I criticized the decisions every time I read about it. Every time I got mixed responses but ultimately got a higher downvote ratio.
Also a reason I participate(d) in the archive warrior reddit project.

Powderhorn , to Technology in Online Content Is Disappearing
@Powderhorn@beehaw.org avatar

Poorly thought-out Facebook posts are forever; coverage of city council malfeasance from two years ago, not so much.

Maeve , to Technology in Online Content Is Disappearing

The memory hole is real.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • kbinchat
  • All magazines