MEpedia and Internet Archive

WillowJ · Mar 3, 2019

Most pages have not been archived in over a year, and some have not been archived at all.

I was able to add/update a few to archive.org, but it might be good to set up a method. Possibly some tech person even knows of a way to automate this.

daftasabrush · Mar 3, 2019

Sounds good. MEpedia also keeps history of each page but a separate archive is important too.

It's not clear to me how the web archive chooses which pages to archive. I know it won't just archive the whole of any site.

Perhaps the Contents page is a key one to archive and might encourage the archive to visit and archive many of the links on there?

I think there are issues with a low crawl rate on MEpedia too, I've noticed recent pages and not-so-recent changes not appearing from a Google search.

Patient4Life · Mar 3, 2019

WillowJ said:
Most pages have not been archived in over a year, and some have not been archived at all.

I will do some pages. Never worked with it. What do I do?

EDIT: I just figured it out. Saved main page and a few other pages.

Patient4Life · Mar 3, 2019

Main Page, Pace trial, ICC, CCC, and IOM report, Fibromyalgia, ME, CFS, SEID, CF, and all the Primers have been saved to the Wayback Machine as well a few other pages such as Neuroinflammation and Brain scans.

EDIT:If this is not the same as the Archive.org page, (although Wayback is found at Archive.org) someone else will have to work on that as I cannot figure out how to use it.

Alvin · Mar 3, 2019

I would not worry so much about archiving MEPedia on the Internet Archive.
I assume MEAction has no plans to take it down.
On the other hand i think the references in David Tuller's articles desperately need archiving because the authors want to erase any history that they don't want immortalized.

The IA seems to have some sort of algorithm, i was reading that manually adding a page does not add that page or website to its roster, so i'm assuming it has something to do with worldwide traffic levels or how often its linked elsewhere or something like that.

Patient4Life · Mar 3, 2019

daftasabrush said:
I think there are issues with a low crawl rate on MEpedia too, I've noticed recent pages and not-so-recent changes not appearing from a Google search.

I just tried the google search console and it will need to be done by someone like Jen or another MEpedia administrator as far as I can see. I guess doing the same pages I did on the Wayback Machine (see above) would be a good idea.

FYI @JenB @JaimeS

I also did the Trial By Error, Open Medicine Foudantion, GWI, Simon W, Esther C, Michael S, GET, CBT, and a few other pages on the Way Back Machine.

Alvin · Mar 3, 2019

Patient4Life said:
I also did the Trial By Error, Open Medicine Foudantion, GWI, Simon W, Esther C, Michael S, GET, CBT, and a few other pages on the Way Back Machine.

Wow.
I did the references on the PACE Intimidation article a week ago i think, but there's no way i would be able to do that on a regular basis or even the updates since.

Patient4Life · Mar 3, 2019

Alvin said:
Wow.
I did the references on the PACE Intimidation article a week ago i think, but there's no way i would be able to do that on a regular basis or even the updates since.

I can't do anything on a regular basis either. I just can't be responsible for something like this. Oh, and I only did the pages, not their references one by one.

Just to note here: I also did PEM and Pediatric ME/CFS and List of symptoms of ME CFS. Also Lady Gaga, all the Fibro pages, not just the main page, and Dry eyes syndrome, Lyme disease, and Lupus.

So, if anyone should ever want to keep updating Wayback, they can look at my list and copy down.

Patient4Life · Mar 3, 2019

Alvin said:
I did the references on the PACE Intimidation article a week ago

Those citations are very important. Thanks for doing them.

Alvin · Mar 3, 2019

Patient4Life said:
I can't do anything on a regular basis either. I just can't be responsible for something like this. Oh, and I only did the pages, not their references one by one.

I cant be responsible either, i can do a bit here and there but doing it on a regular basis is not going to happen.
MEPedia is constantly changing so updating it on the IA regularly would be a Sisyphean task

Patient4Life said:
Just to note here: I also did PEM and Pediatric ME/CFS and List of symptoms of ME CFS. Also Lady Gaga, all the Fibro pages, not just the main page, and Dry eyes syndrome, Lyme disease, and Lupus.

So, if anyone should ever want to keep updating Wayback, they can look at my list and copy down.

I think the references are actually the most important, if they get deleted then there is no citation for the MEPedia article.
In some cases its generic information so no big deal, in some you need citations, especially when its talking about people or actions.

Patient4Life · Mar 3, 2019

This link won't update on Wayback and I don't know why. I run into little things like this.

Alvin said:
I think the references are actually the most important, if they get deleted then there is no citation for the MEPedia article.
In some cases its generic information so no big deal, in some you need citations, especially when its talking about people or actions.

https://www.bbc.com/news/uk-12195884

Alvin · Mar 3, 2019

Patient4Life said:
This link won't update on Wayback and I don't know why. I run into little things like this.

https://www.bbc.com/news/uk-12195884

Sometimes pages will go in an infinite loop, probably some software bug but this ones already got 5 captures
https://web.archive.org/web/20140512085120/http://www.bbc.com/news/uk-12195884

Patient4Life · Mar 3, 2019

I can't capture this page either. It looks like the BBC article above was captured and I did not realize it although it should not give me an error, why not just let me save it again.
https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(06)68662-5/fulltext

Patient4Life · Mar 3, 2019

Alvin said:
Sometimes pages will go in an infinite loop, probably some software bug but this ones already got 5 captures
https://web.archive.org/web/20140512085120/http://www.bbc.com/news/uk-12195884

Yes, just saw the capture. But won't let me capture a Lancet article. Tried searching it, too. Nothing.

Alvin · Mar 3, 2019

Patient4Life said:
I can't capture this page either. It looks like the BBC article above was captured and I did not realize it although it should not give me an error, why not just let me save it again.
https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(06)68662-5/fulltext

Its there too
https://web.archive.org/web/2013101...cet/article/PIIS0140-6736(06)68662-5/fulltext

I use Firefox and the Get Archive addon, it allows you to right click on any page and get the IA version. If its not there then you can click add to IA.

For the lancet one i had to remove the #article_upsell at the end of the url

Patient4Life · Mar 3, 2019

Thank you. I guess I am not really familiar with all of this. I will practice.

Alvin · Mar 3, 2019

Patient4Life said:
Thank you. I guess I am not really familiar with all of this. I will practice.

No worries, i have fought with IA for a while so i have some experience on weird happenings

rvallee · Mar 3, 2019

Patient4Life said:
I just tried the google search console and it will need to be done by someone like Jen or another MEpedia administrator as far as I can see. I guess doing the same pages I did on the Wayback Machine (see above) would be a good idea.

FYI @JenB @JaimeS

I also did the Trial By Error, Open Medicine Foudantion, GWI, Simon W, Esther C, Michael S, GET, CBT, and a few other pages on the Way Back Machine.

I see that robots.txt is not configured. It's a special file that tells search engines about updates and which pages are most important. Without that search engines do a basic crawl and set their own rules based on affluence (more visited sites get more interest, but this only works at much higher rates so for me-pedia it would be minimal).

This is something that needs to be installed on the wiki software. It's all automatic once it's configured properly. Internet archive probably relies on it, at least partially,

(I also noticed it's not configured on s4me.info, should be looked into, the forum software should have the option)

JaimeS · Mar 5, 2019

Thanks guys -- I passed this on to our MEpedia Volunteers Slack channel.

Alvin · Mar 5, 2019

JaimeS said:
Thanks guys -- I passed this on to our MEpedia Volunteers Slack channel.

It might also be worth asking them if there is a way that references can be automatically submitted to the Internet Archive.

MEpedia and Internet Archive

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)

Senior Member (Voting Rights)