Home
Forums
NAIJAFANS TV
NAIJAFANS RADIO
New posts
Trending
Search forums
What's new
New posts
New listings
New resources
New profile posts
Latest activity
Classifieds
New listings
Resources
Latest reviews
Search resources
Members
Current visitors
New profile posts
Search profile posts
Log in
Register
What's new
Search
Search
Search titles only
By:
Menu
Log in
Register
Install the app
Install
Home
Webmasters / Bloggers
Ways to Recover your Content from Wayback Machine (Internet Archive)
JavaScript is disabled. For a better experience, please enable JavaScript in your browser before proceeding.
You are using an out of date browser. It may not display this or other websites correctly.
You should upgrade or use an
alternative browser
.
Reply to thread
Message
<blockquote data-quote="Naijablog" data-source="post: 417" data-attributes="member: 46"><p>[ATTACH=full]116[/ATTACH]</p><p></p><p>The Internet Archive, also known as the Wayback Machine takes periodic snapshots of many sites across the internet and may have a copy of your site. So, follow along and we’ll teach you how to search for archives and <strong>recover your content from the Wayback Machine</strong>. You can then use these pieces to rebuild your site from scratch.</p><p></p><ol> <li data-xf-list-type="ol">Visit the Wayback Machine at <a href="https://archive.org/web/" target="_blank">https://archive.org/web</a>.</li> <li data-xf-list-type="ol">Type your web address in the search field then click the <strong>Browse History</strong> button. It will list how many times your site was saved over a time period. For example:<br /> “<em>Saved 34 times between November 9, 2008 and May 28, 2019.</em>“<br /> <br /> </li> <li data-xf-list-type="ol"><p style="text-align: right"><img src="https://www.inmotionhosting.com/support/wp-content/uploads/2019/06/select-year.png" alt="select year" class="fr-fic fr-dii fr-draggable fr-fir" style="" /></p> You will also see a timeline and a calendar. Click the <strong>year</strong>to view what dates your site was archived.<br /> <br /> </li> <li data-xf-list-type="ol"><p style="text-align: right"><img src="https://www.inmotionhosting.com/support/wp-content/uploads/2019/06/select-date-1.png" alt="select date 1" class="fr-fic fr-dii fr-draggable fr-fir" style="" /></p> Click the <strong>date</strong>on the calendar to view a snapshot of what was saved. You can try to navigate the site to view any available content. Keep in mind, it may not look exactly like your site since it depends on what was archived at the time.<br /> <br /> </li> <li data-xf-list-type="ol">I recommend checking each <strong>year</strong> and <strong>date</strong> to ensure you find all of the content.</li> </ol><p></p><p></p><h2>Copy Content Manually</h2><p>Now that you know how to search for and find your website snapshots, you can begin copying the text and images to your computer.</p><p></p><ol> <li data-xf-list-type="ol">Navigate to each page of the site and <strong>copy</strong> the text, then <strong>paste</strong> it into a text editor such as <em>Notepad</em>, <em>Google Docs</em>, or<em> MS Word</em>.</li> <li data-xf-list-type="ol">Visit each page in the Internet Archive then <strong>right-click</strong> and <strong>save</strong> any images you want to recover to a folder on your computer.</li> <li data-xf-list-type="ol">In some cases, you may be able to recover some of the website code. <strong>Right-click</strong> then select <strong>View page source</strong> to access the site code. <strong>Save</strong> it to a text editor for later use.</li> </ol><h2>Scrape Internet Archive Content</h2><p>If you don’t have time to manually copy each page of the website you’re recovering another option is to pull or scrape all the site content using a script. The following are some of the most popular options available. Keep in mind that these are often coded by 3rd parties or individuals and may require testing and troubleshooting to make them function successfully.</p><p></p><ul> <li data-xf-list-type="ul"><a href="https://pypi.org/project/wayback-scraper/" target="_blank">Wayback Scraper</a></li> <li data-xf-list-type="ul"><a href="https://github.com/sangaline/wayback-machine-scraper" target="_blank">Wayback Machine Scraper</a></li> <li data-xf-list-type="ul"><a href="https://github.com/hartator/wayback-machine-downloader" target="_blank">Hartator Wayback Machine Downloader (Ruby)</a></li> </ul><h2>3rd Party Services</h2><p>Want to save time? You can pay a 3rd party service to scrape and recover your website for you. Some will even restore content from CMSs such as WordPress. The pricing and scope of service will differ based on the site, so we recommend checking and comparing them to see which one best meets your needs.</p><p></p><p></p><ul> <li data-xf-list-type="ul"><a href="https://www.waybackmachinedownloader.com/en/" target="_blank">Wayback Machine Downloader</a></li> </ul><p>Now that you know how to find and recover website content from the Wayback Machine (Internet Archive), you can begin rebuilding your site. Hopefully, your site will return to its former glory with help from the archived copy. We recommend <a href="https://www.inmotionhosting.com/support/website/maintain/archive-website-with-wayback-machine" target="_blank">archiving your website with the Wayback Machine</a>, so you will have updated snapshots.</p></blockquote><p></p>
[QUOTE="Naijablog, post: 417, member: 46"] [ATTACH type="full"]116[/ATTACH] The Internet Archive, also known as the Wayback Machine takes periodic snapshots of many sites across the internet and may have a copy of your site. So, follow along and we’ll teach you how to search for archives and [B]recover your content from the Wayback Machine[/B]. You can then use these pieces to rebuild your site from scratch. [LIST=1] [*]Visit the Wayback Machine at [URL='https://archive.org/web/']https://archive.org/web[/URL]. [*]Type your web address in the search field then click the [B]Browse History[/B] button. It will list how many times your site was saved over a time period. For example: “[I]Saved 34 times between November 9, 2008 and May 28, 2019.[/I]“ [*][RIGHT][IMG align="right" alt="select year"]https://www.inmotionhosting.com/support/wp-content/uploads/2019/06/select-year.png[/IMG][/RIGHT] You will also see a timeline and a calendar. Click the [B]year[/B]to view what dates your site was archived. [*][RIGHT][IMG align="right" alt="select date 1"]https://www.inmotionhosting.com/support/wp-content/uploads/2019/06/select-date-1.png[/IMG][/RIGHT] Click the [B]date[/B]on the calendar to view a snapshot of what was saved. You can try to navigate the site to view any available content. Keep in mind, it may not look exactly like your site since it depends on what was archived at the time. [*]I recommend checking each [B]year[/B] and [B]date[/B] to ensure you find all of the content. [/LIST] [HEADING=1]Copy Content Manually[/HEADING] Now that you know how to search for and find your website snapshots, you can begin copying the text and images to your computer. [LIST=1] [*]Navigate to each page of the site and [B]copy[/B] the text, then [B]paste[/B] it into a text editor such as [I]Notepad[/I], [I]Google Docs[/I], or[I] MS Word[/I]. [*]Visit each page in the Internet Archive then [B]right-click[/B] and [B]save[/B] any images you want to recover to a folder on your computer. [*]In some cases, you may be able to recover some of the website code. [B]Right-click[/B] then select [B]View page source[/B] to access the site code. [B]Save[/B] it to a text editor for later use. [/LIST] [HEADING=1]Scrape Internet Archive Content[/HEADING] If you don’t have time to manually copy each page of the website you’re recovering another option is to pull or scrape all the site content using a script. The following are some of the most popular options available. Keep in mind that these are often coded by 3rd parties or individuals and may require testing and troubleshooting to make them function successfully. [LIST] [*][URL='https://pypi.org/project/wayback-scraper/']Wayback Scraper[/URL] [*][URL='https://github.com/sangaline/wayback-machine-scraper']Wayback Machine Scraper[/URL] [*][URL='https://github.com/hartator/wayback-machine-downloader']Hartator Wayback Machine Downloader (Ruby)[/URL] [/LIST] [HEADING=1]3rd Party Services[/HEADING] Want to save time? You can pay a 3rd party service to scrape and recover your website for you. Some will even restore content from CMSs such as WordPress. The pricing and scope of service will differ based on the site, so we recommend checking and comparing them to see which one best meets your needs. [LIST] [*][URL='https://www.waybackmachinedownloader.com/en/']Wayback Machine Downloader[/URL] [/LIST] Now that you know how to find and recover website content from the Wayback Machine (Internet Archive), you can begin rebuilding your site. Hopefully, your site will return to its former glory with help from the archived copy. We recommend [URL='https://www.inmotionhosting.com/support/website/maintain/archive-website-with-wayback-machine']archiving your website with the Wayback Machine[/URL], so you will have updated snapshots. [/QUOTE]
Insert quotes…
Verification
Post reply
Richest Naijafans User
Most NaijaCash
Naijafans
11,212 NaijaCash
Streetot
6,147 NaijaCash
N
NL SOFT
2,595 NaijaCash
maventechie
589 NaijaCash
SACHSTOSHI
578 NaijaCash
Naijablog
397 NaijaCash
Klaus
390 NaijaCash
Naijababe
272 NaijaCash
bestosteopathy1
205 NaijaCash
I
Irinaabada
130 NaijaCash
Home
Webmasters / Bloggers
Ways to Recover your Content from Wayback Machine (Internet Archive)
This site uses cookies to help personalise content, tailor your experience and to keep you logged in if you register.
By continuing to use this site, you are consenting to our use of cookies.
Accept
Learn more…