Home > Blogging, Geeky, Tips > AOL Users: How to Backup AOL Journals to Your Computer

AOL Users: How to Backup AOL Journals to Your Computer

October 2nd, 2008

This is a screenshot of Wget on AOL JournalsIt’s been a couple days since AOL users were notified about the fact AOL Journals is shutting down. In an effort to help those who need it, I’ve put together a guide on how to backup all your AOL journals content (blog entries, images, etc.) locally to your computer for later access.

Until AOL provides an option to export your blog content into a simple XML file, you’ll need to “crawl” (aka, “spider”) your Journal and it will be mirrored locally. There are several drawbacks for doing it this way — but to save you time, let’s just say AOL will be paying the price for it in the end due to the inability to offer a blog export feature. Important notes are located at the end of this entry for review.

Overview:

  • Download a free application (Wget) to be used later.
  • Use a Command Prompt so you can send Wget the commands.
  • Open the folder where all the files are stored.
  • Wait until crawl is complete. It takes a while.

I recommend you close all other applications before continuing as this can be confusing for novices and distractions just make things worse.  This will involve a LOT of Web traffic moving over your Internet connection, dial-up users need not continue further.

This process uses the Command Prompt (terminal for geeks).  Please take caution as you type these commands as you could potentially cause more problems than good. Type all quotation marks, spaces and dashes for commands!

Process:

  1. Download Wget for Windows. It’s free and is a typical software installation, accept the terms and click ‘yes’ and ‘continue’ to install it.
  2. Click Start then Run. Windows Vista users can just click the Start Menu.
  3. Type cmd and press Enter.
  4. Navigate to the Wget installation folder by typing this command in and press Enter:

    cd "C:\Program Files\GnuWin32\bin"

  5. Begin the crawl of your Journal. Type this command in and press Enter:

    wget -m -k -K -E --random-wait --html-extension http://journals.aol.com/SCREENNAME/JOURNAL-TITLE/

  6. Observe the Command Prompt and make sure it’s crawling your Journal. At this point, just set it and forget it, and come back in a few hours and observe if it’s crawling your Journal.
  7. When Wget is no longer crawling your blog, press CTRL + C to stop it.
  8. Open the indexed files locally on your computer. Type this command in and press Enter:

    explorer journals.aol.com

  9. Browse the folders and locate your screenname. The reason why other blogs show up is because you might have linked out to them and they were also AOL Journals users. At this point, you may close the Command Prompt.
  10. You’re done! I recommend adding everything under your blog into one ZIP file and saving it to a CD so you don’t delete it by accident.

Important Notes:

  • This will consume a lot of bandwidth. Do not do this if you have a rated broadband provider (like Verizon EVDO service).
  • This will hit AOL Journals’ servers hard. If too many people do this at once, AOL could technically block these requests. (There are tricks to get around that, nonetheless.)
  • Assume a full index will take about 500MBs, including images.
  • *If* AOL offers an “export” function, do not follow the instructions above.

Any comments, questions, problems? Discuss it in the comments below. Unfortunately, I’m unable to back up everyone’s blog for you, but try these instructions on your own before asking me to do it for you.

Why the heck did I write this? Read: AOL Journals and AOL Hometown Will Be Closed.

Blogging, Geeky, Tips , , , , , ,

  1. Katherine Fisher
    October 6th, 2008 at 06:34 | #1

    This seems to be working on my public journal (it’s crawling now). I also have a private journal and I could not get it to work. I assume I somehow have to be “allowed in”. Any suggestions???

    Thank you -

  2. October 6th, 2008 at 13:03 | #2

    Katherine,

    Private Journals require the use of passing a secure cookie. This is possible, but makes it a bit more difficult.

    Briefly, you would clear your cache and cookies, then login with your Web browser, then find that cookie and use that same coookie (aka, Session), and crawl it and use the authenticated cookie to crawl a private journal.

    Use the following commands in your Wget command:

    --cookies=on --load-cookies=cookie.txt --keep-session-cookies --save-cookies=cookie.txt

    You’d copy over the data of the authenticated AOL cookie into the a cookie.txt file in the Wget bin folder and do that.

    ~Joe

  3. Katherine Fisher
    October 7th, 2008 at 20:22 | #3

    Just an update….it crawled FOREVER. When I finally stopped it, it was not in any way clear how the files were organized. Very frustrating. But, I have to keep trying….

    thanks!

  4. October 8th, 2008 at 13:15 | #4

    Joe,

    I thought I’d give you a heads-up - AOL Journals users can now import their journals to the Blogger platform.

    How-to is here:

    http://help.blogger.com/bin/answer.py?answer=112742

    I wrote a short update on my blog here:

    http://anti-aol.livejournal.com/98350.html

    Perhaps you want to update your blog to keep J-Land users who stop by up-to-date on what’s going on.

  1. October 2nd, 2008 at 07:24 | #1
  2. October 3rd, 2008 at 07:27 | #2