Looking for:
Winhttrack download blogspotDownload HTTrack Website Copier 3.49-2 - Winhttrack download blogspot
Configuring WinHTTrack to Back Up a Blog | Ray Woodcock's Latest
It could also be useful for capturing and preserving the state of a website as of a certain date. I had used HTTrack previously. At its best, I had been very happy with it. Most recently, however, I had somehow lost the settings that I had used to make a successful backup of my blogs. Once backed up, it was not necessary to re-download the full website; HTTrack would just copy the pages that had changed.
I saw that there were downloads for both and bit systems, with and without installers. I assumed the non-installer version was what others would call the portable version. I hoped to preserve my settings, once developed, over a long period of time, and anyway I preferred not to have to reinstall a boatload of programs whenever I installed Windows. So I opted for the bit portable version. It dated from September 30, I put the entire contents of the portable version into my customized Start Menu, not located on drive C, so that I could back it up, share it with other computers, and preserve it for years, regardless of Windows reinstallations.
I ran HTTrack while consulting its documentation page. Given my particular interest in using HTTrack to back up my blogs, especially but not only those in WordPress, I also ran a search for advice.
This led to a bemusing spectrum of suggestions. On one extreme, eHow said that, to back up a Blogger blog presumably not very different from a WordPress blog using HTTrack, you just basically point and shoot. I saw comments by others who had done exactly that, with acceptable results. On the other extreme, Noumaan Yaqoob , evidently working with WordPress.
To the extent applicable, this could include Search and other widgets, plugins, comments, and metatags, as well as code minification. His concern seemed to be that HTTrack would not mirror all of the programming that would underlie a blog, and therefore some links e. I guessed that it was not necessarily as simple as some might believe. There was also a concise, command-line manual for the Linux version. That guide apparently a bit outdated, but hopefully still useful advised me to start by entering a project name and base path.
The name I chose was simply the title of the blog I was backing up, so that I would not accidentally mix up its files with those of another blog. I chose a base path that was not located on drive C. That is, I wanted the HTTrack backup of my blogs to be on a data partition, not on the Windows drive C programs partition. This was because a I backed up the data partitions much more frequently and b drive C would be wiped out if Windows became nonworking and had to be restored from a drive image.
The subfolder was called hts-cache. This folder reportedly contained information about what had been copied previously, so as to speed up subsequent updates of the copied website. The files blackblue. HTTrack would add to the contents of that index. My first Project Name was Blog 1. The Project Name box had a drop-down menu, but at the start there was nothing in it; it was OK to type Blog 1 as a project name there.
I also entered a project category e. Then I clicked Next. That created the Blog 1 subfolder. Eventually, I backed up several blogs, resulting in multiple subfolders around my base path. At that point, my directory tree looked like this:. Each of those subfolders contained an hts-cache subfolder. But that would come later. At the moment, my focus was just on mirroring Blog 1. When I clicked Next, I moved on to a new screen. Here, I had to choose an Action.
According to the Step by Step guide, the meanings of those options were as follows:. I figured that downloads of my WordPress blogs would ordinarily involve a an initial Download Web Site perhaps with questions and then b an occasional Update Existing Download.
I was able to paste that directly into the Web Addresses box. Next, I was able to set Preferences and Mirror Options. Clicking that button opened a new dialog:. The Step by Step guide seemed to indicate that some of the tabs in this dialog might be relevant to me, while others were probably not:.
That concluded my overview of the Options dialog. Now I needed to return to its Limits and Links tabs for closer study. I turned to my WordPress blogs, using the present blog as an example. It had some preset choices but would allow new entries e. It seemed that more than just a few levels would create a potentially voluminous mirror: the Step by Step guide said that the number chosen here would indicate how far links would be followed.
If I understood correctly, a Maximum mirroring depth of 1 would capture only the single page found at the designated URL. If I entered 2, it would capture that single page plus all pages or files that you could reach by clicking every link on that single page. If I entered 3, it would capture the single page at that URL, every page linked to from that single page, and every page linked to from each of those pages.
And so forth. My blog was currently set up so that the top page i. That top page also contained links to other recent blog posts and to my other blogs. I certainly did not want the mirror of this site to include posts from those other blogs.
So just just to be sure, I set the external depth to zero. That resolved one concern. If nothing from outside my target URL i. The highest given value was 20, but it appeared that the box would accept I wondered whether a value that high would just make the thing churn unnecessarily, as it hashed the same links within my blog, over and over again.
Presumably HTTrack started by making a complete list of all of the links that it would be checking. Its search would be infinite only in the sense that it would keep looking for more links, as long as there were any nooks and crannies that it had not yet explored, within the blog or other website that it was copying.
But once it had worked through the links on a page, apparently it would not try to do so again. So it seemed there was not much to be lost from leaving Maximum mirroring depth at a blank i. Cohen offered an example with a limit of 50, and said 20 would usually be sufficient. I decided there was not much to lose from entering a Maximum mirroring depth of Of course, if I had been copying some random and potentially voluminous webpage within e.
So, OK, I had excluded the outside world by setting maximum external depth to zero. I had also enhanced my chances of capturing all of the webpages within my target URL by setting a high 50 if not infinite blank value for maximum mirroring depth. But could I be sure that HTTrack would find every post in my blog, even if it were allowed to follow infinitely many links within my blog?
In other words, was it possible that one or more posts within my blog would never be reached, regardless of how many links HTTrack might follow? In asking this question, I visualized a solitary post, not linked to by any other post. Of course, there was that list of recent posts at the right side. Well, there was also the Archives option, with a drop-down menu leading to lists of items posted, for each month since the blog was first started. And would a drop-down menu count as a source of links for HTTrack?
I experimented briefly, using a free website host. I created a main page, a subfolder, and a file within that subfolder. I ran HTTrack on that new main page. It was captured. For purposes of my HTTrack mirror, the fact that there was no link to it meant that I would basically not be able to find or see it, within the mirrored page structure.
In other words, the mirror was just going to reflect the website or blog. If it was going to take me 14, or 50, or infinite steps to find the page online, then the same would be true — at best — in the mirror. So now I tried a different kind of experiment. I pointed HTTrack toward the top page in this blog, and played with the maximum mirroring depth. I used the same project name as before i. First, I tried setting the mirroring depth to zero. The folder created in the process described in the preceding paragraph was still there.
In a moment, I would delete that, because now there was a new folder for this blog. Note that Windows might not want to delete files or folders if HTTrack or some other program e. Unlocker could be helpful in that situation. I went into that Blog 1 folder and clicked on its index. Sure enough, it displayed the post currently shown on the main page of this blog. While viewing that, I clicked on one of the links shown in that file. It took me to a web address. So, OK, in this case limiting the search to zero depth did not kill the links; it just prevented them from being brought into the mirror.
In Windows Explorer, I went to the other index. Something wrong there. Now I went into the folder for the previous experiment i. There was nothing there. I deleted that folder. I ran the same thing again, except this time I set the maximum mirroring depth to 1 instead of zero.
This time, the top-level index. Now I tried again, notching it up once more: this time, mirroring depth was set to 2.
This process took longer, though still only a minute or so. Although the Blogger backup tool provides a limited backup file, it is a good idea to download this as well in the event that you ever need to restore your Blogger website. To access the Blogger backup tool, click "Settings" from the Blogger admin interface, then click "Export. While HTTrack does provide a complete archive of your website, you cannot use the archive to recreate your blog in the event of data loss.
Blogger's import wizard only supports export files created by Blogger and other popular blogging platforms. Jason Spidle is a technology enthusiast and writer. His writing on computers, smartphones, Web design, Internet applications, sports and music has been published at a variety of websites including Salon, JunkMedia, Killed in Cars and The Columbia Free Times.
Spidle maintains a number of blogs featuring poetry, short stories and other fiction. Home Hardware. The best answers are voted up and rise to the top.
Stack Overflow for Teams — Start collaborating and sharing organizational knowledge. Create a free Team Why Teams? Learn more about Teams. Error using Httrack for download a blog from a blogger site Ask Question. Asked 9 years, 4 months ago. Modified 9 years, 4 months ago. Viewed times. Does anyone know how to donwload a blog with this program? Thank you so much. Improve this question. Peterstone Peterstone 1 1 gold badge 4 4 silver badges 14 14 bronze badges. Add a comment. Sorted by: Reset to default.
❿ ❿
No comments:
Post a Comment