Textpattern to Serendipity Migration Scripts

Not terribly riveting reading but this is the story of my migration from textpattern (version 4.0.4) to serendipity (version 1.1.3). This includes some PHP scripts I used to fix some problems on import – they’re not the best or the most robust in the world but they worked for me and I wanted to pop them here in case they work for someone else. Feedback on the code isn’t necessary because I won’t be using it again ;)

A running start

Firstly let me say that serendipity has a fabulous import feature which brought in 90% of my content about 30 seconds after I saw the button labelled “Import Data”. This is a real hurdle-remover for new users and definitely clinched the deal for me. Having done this I found that:

  • I had weird entity encodings everywhere
  • my images hadn’t arrived
  • everything seemed much more spaced out than it had been on the old blog

Images

The images hadn’t been transferred and I also realised that the custom tags that textpattern uses, such as:


hadn’t been handled by the import script, so I kind of had two problems to solve. I started by transferring all the images across and renaming them to something more helpful.

This is going to be a really long article, but do read on if its helpful – I’m posting the rest as extended content so the disinterested need not scroll past it!Here’s the code I used to move my images (both databases were on the same server as were both installations – this helped to simplify matters quite a bit. Also I have used an exec() call in this script – maybe not the most secure thing in the world but perfect for this fast-and-dirty one-off import script.


This placed all my images in the relavant folder in the serendipity upload folder, which was great. But I still had those mangled image tags to deal with. I wrote a script which edited each entry (its not pretty but it really did work) and fixed this – here it is:

/',$row['body'],$matches,PREG_OFFSET_CAPTURE);
                foreach($matches[1] as $key=>$match) {
                        $replacement = '';
                        $new_body = preg_replace('/

It took a few attempts (mostly involving getting the same image multiple times on posts where there was more than one image.

Entities

The way that textpattern stores character entities gave me a problem when they were then imported into textpattern, and I had ’ and other such “features” all over the place. With a lot of swearing and some help from my friend (thanks Sara!) I eventually got it untangled using this:

 "'",
                "–" => "-",
                "“" => "\"",
                "”" => "\"",
                "…" => "..."
        );

while($row = mysql_fetch_assoc($results)) {

        $sql = 'update serendipity_comments set body = "'.mysql_real_escape_string(str_replace(array_keys($table), array_values($table),$row['body'])).'"
                where id='.$row['id'];
        echo $sql."\n";
        mysql_query($sql,$db) or die(mysql_error());
}

?>

There might be a few others that got through but putting these in certainly sorted out the majority of the problems with the encoding. I’m a little unclear why textpattern and serendipity couldn’t communicate on this front but the main thing is that I did get most of my content through.

Vertical Space

The spacing problems that I had early on I “fixed” by running my code through a script to remove a load of whitespace – actually I was seeing this output because I had the nl2br plugin turned on. Sadly now I’ve formatted all my existing posts I can’t turn it off without re-reformatting them so I’m just living I with what I have for the time being!

Import Scripts and One-Shots

I’ve had quite a bit of experience doing data import routines and I applied some of the same principles to solving the problems I experiences with this migration from textpattern to serendipity. I’m really happy with the new platform, I’d like to hang on to some of the stuff that I used for this, and so I’ve stored it here. If it helps you too, then great :)

One thought on “Textpattern to Serendipity Migration Scripts

  1. This site uses a blogging platform called serendipity which is a nice little tool and I’ve been mostly happy since moving across from textpattern (I did write about the experience). Recently however, a few things have been going wrong with the feeds.

Leave a Reply

Please use [code] and [/code] around any source code you wish to share.

This site uses Akismet to reduce spam. Learn how your comment data is processed.

To respond on your own website, enter the URL of your response which should contain a link to this post's permalink URL. Your response will then appear (possibly after moderation) on this page. Want to update or remove your response? Update or delete your post and re-enter your post's URL again. (Find out more about Webmentions.)