Low-cost Information Management, Communications and Collaberation Tools

Although much of the work I do is for very large organisations and extremely costly projects, being an adopted Yorkshire-man, I’ve always an eye open for a bargain! More seriously, there are many small to medium sized businesses and charities that cannot afford big IT budgets but that still are crying out for good information management, communications and collaboration tools. In this article, I’ve tried to highlight a few tools that I think are worth looking at.

I’ve not included anything in this article that requires a monthly or annual cost. All the tools here are available for free at least with limited features. The feature lists given are for the free versions with paid-for key features noted where appropriate. I also note if any of the web sites are blocked by typical enterprise firewalls.

There is a lot more than what I’ve shared here, I’ll try to update this article from time-to-time. Continue reading

Enterprise System Design and Accessibility

Most web designers are well aware of the need to design with accessibility in mind and that this is a legal requirement in many countries.

Not so many IT architects and designers who deal with internal, enterprise systems are aware, though, that these laws and requirements also apply to internal systems.

Recently I’ve yet again seen a number of dreadfully designed user interfaces (UI) for enterprise systems that most certainly don’t meet usability standards let alone accessibility standards! Continue reading

Apple iOS 5, What does it change for me?

There are some really nice looking changes on the way for Apple mobile users lucky enough to have a device that can be upgraded to iOS 5. I thought I’d throw together a quick look at how this will change things on a practical level for me as I use an iPhone and an iPad for work and personal use.

Update: 2011-10-13. I’ve added some more details.
Continue reading

Data Cleansing Tools

It is very common to receive information in a format that cannot easily be formatted and processed as you want it to be. For example, you may have an unstructured list of information that you need in a table; or data in a table that you need as XML; or data in a web page (HTML) that you need in Excel. So many information professionals maintain a tool-kit that allows them to manipulate all sorts of information in different formats.

Here is a small list of tools aimed at cleansing and structuring data.

  • Text editors [Free and commercial; All platforms] Often the first port of call for manipulating un- or semi-structured text. Anything beyond simple search/replace and manual editing is likely to use Regular Expressions (see below).
  • Spreadsheets [Excel - Commercial; Windows, Mac. OpenOffice - Free; Windows, Linux, Mac. Many similar free and commercial software tools] This is often my second tool of choice as they are more structured than plain text but not as restrictive as a full database.
  • Regular Expressions – Quick Reference. Often incomprehensible but very powerful text transformation “language”. Commonly found in good text editors though it is sometimes easier to use one of the web training tools as long as the text to transform is not too big. Well worth learning as many, complex transformations can be built up using a series of regular expressions and they are available from many tools. Also available in any decent code editor or IDE, Microsoft Office (via VBA code), PHP, JavaScript, Java, Linux command line tools, etc.
  • Google Refine [Free software; Windows, Linux, Mac] Run locally to manipulate tabular data. Quite powerful but not especially well documented. Part of the Freebase project. Support is provided by the mailing list. Can save and re-run, easily undo steps. Cannot run in batch mode as a transformation engine. Useful when transformations and cleaning in a Spreadsheet don’t cut it.
  • DataWrangler (Alpha) [Free; Web] From Stanford University, they have a demo version online. Similar to Refine.
  • TextPipe Pro [Commercial US$ 30-400; Windows] “industrial strength text transformation, conversion, cleansing and extraction workbench”

Once you have structured data, there are a number of types of tools that are used to extract, transform and process data – ETL (Extract, Transform and Load), Business Information (BI) and Data Mining. Typically though, these are enterprise class tools and only really worth bothering with when you have large-scale data and need repeatability. Here is a small list of such tools for reference.

  • XSLT [Free and Commercial, All platforms] This is an XML transformation language. It is used for transforming an XML scheme into another one. Perhaps may be thought of as regular expressions for XML.
  • SQL [] Structured Query Language is the standard language for manipulating relational databases.
  • Microsoft PowerPivot for Excel 2010 [Free?; Windows] Desktop BI add-in for Excel 2010.
  • Pentahoe [Free and Commercial]
  • RapidMiner [Free and Commercial]
  • Orange [] Visual programming, visualisation, data analysis
  • Talend Open Studio [Free and Commercial] Aimed at data integration
  • CloverETL [Free and Commercial]
  • IBM Data Stage [Commercial]
  • Ab Inicio [Commercial]

It might also be worth looking at the BI Verdict web-page for the latest analysis of the strongest players amongst the BI vendors and products

Speeding up Cygwin

Yesterday I mentioned my success with Cygwin.

One issue I did have though was with the speed of startup. It was taking 15-20 seconds to start a BASH shell.

It turns out that this was a PATH issue. I went through my Windows PATH and cleared out the clutter. Now it takes just around 3-4 seconds for a full BASH login and less still for just running a script.

I now find myself using the BASH shell for all sorts of things and I’ve set up a number of alias’s to switch to folders I’m using a lot and to open common documents.

One handy function I’ve added to .bashrc (so it is always available) works out the current working version of a document. It assumes that you keep copies that have a version number or date in the file name that will sort correctly.

You can find the code on my development blog.

Here are a few more alias’s I use:

alias np='cygstart "/cygdrive/c/Program Files/Notepad++/notepad++.exe"'
alias c='cd /cygdrive/c/'
alias d='cd /cygdrive/d/'
alias work='cd "$HOME/Documents/Workdocs/"'
alias pers='cd "$HOME/Documents/Persdocs/"'
alias facebook='http://www.facebook.com'

Windows command prompt vs PowerShell vs Cygwin for remote backup scripts

I’ve been struggling with trying to get a new backup routine working for my Laptop.

I should point out that I have several complex requirements for backup so my needs are probably not average.

However, it really shouldn’t be this hard!

I need to use a combination of BZR (Bazaar) for document version control and RSYNC (for files that don’t need version control and for those folders that might contain files too big for version control systems – around 1/3 to 1/2 available memory).

All of the backups need to happen over a secure link since I am often outside of my home network – indeed quite often behind locked-down enterprise firewalls but that’s a story for another time. So I use SSH (Secure SHell) to manage the secure connection and transmission. Thankfully both BZR and RSYNC can both use SSH as a transport.

I don’t want to have to enter my remote system password loads of times though and this is where things started to get annoying. Using the Windows native versions of BZR, RSYNC and SSH I could not get a single shared password to work no matter what I tried.

I also had some problems trying to control the output from the various tools and use it to further control what happens next – for example getting an IP address and working out whether I am on a network and where that network is.

I tried to do this with a Windows command script first but even with the Windows 7 extensions it really is far to hard to get anything useful done and when I found myself turning to more and more utilities to help I thought “enough is enough”.

At that point I happened to be reading an article on Windows PowerShell, the .NET scripting host so I thought I’d give it another go (having tried it before). I soon found that, although powerful for controlling the WMI interface, it is desperately convoluted and annoying for general use.

So, realising that most of the tools I wanted to use have their roots in the UNIX world, it would make sense to try out the latest version of Cygwin. This has really come on a long way since it’s early days and is far more mature. It is also very much lighter in weight than the Microsoft provided UNIX services for Windows or whatever they are currently calling it. The Microsoft provided tools load perhaps a dozen services into memory permanently though they are rarely required. Cygwin only uses memory when it needs to.

After converting my backup script from Windows batch to a BASH script under Cygwin, I soon had everything working as I wanted it – including the seemingly intractable problem of the shared passwords, now using KEYCHAIN to manage the SSH-AGENT and keys. So now I only need to supply a password once, it is held reasonably securely in memory and used by SSH as and when required. I only need to enter it once per reboot.

The full script not only backs things up, it also auto-commits changes to BZR and changes network settings to match my current location and proxy server requirements. The later is not yet converted from batch as I don’t need it just at the moment.

Let me know if you are interested in a copy of the script and I’ll upload it somewhere.

Running Komodo Edit Open Source Code Editor Under Windows 7

The open source version of Komodo’s code editor and development environment Komodo Edit is a great tool for development. I use it for PHP, HTML, JavaScript and more.

However, I haven’t done any serious coding for a while so I haven’t needed to run it under Windows 7 even though I had it installed. When I did, I was disapointed to find it behaving very poorly. It wouldn’t resize properly without messing up the screen. I tried with some of the compatibility settings that Windows 7 gives you but that made no difference. I also tried an upgrade to the latest version.

A quick search didn’t reveal anything about Windows 7 specifically but I did spot a discussion about problems under Vista that were related to file permissions. Sure enough, making Komodo Edit run as Administrator fixed the issues.

I’ve had a few file permission issues under Windows 7, I’m fairly sure it is down to me messing around. However, it is clear that Windows Vista and Windows 7 are both rather sensitive to permissions issues which is worth bearing in mind.

Looking at %USERPROFILE%\AppData\Roaming\ActiveState\KomodoEdit, I could see that SYSTEM, my user and Administrators all had full access but that Administrator was the owner of some of the files. I can only summise that this is the issue.

As this needs rebuilding with Windows 7 RC, I haven’t the time to test further but certainly running as Administrator does the trick.

By the way, Komodo Edit is available for Mac and Linux as well as Windows. It has a big brother “Komodo IDE” with additional features if you need them. Both are highly configurable, support many languages and are based on the Mozilla code (like Firefox and Thunderbird) & can easily be extended with JavaScript.


Technorati : , , ,
Diigo Tag Search : , , ,

How-to use WebDAV on Windows Vista and Windows 7

Windows is supposed to have a built-in WebDAV client. However, it doesn’t ever seem to have been especially robust and certainly since Vista a lot of people (myself included) have found that it simply doesn’t work on many supposedly WebDAV enabled sites.

Thankfully there are a couple of free (and some not free) options that, while not as nicely integrated into Windows Explorer, do enable you to transfer files back and forth.

The two that I found were:

I’ve tried the first of these two and it seems to work OK. It’s not polished but it gets the job done. There is also an online version using a Java plugin if you don’t want to install the client.

It seems that the second option is no longer in active development. Don’t be fooled by the web site though, they have changed the license to freeware.

Now, at last, I can once again load multiple files to our corporate intranet on WebEx without having to switch to a Linux desktop!


Technorati : , ,
Diigo Tag Search : , ,