So I’ve been using iOS 5 on both iPad 1 and iPhone 4 for a couple of days now and I thought I’d follow up on my previous post about what I thought would impact me.
Author Archives: Julian Knight
Apple iOS 5, What does it change for me?
There are some really nice looking changes on the way for Apple mobile users lucky enough to have a device that can be upgraded to iOS 5. I thought I’d throw together a quick look at how this will change things on a practical level for me as I use an iPhone and an iPad for work and personal use.
Update: 2011-10-13. I’ve added some more details.
Diigo Bookmarks 09/12/2011
-
The Cottage Smallholder » Raspberry gin recipe
Also raspberry vodka (schnapps).
-
Some really odd ones (basil?) but some good ones too (blackberry)! Basically Vodka with something added.
-
Cloud Security: Open Source Identity Management Software
List of open source identity management systems
Diigo Bookmarks 08/19/2011
-
10 online tools to simplify HTML5 coding | CatsWhoCode.com
Just as it says in the title: some online tools that may be useful when starting out with HTML5 coding.
-
David McRitchie’s Excel pages
-
Fast Javascript table/spreadsheet library
Diigo Bookmarks 08/11/2011
-
Very handy. A quick, disposable WordPress development environment that will run from a USB key or anywhere else. Leaves nothing behind, just delete the folder. Includes Apache, PHP, MySQL & a pre-configured WordPress deployment.
Diigo Bookmarks 08/08/2011
-
DUG – Daf’s Uncomplicated Grapher
Simple INI controlled software to graph SNMP values. Can also graph XML (and HTML if an XPATH can be generated).
Diigo Bookmarks 08/05/2011
-
Continuously monitor some basic stats on broadband ADSL routers.
-
Open source identity and access management toolkit
-
Authentic – Overview – Projets Entr’ouvert
Open source identity and access management platform written in Python, Django and C.
-
ForgeRock – ForgeRock | ForgeRock provides solutions based on top quality Open Source software.
Open source identity and access management platform. Includes OpenAM (authentication and authorisation), OpenDJ (directory service), OpenIDM and OpenICF.
-
A native PHP web application that provides Service Provider (for authenticating users to your own web application) and Identity Provider (for providing federated user credentials and single sign-on [SSO]) SAML services for authentication. Open source.
Diigo Bookmarks 07/21/2011
-
Check your email addresses against a combined list of known compromised web sites. If you have ever used an address on a web site, this will check whether hackers know your email address and so may possibly have compromised the password you used on that site.
Diigo Bookmarks 07/19/2011
-
Douglas Crockford’s Javascript
Lots of useful information about Javascript
- Lots of JavaScript info from the guy who came up with JSON. Looks like it might be a nice resource – post by Hoppy Blings
- List of articles and books by one of the masters of OO JavaScript (he “discovered” a lot of the advanced uses for it), who also came up with the JSON standard – post by Lindsay Donaghe
Data Cleansing Tools
It is very common to receive information in a format that cannot easily be formatted and processed as you want it to be. For example, you may have an unstructured list of information that you need in a table; or data in a table that you need as XML; or data in a web page (HTML) that you need in Excel. So many information professionals maintain a tool-kit that allows them to manipulate all sorts of information in different formats.
Here is a small list of tools aimed at cleansing and structuring data.
- Text editors [Free and commercial; All platforms] Often the first port of call for manipulating un- or semi-structured text. Anything beyond simple search/replace and manual editing is likely to use Regular Expressions (see below).
- Spreadsheets [Excel - Commercial; Windows, Mac. OpenOffice - Free; Windows, Linux, Mac. Many similar free and commercial software tools] These are often my second tool of choice as they are more structured than plain text but not as restrictive as a full database.
- Regular Expressions – Quick Reference. An often incomprehensible but very powerful text transformation “language”. Commonly found in good text editors, though it is sometimes easier to use one of the web training tools as long as the text to transform is not too big. Well worth learning, as many complex transformations can be built up from a series of regular expressions. They are also available in any decent code editor or IDE, Microsoft Office (via VBA code), PHP, JavaScript, Java, Linux command line tools, etc.
- Google Refine [Free software; Windows, Linux, Mac] Runs locally to manipulate tabular data. Quite powerful but not especially well documented. Part of the Freebase project. Support is provided by the mailing list. Steps can be saved and re-run, and are easily undone. Cannot run in batch mode as a transformation engine. Useful when transformations and cleaning in a spreadsheet don’t cut it.
- DataWrangler (Alpha) [Free; Web] From Stanford University; a demo version is available online. Similar to Refine.
- TextPipe Pro [Commercial US$ 30-400; Windows] “industrial strength text transformation, conversion, cleansing and extraction workbench”
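As a rough illustration of building up a transformation from a series of regular expressions, here is a minimal Python sketch that turns a messy, semi-structured list into rows suitable for a table. The function name `clean_lines`, the sample input and the choice of separators (`;`, `,` and `|`) are assumptions made for this example only; real data will need its own patterns.

```python
import re
from typing import List


def clean_lines(raw: str) -> List[List[str]]:
    """Turn a semi-structured text list into rows of fields."""
    rows = []
    for line in raw.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # Step 1: remove a leading bullet ("-", "*", "•") or numbering ("3." / "3)")
        line = re.sub(r"^(?:[-*•]|\d+[.)])\s*", "", line)
        # Step 2: collapse runs of whitespace into a single space
        line = re.sub(r"\s+", " ", line)
        # Step 3: split into fields on ";", "," or "|" (assumed separators)
        rows.append(re.split(r"\s*[;,|]\s*", line))
    return rows


if __name__ == "__main__":
    sample = "- Alice ;  London |  42\n\n2) Bob,Paris,   37\n"
    for row in clean_lines(sample):
        print(row)
```

Each step is a small, readable expression; chaining several simple ones tends to be easier to debug than a single large pattern.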
Once you have structured data, there are a number of types of tools that are used to extract, transform and process data – ETL (Extract, Transform and Load), Business Intelligence (BI) and Data Mining. Typically though, these are enterprise class tools and only really worth bothering with when you have large-scale data and need repeatability. Here is a small list of such tools for reference.
- XSLT [Free and Commercial, All platforms] This is an XML transformation language. It is used for transforming XML that follows one schema into XML that follows another. It may perhaps be thought of as regular expressions for XML.
- SQL [] Structured Query Language is the standard language for manipulating relational databases.
- Microsoft PowerPivot for Excel 2010 [Free?; Windows] Desktop BI add-in for Excel 2010.
- Pentaho [Free and Commercial]
- RapidMiner [Free and Commercial]
- Orange [] Visual programming, visualisation, data analysis
- Talend Open Studio [Free and Commercial] Aimed at data integration
- CloverETL [Free and Commercial]
- IBM DataStage [Commercial]
- Ab Initio [Commercial]
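To make the extract, transform and load idea concrete at a very small scale, here is a minimal Python sketch using only the standard library (`csv` and `sqlite3`). It is not how any of the tools listed above work internally; the sample data, the `people` table and the function names are all hypothetical and chosen purely for illustration.

```python
import csv
import io
import sqlite3

# Hypothetical raw input: inconsistent case, stray spaces, a missing age.
RAW_CSV = """name,city,age
 alice ,London,42
BOB,paris,
carol,Leeds,29
"""


def extract(text):
    """Extract: read CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(records):
    """Transform: tidy names and cities, drop rows without a numeric age."""
    cleaned = []
    for rec in records:
        age = rec["age"].strip()
        if not age.isdigit():
            continue  # assumption: rows with no valid age are discarded
        cleaned.append(
            (rec["name"].strip().title(), rec["city"].strip().title(), int(age))
        )
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows into a relational table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS people (name TEXT, city TEXT, age INTEGER)"
    )
    conn.executemany("INSERT INTO people VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl(text):
    """Run all three steps against an in-memory SQLite database."""
    conn = sqlite3.connect(":memory:")
    load(transform(extract(text)), conn)
    return conn


if __name__ == "__main__":
    connection = run_etl(RAW_CSV)
    for row in connection.execute("SELECT name, city, age FROM people ORDER BY age"):
        print(row)
```

Once the data is in a table, plain SQL can be used to query or reshape it. The dedicated tools earn their keep when the same pipeline has to run repeatedly over large volumes.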
It might also be worth looking at the BI Verdict web page for the latest analysis of the strongest players amongst the BI vendors and products.