Egret

a power tool for working with messy data

BuildingBigAnswersFromBigData

9 years ago

Egret is a power tool for working with messy data. Use it to improve data consistency, link it to data registries, augment it with data from other sources, transform it into different formats for other tools to consume, and contribute it to back to the original sources. It is a part of those products that are used as data wranglers. Egret is not a web service but a desktop application that runs on your own computer, so you can process sensitive data with privacy.

Egret was originally developed as "Freebase Gridworks" by Metaweb Technologies. Metaweb was acquired by Google in July 2010 and they renamed the product Google Refine. In October, 2012, the product was renamed OpenRefine as it transitioned to a community supported project. The project was then forked enhanced, and rebranded as Egret.

EgretInFlight - (screenshots)

Egret has many added features some of which are listed below:

  • Data Profiling
  • Data Exploration
  • Extended Transforms
  • Column-wise and Row-wise analysis if data
  • Changeable and configurable clustering algorithms

Egret was part of a big data vision that is abstractly illustrated in the following raw videos:

  1. OneWaves - Waves and waves of data
  2. TwoEgret - Big Data Cleansing
  3. ThreeBodhi - Enlightened Insight

While these conceptual videos were never released they formed the basis of a plan to be implemented to drive true BI across Big Data.

back to codemarc.net


(adsbygoogle = window.adsbygoogle || []).push({});