smart disclosure progressing
Smart disclosure can bring to data the people-centric orientation that smart growth has to urban planning.
Tagged: data
non-profits’ distribution of management expenses
At the recent DC Data Dive, GuideStar put out for analysis some IRS financial data for non-profit organizations. The non-profits are identified only by an ID number, topic of work, and geographic scope. The question for the data dive was: “What financial data may be predictive of an organization’s defunct status in two years?” [...]
Tagged: data
building a data search engine
A good search engine for data would make the web more enlightening. Lack of a good business model may be keeping users in data darkness.
Tagged: data
beware of statistical designs
Misleading statistics on the declining farm share have supported agricultural subsidies. Beware of tendentious use of statistics.
Tagged: data, universal service
why SEC filings don’t contain semantic, queryable data
SEC filings don’t contain semantic, queryable data because companies aren’t interested in making their financial data readily available as such data.
Tagged: data
making data more factually important
Data accessibility requires both data production and data access. Discursive norms help to provide good incentives.
Tagged: data
badly structured tables have a bright future
Which is a better, one big table, or two or more smaller tables? The organization of the data sources, the number of smaller tables, the extent of the relationships between the smaller tables, and economies in table processing all affect the balance of advantage. But cheaper storage, cheaper computing power, and fancier data tools probably [...]
Tagged: data
exploring and remodeling table fields
Sometimes tables are messy not just in their data items, but also in the fields that define the table columns.[1] Various techniques help to deal with such “second order” messiness. Sorting table fields alphabetically or evaluating them with more powerful text similarity measures help to identify inadvertently duplicated fields. Sorting table fields by the [...]
Tagged: data
describing and organizing spreadsheet data
Even in this age of big data, most persons collect data in spreadsheets. Two challenges are common with spreadsheet data, particularly spreadsheet data collected from a variety of sources. First, you need to understand what numbers you have. That means both the definition of a specific number and the presence or absence of particular numbers. [...]
Tagged: data