OnBlog: The Onboard Informatics Blog

Conversations on the Art and Science of Information

Author Archive

Why More Data Makes People Happier

leave a comment »

When potential buyers consider what they want in a community, what comes to mind? Are they young hipsters looking for an apartment closest to the most live music venues? Or are they looking for a chiropractor in the vicinity? The priorities consumers have when buying are as varied as the consumers themselves, and it takes a warehouse full of information to satisfy them all.

Local Amenities, one of the most important products and services Onboard Informatics provides for its customers, is the transformation of raw data into meaningful information about the services available in a particular community. Each month sees new additions to the data records we maintain, meaning that the overall picture of a community that can be created is even more inclusive.

Last year at this time, Onboard was supplying 2,221,609 Amenities records to its clients. Currently that number stands at 4,084,253 records, almost double that.  New categories have been added that give a more comprehensive overview of community offerings. If you can think of something you’d want to have in the place you live, chances are that the data is there to tell you whether or not it’s available.

In 2007, if you were a health nut looking for eating and drinking establishments that specifically sold health foods, you were out of luck. But organic food fans, never fear — new Amenities records can help you find the nearest Whole Foods. Working parents who need childcare services? The most recent Amenities data lets you search for nearby Gymborees.

As for education, new data about student housing and vocational schooling has been added to existing records on Catholic, public, and private schools (as well as higher education). In addition to the information Onboard offers about education under Amenities, clients also have access to School Profiles and Reviews. The increase in data records can be seen here, too, since we’ve added about 5,600 reviews over the most recent month.

The more data you have, the more informed you are. As data records in areas like Local Amenities and School Reviews continue to increase, the results can only be more knowledgeable and more satisfied clients. Plus, now they’ll be able to search for the nearest spas in a community — does it get any better?

Written by Tara Powers

August 5, 2008 at 6:19 pm

Bulldog Pride! …Or Why I Will Never Look at School Mascots the Same Way Again

with 2 comments

bow wow wow

I’ll bet you thought your school mascot was pretty awesome. No other school’s paltry representative could hold a candle to your Mustangs or Lions or Wildcats.

Well, guess what? There are hundreds and thousands and millions (okay, maybe I’m exaggerating) of other Mustangs, Lions, and Wildcats out there. How do I know this? Because I’ve tracked down all of their Web sites.

Well, maybe not all of them (although at times it felt like that). For the past several weeks, part of my work here at Onboard Informatics has included updating our (rather extensive) list of invalid school Web site URLs.

For instance, a link to Willow Grove Elementary School may not lead where it’s supposed to because:

a) there might be a misspelled or incorrect address in our database,

b) the link was correct at one time, but now isn’t because the school district has updated or moved its Web sites, or

c) it actually does lead where it’s supposed to, but takes longer to load and so is coming up invalid.

What to do? Well, since Onboard has the school name, address, and district information available as well, we go out to the Internet to search for that school’s current, working Web site. Oftentimes, searching by district is the easiest way to go about tracking down these schools. Since the data is grouped by district in our file, a group of schools that all come from the same district can be taken care of by finding just one district home page.

But of course, nothing is as easy as it sounds.

For one thing, did you ever stop to think about exactly how many Springfield School Districts there are? (Answer: A lot more than you’d think. One in Massachusetts, Oregon, New Jersey, Missouri, and Illinois, and that’s just the first page of Google results.)

What about those pesky Colorado school districts that follow every normal name with an alpha-numeric code? And don’t even get me started on Missouri and its Roman numerals (Harrisburg R-VIII? Really?).

Then there are the schools that, try as you might, you just can’t find. Maybe they’ve closed, or the district’s Web site really isn’t functioning, or they’re in a rural area whose schools may not have set up Web sites yet. In cases like those, we delete the invalid URL that had previously been misdirecting users, but we leave the field blank — from a data perspective, it’s better not to have a Web site listed for a particular school than to supply an incorrect one.

___________

When the sons of Eli break through the line

All of these school URLs are helpful to have on hand when providing information about the offerings of districts in a particular neighborhood. At Onboard, we have valid, school-specific URLs populated for almost 40,000 of our schools — roughly 33 percent of our total listings. We also have a school or district URL populated for close to 80 percent, or 100,000 schools.

Out of about 36,000 distinct total school URLs that were validated, 3,500 changed their URLs over the last year. Of the remainder, we were able to provide valid URLs for about 15,000 schools for which we previously had no information.

After going through all 36,000+ of those school URLs, we ran the modified data through a check to pull out any links that were still broken — only about 1,000 (a much more manageable number, relatively speaking). And when that 1,000 is compared to the approximately 6,000 invalid URLs we finished with last quarter, that averages out to around 3,500 broken links Onboard deals with over a quarter.

Making sure that the data out there is as clean and accurate as possible is a vital part of what we do at Onboard, and keeping data that is constantly being modified, the way school data is, up-to-date is an ongoing task.

So maybe your school’s mascot isn’t the one-of-a-kind Golden Eagle (or other unstoppable creature) you thought it was, but you can still take pride in the fact that your school has a fully functioning website. Just do all of us data collectors a favor — don’t set all 25 links on your home page to blink simultaneously. Trust me, that’s never a good design…

___________

Quick and Random (and not statistically accurate) Fun Facts:

Most popular mascots — Bulldogs, Eagles, Tigers, Lions
Most unique mascots — Winged Beavers, Atom Smashers, Awesome Blossoms, Fighting Quakers, Cheese Makers
Most “interesting” school names — Slaughter Elementary School, Stalker Elementary School

Written by Tara Powers

July 30, 2008 at 10:41 pm

Data in Real Estate (Part 2): Creating Quality

leave a comment »

Having established a foundational knowledge of data and its application to the geographic sphere of real estate, the ability to determine what sort of data will be most valuable for a company’s business ventures is even more important. In the most basic sense, data quality refers to the degree of excellence in relation to the portrayal of the geographic “phenomena” being examined, all contributing to the data’s fitness for use.

How can we say what makes “good” data? When you talk about a good book or a good movie, isn’t your judgment dependent on certain subjective qualities — interests, mood — that are individual to you? To a degree, yes, but there are also certain aspects that must be present without fail in order for a book or movie to be considered “quality.” A book must be free of unintentional spelling and grammatical errors, for instance, and a movie needs to have clearly identified characters and some form of plot.

The overall quality of data can be thought of in the same way. While the specifics of what makes good data will vary according to the type of data you’re seeking — real estate data as opposed to sports statistical data, for instance — there are non-negotiable elements that apply to data as a whole. Read the rest of this entry »

Written by Tara Powers

July 25, 2008 at 5:58 pm

Posted in Informatics

Tagged with ,

Data in Real Estate (Part 1): Creating Accessibility

leave a comment »

The real estate industry has been affected by the nearly infinite amount of information available through the Internet in the same way that all industries have been. Consequently it is now more important than ever that the information clients receive is accurate and reliable. Understanding the nature of this data and the way in which it is interpreted and coordinated by real estate Web sites can contribute to an awareness of the complexity surrounding such data management, as well as the way in which that complexity is being simplified for the clients served.

But what is data, exactly? Raw data, data normalization, aggregate data — the word is thrown around with such frequency that it may prove difficult to come up with a concrete and coherent definition of such a broad term, even when applied to the real estate field.

Before a company can provide the data its clients want — quality data — it is vital that it has at least a basic understanding of what data and terminology associated with data mean in real estate. Read the rest of this entry »

Written by Tara Powers

July 25, 2008 at 4:44 pm

Posted in Informatics

Tagged with