Intelligence you can count on

The official FPinfomart blog.

  • Archives

  • Categories

  • Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Coverage and Copyright

Posted by Jennifer Stein on March 11, 2010

A reader posed the following question in response to our previous post on enhancing FPinfomart content:

“As a throw-back to the first article in this series, I’m interested in learning more about which content is included in Infomart. For example, Infomart offers the National Post as a source, but not all articles from the National Post are in the database. Can you tell me what the criteria is for whether articles are included? Does it always have to do with copyright? Is there any way that an Infomart user can determine whether articles from a subscribed source are missing?”

This is a great question, and definitely merits its own post.  And, the answer, like many other things is (unfortunately), “it depends.”

There are 4 basic reasons why an article that was published in a newspaper would not be available on FPinfomart:

1. The author is a freelancer (i.e. not employed directly by the newspaper or publication in which his/her work is published), and has not authorized the publisher to distribute the work in any other medium than the original printed format.

2. The story is republished from a 3rd party source (usually a newswire, such as Reuters, Canadian Press, Associated Press, etc.) who has authorized the newspaper to publish their copy in print, but not to distribute the work in any other medium.

3.  The story has been retracted or suppressed (usually for legal reasons) after publication.

4.  The article was actually paid content, such as an ad supplement, an advertorial, “special section,” Appointment Notice, etc.  In the majority of cases, FPinfomart does not carry such paid content, as a 3rd party has paid the publisher to run it, and therefore it does not carry through to a database such as ours.

The part of this answer that is a bit fuzzy is exactly what happens when an article meets one of the criteria above.

For content that is owned or enhanced by Canwest, we actually add ALL content (with the exception of that belonging to type 4, above) to our database.  We keep a “blacklist” of content that is not allowed to be distributed by FPinfomart for the reasons specified above (specific bylines; original sources, articles that have been retracted, etc).  When displaying a list of results to you, we check all articles against this “blacklist,” and if any meet the criteria for suppression, we display this message to you:

Example of a supressed story

The advantage here, of course, is that even though you can’t read the text, you at least know the article appeared, when, and the citation details.

For content that we receive in database form from other publishers (the hundreds of sources we do not enhance ourselves), we are not able to do this, because the publisher removes the articles (also using the criteria stated above) BEFORE they send us the content.   Therefore, we never receive the restricted articles, and cannot provide even citations for them.  We are not provided with a list of suppressed content, and therefore it is as invisible to us as it is to you.

Thank you, Michelle, for submitting your question.  I’m sure there were a lot of other FPinfomart users wondering the same thing.  I hope this explanation provides the answer you were looking for.

More questions about how FPinfomart works?  Let us know in the comments.  You may inspire our next post!

Posted in content | Tagged: | Leave a Comment »

New Blog Feature: The FPinfomart Scavenger Hunt!

Posted by Jennifer Stein on March 8, 2010

Need a quick break?  We thought so.  We’re going to try an experiment:  an FPinfomart scavenger hunt.  We’ll provide a question, you get to use FPinfomart to find the answer.  First person to enter the correct answer here in the comments wins immortal fame on this blog (feel the glory!)

If this proves popular, we’ll try to run this feature on a weekly basis.  Follow us by RSS, on Twitter, or subscribe to updates by e-mail (see the handy subscription box on the left-side panel) so you can be notified right away when new scavenger hunts are posted.

Please follow the answer format below so that we know you’ve used FPinfomart to find your answer!  The first to successfully answer all 4 questions below will be declared the winner.  (Hint:  You’ll need to search the Archives).

Last night’s Best Actor and Actress Oscar winners were Jeff Bridges and Sandra Bullock.  These two have actually appeared in a film together in the past.

  1. What is the Title of this film?
  2. What year was it released?
  3. What is the name of a third big-name star who also appeared in this film?
  4. Provide a citation for your answer – must include article title, name of paper, date of publication, and byline

Even if a winning answer has already been posted, please let us know in the comments if you’d like us to continue this game on a regular basis!

Posted in Announcements, Scavenger Hunt | Tagged: , , | 4 Comments »

Monthly User Poll – March 2010

Posted by Jennifer Stein on March 5, 2010

This month, we would like to get a better idea of how our clients are using the tools and data within the FP Advisor module of the FPinfomart.ca product.

FP Advisor encompasses a suite of tools to help you research Canadian corporate data.  From basic directory-style listings to in-depth financials, FP Advisor is a valuable tool in your intelligence arsenal.

Answers to our monthly polls are collected anonymously, although you are invited to fill in your contact information at the end of the survey, if you wish to be entered in a prize draw. Please click here to take the survey. Responses are due by noon (ET) on Friday, March 12th.

Posted in Monthly Poll | Tagged: , , | Leave a Comment »

FPinfomart Monthly Newsletter – March 2010

Posted by Jennifer Stein on March 4, 2010

Our monthly newsletter was sent by e-mail this afternoon. A PDF copy is attached here.

If you do not already receive our newsletter and would like to be added to our distribution list, please see our information page.

FPinfomart Monthly Newsletter March 2010

Posted in newsletter | Tagged: | Leave a Comment »

From page layout to database: Enhancing content for FPinfomart

Posted by Jennifer Stein on March 3, 2010

Earlier in the week, I posted a description of how the sources that make up FPinfomart are selected.  Today’s post addresses the technical aspect of “getting content into FPinfomart.”

As I was composing the first post in this series on “how content gets into FPinfomart” I had intended to describe only the licensing issue.  As I wrote, however, it occurred to me that the actual process of moving content from a variety of media into a single, uniform format may also be a bit of a mystery to a lot of our users.  Today’s post attempts to lift the veil and give you a bit of insight into just how we make 4,700+ sources searchable, consistently-formatted, and usable in a database.

Newspaper Front Page

How do we get from here...

Newspaper fulltext article

... to here?

We have many sources for our data.  Some of it comes to us from 3rd parties (publishers, aggregators, etc.) while other content is produced here.  Some of the content comes pre-formatted for us, while other content is run through various automatic conversion programs, and some content is converted through more manual processes.

This post will focus on a process called “Enhancing,” which is how we turn a page formatted for a printed newspaper into separate articles in the FPinfomart database.  We Enhance all the Canwest-owned newspapers, as well as close to 100 additional community papers for whom we provide this service (without which they would have no digital archive of their own newspaper).

Once the paper is sent to the printing press, an electronic copy is sent to us in either PDF or Quark format.  Since this file is laid out for printing, we need to do some work not only to separate out all the individual stories, images, and ads, but also to ensure that the various parts of each story are correctly identified – such as the headline, the byline, the page number, the name of the section in which it appeared, etc – i.e. the fields in our database.

To do this, we use a piece of custom-built software.  We run the formatted pages through this software, which automatically detects upwards of 80% of this information, placing the appropriate information in the correct field.  The data that the software can’t detect, (and some that needs correcting) is handled by our skilled team of Enhancers (individuals trained to get the right data to the right place, very quickly, accurately, and often in the middle of the night).  We have Enhancers working around the clock, because the news never sleeps!

Many of the larger papers that we enhance here arrive in our Enhancers’ inbox shortly after 2:00 a.m. and are online by 4:00 a.m. – which often means you can read a copy of these papers on FPinfomart before they’re available in print.

There are over 70 million documents in the FPinfomart database, and we add between 30,000 and 40,000 new ones every single day.  And, for each of these documents, you can search in standardized fields – such as headline, byline, source, and section.  It’s an amazing undertaking.  I’ll take a moment to thank our Enhancing team and our technical staff for providing us with some order in this potential chaos.  It is because of the routines they’ve created that we are able to find the data we need, without being lost in information overload!

Are there any other mysteries of FPinfomart that you’d like to see described here?  I’d love to hear your suggestions on what other inner workings you’re wondering about.  Use the comments to let me know what interests you! Consider this your backstage pass.

Posted in content | Tagged: | 3 Comments »

How do publications end up on FPinfomart?

Posted by Jennifer Stein on March 1, 2010

As a content aggregator, we bring together news and information sources from a vast variety of geographic regions, subjects, media, and interests.  While collecting this information is old hat for us, a few recent inquires from customers has reminded me that not everyone understands how content makes it into FPinfomart, and why certain sources may not be available.

With Canada’s largest publishing company as our parent, it’s fairly logical that copyright compliance is a big deal to us.  We believe strongly that anytime money changes hands as a result of content that has been created, the content creator is owed payment for that transaction.  (This is the same reason the music and film industries works so hard to protect their content from copying, downloading, etc.)

For much of the internet, the payment for consuming online content is advertising-driven.  Someone will let you read the news, or watch a video, or use their search engine (hello Google!) if you (by visiting and using the site) consent to having ads served to you at the same time.

FPinfomart is a business-to-business product; a premium subscription-based service with a rich feature set and a professional audience.  Subscription fees apply.  We track precisely which articles are viewed, from which publications, and pay a royalty fee to the content owner each time their content is used on our service.

Because of the care with which we treat original content, we require a licensing agreement with each publication that we carry on our service.  We do not crawl content without the permission of the copyright holder.  Therefore, there are several possible reasons why certain sources are not available through FPinfomart, including (but not limited to):

  • Not available in a format that we can put in our database (applies especially to smaller publications)
  • Publisher not interested in licensing to an aggregator
  • Not enough demand from our customer base to justify the addition (e.g. costs to the publisher to format the content may outweigh the potential royalty revenue)
  • Publisher provides content to another aggregator in an exclusive agreement that prohibits providing it to other aggregators

That being said, we keep an active database of the sources that our customers request.  This source request list is reviewed regularly and our licensing team will pursue relationships with publications that have a high demand, or whose content would enrich the FPinfomart experience.  Please let us know if you have any source requests that you’d like to add to our database.  You can send your requests by e-mail, or leave a comment on this post.

Coming in part 2 of this series – a description of how content is transformed from the format of a printed newspaper to an electronic, searchable database. Watch for a post on the Enhancing process, later this week.

Posted in content | Tagged: , | 1 Comment »

More results from our February monthly poll

Posted by Jennifer Stein on February 25, 2010

Last week we shared the winning entry in our Valentine’s love letter contest.   This week, I thought you might like to see a couple of other results from our monthly poll for February.

Graph - length of FPinfomart usage

Our users are a loyal bunch, with the largest group of respondents using the product for over 10 years.

Graph - Favourite Features

When asked to pick one favourite feature, the answers were varied - but Personal Profiles came out on top, follwed by Alerts.

Thanks to everyone who participated in our February monthly poll.  We announce a new poll in each month’s e-newsletter, so watch your inbox in the first week of the new month.

We take your feedback very seriously – this morning I reviewed my statistics and found that in the past year we have implemented 23% of customer suggestions.  We’re very pleased with that ratio, and hope you are too.

Posted in Monthly Poll | Tagged: | Leave a Comment »

FPinfomart is the Lead Sponsor of the 2010 IABC World Conference in Toronto

Posted by Jennifer Stein on February 24, 2010

IABC logoThe IABC World Conference is the leading international forum of its kind. More than 1,400 business communication professionals—decision makers in leading industries around the world—gather annually to learn the latest developments and best practices in communication and make connections that will advance their communication programs and careers.

This year, the annual World Conference will be held in Toronto from June 6-9, and we’re thrilled to be a part of this exciting event.  Watch for more information here, and on the IABC website, as the date approaches.

For more details, program, and registration information, please see the IABC World Conference website.  We hope we’ll see you at the conference!

Posted in Announcements | Tagged: | Leave a Comment »

Spotlight on FPinfomart training

Posted by Jennifer Stein on February 23, 2010

One of the many hats I wear as Product Manager for FPinfomart is to manage our ongoing customer training program.  By know we certainly hope you are aware of our regular FPinfomart training classes.  We offer hands-on classroom training on a seasonal basis, monthly scheduled Virtual (Webex) classes, and a whole host of ad-hoc and customized training options.

We recently conducted a satisfaction survey of those who had participated in recent training sessions and would like to share some of the results with you here.

  • 100% of attendees would recommend FPinfomart classroom training to a colleague.
  • 100% of attendees of Virtual training rated the ease of following the instructor to be “Very Easy.”
  • 100% of virtual training attendees would take virtual training again.
  • 100% of respondents felt that their session(s) covered all material they expected to learn.
  • 95% of respondents reported that they discovered features of FPinfomart that they were not previously aware of
  • 95% of respondents are satisfied that they now know how to use the parts of FPinfomart covered in their session(s) to their full extent.
  • 100% of respondents feel that the hour(s) they spent in training were a valuable investment of [their] time
  • 90% of respondents believe that ALL FPinfomart users should take training.

As you can see from the results above, even those who think they know all there is about FPinfomart can benefit from a session with one of our outstanding trainers.  As well, we have excellent participant satisfaction ratings for our Virtual option – so if you were hesitating to participate via Webex, please do consider joining us online.

We’re very proud of our training program and consistently receive positive feedback.  We hope to see you in one of our classrooms – either in-person or virtually – very soon.

Our next Virtual Training sessions will be held on March 23 and 24th.  However we also offer customized solutions, both virtually, and at your office (within the Toronto area).

For more details and registration information, please see our training schedule.

Posted in Training | Tagged: , | Leave a Comment »

Viewing trends within your results

Posted by Jennifer Stein on February 22, 2010

Did you know that with a single click of your mouse, you can see a 30-day trendline for your search term(s), right from within your results screen?

Simply click the “Trend” link at the top of your results set to open a graph showing the daily hit count for your search, for the past 30 days.  You’ll get a quick graphical view of volume of coverage for the past month.  This will help you understand whether your issue has been building steam, whether it was a “flash-in-the-pan,” if it has peaked, or if coverage is still on the rise.

Trendline for Olympics

A simple Quick Search for "Olympics" reveals that coverage has been mounting, peaked on the day of the opening ceremonies, and continues steady during the Games.

Bryan Adams trendline

A Quick Search for "Bryan Adams" appears to show that there was some speculation as to his involvement in the Games (slight rise before the peak), with a single peak the day after the Opening Ceremonies (in which he was a participant).

Once you’ve clicked the Trend link, this graph will remain open on all your subsequent searches, until you elect to close it by clicking the Trend link again.

Posted in Tips | Tagged: , , | Leave a Comment »