OpenGraph Web Pages deciphered

Update #3 · Apr 15, 2012

This may sound trivial, but it's actually important. A recent update now uses the meta generator tag to tell WordPress, Blogger, and other article-based web pages apart from ordinary websites, so that they are typed as "article" OpenGraph objects with the associated metadata rather than as website OG objects. We also treat HTML5 <article> tags as valid article markers, and we try to pull articles out of public-facing social network sites as well.
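As a rough illustration of the idea (not the actual CDNPAL crawler code), the generator-tag and <article>-tag checks could be sketched in Python like this. The names `ARTICLE_GENERATORS`, `PageTypeSniffer`, and `guess_og_type` are hypothetical, the generator list is an assumed sample, and the social network handling is left out.

```python
from html.parser import HTMLParser

# Assumed sample of generator strings that indicate an article platform.
ARTICLE_GENERATORS = ("wordpress", "blogger")


class PageTypeSniffer(HTMLParser):
    """Collects the two signals described above while parsing a page."""

    def __init__(self):
        super().__init__()
        self.generator = None          # content of <meta name="generator">
        self.has_article_tag = False   # True once an <article> tag is seen

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "meta" and (attr.get("name") or "").lower() == "generator":
            self.generator = (attr.get("content") or "").lower()
        elif tag == "article":
            self.has_article_tag = True


def guess_og_type(html):
    """Return "article" or "website" for a page's HTML source."""
    sniffer = PageTypeSniffer()
    sniffer.feed(html)
    if sniffer.has_article_tag:
        return "article"
    if sniffer.generator and any(g in sniffer.generator for g in ARTICLE_GENERATORS):
        return "article"
    return "website"
```

For example, `guess_og_type('<meta name="generator" content="WordPress 3.3.1">')` returns `"article"`, while a page with neither signal falls back to `"website"`.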

It would be nice if the OpenGraph specification gave us more types to work with, but for right now that's what we have. It's pretty accurate at determining article content on most sites.

We're also creating OG book objects when our logic determines that a website is a book or that a PDF/PS file is actually an e-book.

Again, this may seem trivial, but it's actually very important to work out the context of web pages. It's something Google doesn't do very well, and that's part of what makes the web hard to digest at this point in time.

It also plays nicely into our document-voting logic: having document types categorized this way means that you can vote on articles in general, or on articles of a certain type by section, e.g. Technology articles.
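To make the voting idea concrete, here is a small Python sketch of tallying votes by OG type and, optionally, by section. The vote record layout (`url`, `og_type`, `section`, `value`) and the function name `tally_votes` are assumptions made for illustration; the real CDNPAL data model may look quite different.

```python
from collections import Counter


def tally_votes(votes, og_type, section=None):
    """Sum vote values per URL for one OG type, optionally one section.

    `votes` is an iterable of dicts with the hypothetical keys
    "url", "og_type", "section", and "value".
    """
    counts = Counter()
    for vote in votes:
        if vote["og_type"] != og_type:
            continue
        if section is not None and vote.get("section") != section:
            continue
        counts[vote["url"]] += vote["value"]
    # Highest-scoring documents first.
    return counts.most_common()
```

Calling `tally_votes(votes, "article", section="Technology")` would then rank only Technology articles, while leaving out `section` ranks all articles together.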

15 backers
$269 pledged of $100,000 goal

Funding Canceled

Funding for this project was canceled by the project creator on May 24, 2012.

Funding period
Apr 3, 2012 - Jun 2, 2012 (60 days)

  • Pledge $1 or more

    4 backers

    1 year membership to the finished CDNPAL search engine community plus 50 advertising credits for keyword bid advertising.

    Estimated delivery: Oct 2012
  • Pledge $1 or more

    0 backers

    We will send you a private link to a mini-HTML5 game we make using Hype for Mac where you get to shoot the CDNPAL Rabbit in a Duck Hunter style game for points.

    Estimated delivery: Jun 2012
  • Pledge $5 or more

    3 backers

    Have your name included on a thank you page linked from the home page of the website plus all previous tiers.

    Estimated delivery: Oct 2012
  • Pledge $10 or more

    2 backers

    1 year API key to build applications on the CDNPAL Search REST API framework. Anything that can consume a REST API can use this, including but not limited to iOS, Android, WP7, Flash and other development tools, plus all previous tiers.

    Estimated delivery: Oct 2012
  • Pledge $25 or more

    3 backers

    Access and a perpetual license to use and modify the full CDNPAL Search source code in any way you want, including the REST API framework, the WWW crawling robot, the algorithm-based search map/reducer, and the iOS, Android and other front-end applications, as tarballs and Amazon AMI instance images, plus 1 year of community support via member forum, plus all the previous tiers. *NOTE: The search engine stack is based on Apache HBase, Hadoop, DataNucleus, and Spring Framework 3, which provides the REST API. You can download the stack freely.

    Estimated delivery: Oct 2012
  • Pledge $50 or more

    1 backer

    A black high quality CDNPAL ball cap with our cool looking ice blue wifi logo on the front area of the cap plus all previous tiers.

    Estimated delivery: Oct 2012
  • Pledge $100 or more

    0 backers

    Lifetime API key to build your GUI applications on top of the CDNPAL REST API services plus all previous tiers.

    Estimated delivery: Oct 2012
  • Pledge $250 or more

    0 backers

    A color poster of the class and module hierarchy overview of the CDNPAL search engine, from its big column data store to its algorithm-crunching WWW data factoring to its REST API delivery to GUI clients, plus all previous tiers.

    Estimated delivery: Dec 2012
  • Pledge $500 or more

    0 backers

    A special CDNPAL artwork picture book containing the story of the search engine, with colorful pictures and graphics including scenic pictures of venues in Los Angeles, where parts of the search engine were made.

    Estimated delivery: Mar 2013
  • Pledge $1,000 or more

    0 backers

    Your advertising link on top of every search matching up to 5000 search terms no matter what on the desktop version of search for an entire year in rotation with other funders at this level plus all previous tiers.

    Estimated delivery: Oct 2012
  • Pledge $2,000 or more

    0 backers

    Your advertising link on top of every search matching up to 5000 search terms no matter what on the mobile (iOS & Android app and web) version of search for an entire year in rotation with other funders at this level plus all previous tiers.

    Estimated delivery: Dec 2012
  • Pledge $7,000 or more

    0 backers Limited (25 of 25 left)

    Dinner at the Cheesecake Factory in Los Angeles with the founders Shaun and Christopher, where you will be presented with a certificate for helping to fund CDNPAL search, plus all previous tiers. * You must show up to claim this prize.

    Estimated delivery: Oct 2012
  • Pledge $10,000 or more

    0 backers

    Your name or company name permanently etched across the footer of the website as a producer, and you get your own private REST API function of your choosing to factor the WWW data we have plus all previous tiers.

    Estimated delivery: Nov 2012