OpenPlaces travel news

We present a way to build feeds of travel-related news (e.g., local cultural/sport event, traveler blog, hotel strike, etc.) for specific destinations (city, state, country). We've used the Openplaces' travel ontology and traveliness guesser and mixed them up with the DayLife API to create a feed of travel-related news.

OpenPlaces travel news is demoed at: http://www.openplaces.com/news

The Recipe

There are six steps to this recipe:

  1. Choose a place (e.g., Montreal).
  2. Match the search term to a DayLife topic of type place
    • Call search_getMatchingTopics with the search term.
    • Loop through topics and find topic with type equals to place.
    • If DayLife topic and search term does not match, try these two backup plans, leveraging ontological data:
      • Check for alternate spelling ("also known as") of the search term.
      • Check for parent places.
  3. Get articles related to the topic using topic_getRelatedArticles (Use the topic_id found in step 2 as parameter).
  4. Get articles related to the topic from known travel sources.
    • Use source_getArticles for known travel sources (e.g., Travel Channel, Smarter Travel)
    • keep articles with matching topic using article_getRelatedTopics.
  5. Fetch web pages and rank them by "traveliness".
    • Use a web page classifier, i.e., a trained classifier that knows the vocabulary of url, title, content, anchors, etc., and can distinguish between a Web page related to travel and one not related to travel.
  6. Select final news to the travel feed.
    • Promote all articles with very high score in document classification.
    • For all articles with medium score, get related articles using article_getRelatedArticles.
    • If the mean score of related articles is also medium/high then it is safe to promote the article in the feed.
Additional information about step 6

Relying solely on document classification tends to produces accurate but shallow result sets (i.e., high precision, but low recall). Through simple observation, we have noticed that direct classification will ignore some highly relevant results that have a medium traveliness score. Because these "medium scoring" items are lost in a sea of irrelevant news that has similar traveliness scores, lowering the acceptance threshold is not a viable option. The DayLife API helps us by providing a cluster of related news (called "stories"). By analyzing the mean traveliness score of articles related to the medium scoring articles, we are able to further differentiate them from the background noise, thus increasing the overall recall.

Conclusion

We've shown how the DayLife API can be mashed-up with ontological data and web classification technology to create a feed of travel-related news. We present the Traveliness Guesser in more detail in this White Paper.

Enjoy your next trip!

About Openplaces:

Openplaces is a semantic search engine for travel. We are a startup based in Montreal, Canada. Visit www.openplaces.com to read more, and register for our upcoming launch.