51°45’35.5” N, 1°15’30.9” W
51°45’35.5” N, 1°15’30.9” W
Wrapidity’s mission is to open up humanity’s greatest resource—the web—to everyone.
For decades, we were promised that the web would become humanity’s greatest database. And that all that data would be available as XML, RDF, or Web 2.0 APIs. Yet, what remains is a vast ocean of dark data, hidden behind the surface in the silos of the deep web. To unearth the potential insights hidden in this accumulated information, the hidden data (estimated to be 80% of all Web data) has to be painstakingly extracted and refined to become amenable to further analysis.
Businesses and data scientists in many verticals have recognized the tremendous value of such insights, but also the punitive cost for their collection. Data scientists spend around 80% of their time on data collection and preparation, and they really hate doing so.
With Wrapidity’s technology every business, every policy maker, and every scientist will be able to afford including relevant web data into their decision models, ultimately leading to better, more fact-driven decisions.
Knowledge + redundancy + specialised AI.
$5M dollar top-tier European research grant, leading to dozens of publications in top international AI, database, and web technologies conferences.
You think you have an idea how to better match people and jobs? Or how to answer “where are Italian restaurants along this route”? Or how to find out where a competitor is currently building up new operations?
But where do you get the necessary data on current job offers, restaurant menus, or other product offers? That’s where Wrapidity comes in: We quickly provide a highly structured database of all offers or goods you are interested in, whether they come from a few websites or hundreds of them—a database you can use to build better search, better recommenders, or better analytics.
This doesn't just help you build your application faster, it also makes applications possible that previously were out of reach even for the internet giants:
This is the era of specialized AI—AI judiciously tailored to specific problems such as image recognition, machine translation, or, with Wrapidity, automatic data extraction. Extracting structured data from the web has been one of the long standing challenges in search and knowledge acquisition that has withstood repeated attempts at solving it in a generic fashion. With Wrapidity we have developed an object extraction system that exploits extensive metadata about the relevant objects (in form of both a schema and sample instances). With this approach we outperform existing semi-supervised and unsupervised approaches by a wide margin (> 95% accuracy on a wide range of domains and sites).
How websites work and what data is relevant for your problem?
Most of this knowledge is generic, but some of it is task- or vertical-specific and thus needs to be acquired for each task or vertical—e.g., that location is key in real estate. While this acquisition often requires some human supervision, it is only needed once for an entire vertical.
Humans think in patterns and thus most websites follow a common set of conventions for presenting data, e.g., most shops will have a prominent price information.
Wrapidity exploits redundancy at many levels, whether in the presentation of the data, the actual instances in the same source, or instances shared between sources.
Wrapidity has developed a specialized AI for the autonomous exploration and classification of web sites and their constituent objects. This AI is able to adapt itself to each website by automatically composing atomic exploration actions and relies on a specialized entity recognition that considers page context rather than textual context.
In Wrapidity we combine passion for solving hard problems for real customers with a drive for pushing the envelope in technology and science. The technical members of the team all have been previously involved in startups and build solutions for customers ranging from one-people companies to multi-national media conglomerates. They have jointly published more than 500 DBLP-listed publications with over 20k cumulative citations.
Want to get a quick overview of what Wrapidity is about? Then this is the right place for you. Read more.
Quick overview of Wrapidity’s vision and technology. Look out for the case studies, including millions of restaurant locations extracted with 90%+ accuracy. Read more.
Wrapidity’s extraction language OXPath, originally published at VLDB 2011 and selected for inclusion into this best paper issue of the VLDB Journal. Read more.
Extended slide set that also illustrates the underlying components and technology. Read more.
67-71 Shoreditch High St
London E1 6JJ