July 5, 2016

Congratulations to Erico Neves De Souza, Kristina Boerder, Stan Matwin and Boris Worm for their recent publication in PLOS ONE.  Their paper can be found online at: 

The abstract is below:

A key challenge in contemporary ecology and conservation is the accurate tracking of the spatial distribution of various human impacts, such as fishing. While coastal fisheries in national waters are closely monitored in some countries, existing maps of fishing effort elsewhere are fraught with uncertainty, especially in remote areas and the High Seas. Better understanding of the behavior of the global fishing fleets is required in order to prioritize and enforce fisheries management and conservation measures worldwide. Satellite-based Automatic Information Systems (S-AIS) are now commonly installed on most ocean-going vessels and have been proposed as a novel tool to explore the movements of fishing fleets in near real time. Here we present approaches to identify fishing activity from S-AIS data for three dominant fishing gear types: trawl, longline and purse seine. Using a large dataset containing worldwide fishing vessel tracks from 2011–2015, we developed three methods to detect and map fishing activities: for trawlers we produced a Hidden Markov Model (HMM) using vessel speed as observation variable. For longliners we have designed a Data Mining (DM) approach using an algorithm inspired from studies on animal movement. For purse seiners a multi-layered filtering strategy based on vessel speed and operation time was implemented. Validation against expert-labeled datasets showed average detection accuracies of 83% for trawler and longliner, and 97% for purse seiner. Our study represents the first comprehensive approach to detect and identify potential fishing behavior for three major gear types operating on a global scale. We hope that this work will enable new efforts to assess the spatial and temporal distribution of global fishing effort and make global fisheries activities transparent to ocean scientists, managers and the public.

April 27, 2016

On Monday April 25 Dr. Marina Sokolova took part in a panel organized by the Interdisciplinary Research Group in Organizational Communication at the University of Ottawa.    Other invited speakers were Lewis Eisen from the Treasury Board of Canada and Karim Bechane from the Canadian Food Inspection Agency.  The topic was"The Challenge of Managing Information in Organizations: From Big Data to Thick Data".

March 1, 2016

Problems should be solved. Pipe dreams should be pursued. Pitch us your data project and we could put a team led by one of Canada’s top data scientists to work for your small or medium-sized business for six months — for free.

For more details go to the competition page.

December 7, 2015

A team of 3 students from the Institute for Big Data Analytics have walked away with the prize of the "Most Innovative Use of Big Data", and a cheque for $800 from the Sports Hack 2015 competition, held Nov 27-29th in Vancouver, Toronto and Halifax and sponsored by a range of big name tech companies, universities and the CFL.  Forty-one teams competed in the challenge to produce an app to encourage fan engagement using datasets from the CFL.  Our team created an app that tracked the Tweets of CFL fans providing them with rankings of their activity levels and a points system to reward their activity and their participation in mini-games about the football matches, making use of both machine learning techniques and sentiment analysis.  The app could also provide CFL organizations with information about their fans activities and locations and provide the opportunity to advertize to fans, to convert fan points into merchandise of the organizations or their sponsors.  It would be fan-centric rather than team-centric .  The contest was a big challenge, given the hard work, the lack of sleep, and the tight competition.









Gurcan Gercek, Hossein Sarshar and Behrouz Haji Soleimani

November 16, 2015

Congratulation to Masters student Hossein Sarshar and Research Assistant Pedram Adibi from the Institute for Big Data Analytics, and their colleague, Dalhousie alumnus Mehran Zamani, who, as Team Sol-Ops, won first prize in the Smart Energy Apps Challenge sponsored by Innovacorp, HRM, NSCC, Shiftkey Labs and Dalhousie University.  The challenge was to create an app which made use of the data collected by Halifax Solar City on a number of parameters relating to installed solar hot water systems.  Team Sol-Ops created an app which made use of techniques of computer science, data science and engineering to provide customers with an analysis of their usage of hot water, and recommendations for changing that behaviour to maximize the environmental and economic benefits they could receive from their systems.  It took 6 weeks to create the winning app, beating 10 other teams and claiming the $6000 first prize.  Success in the competition may also be the stepping stone to other opportunities as the team finds themselves in conversations with investors and other organizations who are taking in interest in the commercial potential of their creation.

July 14, 2015















Rob Warren gave a talk on the Panel on Linked Open data at the Digital Humanities Conference, Sydney Australia, June 29 - July 3. 

His paper explored the notion of place, feature and geometry in the context of the Great War using Linked Open Data. In previous works, the translation of obsolete military coordinates through API’s (Application Program Interface) was previously covered. He reviewed their use as an efficient and effective means of indexing archival documents about the war. Most war diaries, operations orders and dispatches in British and Dominions records refer to locations using both named features and coordinates. This permits the geo-referencing of each statements within a document to find the current location in question while segmenting the document according to different spatial component.

March 17, 2015

Data-Driven Augmented Reality for Museum Exhibits and Lost Heritage Sites.
Museums on the Web 2015 (
Palmer House Hilton, Chicago, IL, USA
April 8-11, 2015

We review the possibilities, pitfalls, and promises of recreating lost heritage sites and historical events using augmented reality and "Big Data" archival databases. We define augmented reality as any means of adding context or content, via audio/visual means, to the current  physical space of a visitor to a museum or outdoor site. Examples range from simple prerecorded audio to graphics rendered in real time and displayed using a smartphone.

Previous work has focused on complex multimedia museum guides, whose utility remains to be evaluated as enabling or distracting. We propose the use of a data­-driven approach where the exhibits' augmentation is not static but dynamically generated from the totality of the data known about the location, artifacts, or event. For example, at Bletchley Park, reenacted audio conversations are played within rooms as visitors walk through them. These can be called "virtual contents," as the audio recordings are manufactured. Given that a number of documentary sources, such as meeting minutes, are available concerning the events that occurred within the site, a dynamic computer-generated script could add to the exhibits.

Visitors' experiences can therefore react to their movements, provide a different experience each time, and be factually correct without requiring any expensive redesign. Furthermore, the use of a data-driven approach allows for the updating of exhibits on the fly as researchers create or curate new data sources within the museum. If artifacts need to be removed from an exhibit, pictures, descriptions, or three-dimensional printed copies can be substituted, and the augmented reality of visitor experience can adapt accordingly.

March 11, 2015

Over 1,000 of the world's leading edge researchers and practitioners in big data are coming to Halifax for the 2017 Conference on Knowledge Discovery and Data Mining.

Stan Matwin, Canada Research Chair (Tier 1) at the Faculty of Computer Science, Dalhousie University and the director of the Institute for Big Data Analytics, announced today, March 9, that Halifax was the successful bidder. The conference, with Dr. Matwin as the general chair and Evangelos Milios of Dalhousie University as the local chair, will be held in the new Halifax Convention Centre, which opens in 2017.

The bid was led by the Institute for Big Data Analytics and the Halifax Convention Centre, in collaboration with a local host committee of academic, government and industry representatives.

"This announcement is further evidence that the Institute for Big Data Analytics has established Dalhousie and Halifax as leaders in the global fields of data science and big data analytics," said Dalhousie president Richard Florizone.

"This is a great opportunity for Dalhousie -- as well as other local organizations and institutions -- to showcase the world-class research, ongoing collaboration and pool of talent we have here in the region to national and international audiences," said Dr. Matwin

Halifax now ranks amongst top cities who have previously hosted the conference, including:

-- Sydney, Australia

-- New York City, New York

-- Beijing, China

-- Paris, France

"We're proud to partner and collaborate with our local experts to host this conference and showcase Nova Scotia's strengths in big data to the world," said Scott Ferguson, president and CEO of Trade Centre Limited, the Crown Corporation that manages the convention centre. "This is an exciting opportunity for our industry, academic and research communities to highlight their work and connect with their global counterparts."

Nova Scotia has a booming information technology sector and the province is quickly establishing itself as an international hub of excellence in big data research. Hosting this conference will allow the local sector to benefit from top academic research and industrial presence aimed at promoting collaboration and growth in big data analytics.

First established in 1995, the Knowledge Discovery and Data Mining conference is the premier international forum for data mining and big data research, bringing together practitioners from academia, industry, and government to share their ideas, research results and experiences. Sponsors of note include Microsoft, Yahoo, Deloitte, Accenture, Facebook, LinkedIn, Google and IBM. The 2016 conference will take place in San Francisco.

March 4, 2015

The 18th International Conference on Discovery Science (DS 2015) will be held in Banff (Canada), on 4-6 October 2015, and provides an open forum for intensive discussions and exchange of new ideas among researchers working in the area of Discovery Science. The scope of the conference includes the development and analysis of methods for discovering scientific knowledge, coming from machine learning, data mining, and intelligent data analysis, as well as their application in various scientific domains.  We welcome papers that focus on the analysis of different types of complex data, such as structured, spatio-temporal and network data. We particularly welcome papers addressing applications. Finally, we would like to encourage contributions from the areas of computational scientific discovery, mining scientific data, computational creativity and discovery informatics.
For more information:

January 7, 2015
The 28th Canadian Conference on Artificial Intelligence, invites graduate students to submit summary (abstract) papers of their thesis research for possible inclusion in the AI 2015 Graduate Student Symposium (GSS-2015) and the AI 2015 proceedings published by Springer Verlag in the LNAI series. The Symposium provides an opportunity for Master’s and PhD students to discuss and explore their research interests and career objectives with their peers and with a panel of established researchers in Artificial Intelligence, helping to develop a supportive community of scholars and a spirit of collaborative research.
The symposium will be a pre- AI/GI/CRV-2015 conference event, on June 2 from 12:00-6:00pm, where students of accepted abstracts will be invited to give a presentation on their thesis work before a group of peers as well as a small team of recognized AI researchers who will offer a critique of each presentation and provide support, advice, and mentoring. The top 20 submissions will also be invited to participate a poster session on the evening of June 2 during the AI 2015 main conference reception. This will be a great opportunity to present and discuss your work with others.
Graduate students are invited to submit a 4-page summary of their on-going thesis work from all areas of Artificial Intelligence. All submissions must be written in English. The paper should clearly state the research problem, the proposed solution and approach and the description of the progress to date, including significant results. Program committee members will review each submission. Presenting students will be selected based on clarity of the submission, difficulty of the problem, novelty of the solution, quality of the research, and evidence of promise such as published papers or technical reports.
Partial financial assistance for travel and accommodations will be available to students presenting at the Symposium, as funding allows.
For more details, the complete Call fro Papers, submission instructions, and an FAQ list, please see the GSS-2015 website at
Marina Sokolova <>
Co-chairs, 2015 Canadian AI Graduate Student Symposium