In software engineering the “big data” catchphrase refers to in-homogeneous large-scale data that can stem from all software development cycles. Such data can be: source code, software bugs and errors, system logs, commits, issues from backtracking systems, discussion threads from consulting sites (e.g. stackoverflow.com), emails from mailing-lists, as well as developers’ demographic data and characteristics and user requirements and reviews. Software engineering can benefit from the aforementioned data in many ways, but there are several challenges regarding the handling of such data.
Comments about The 16th International Conference Interacción 2015
On sept 22, 2015, the Interaccion 2015 International Conference of HCI came to the Universitat Politecnica of Catalonia on Vilanova i la Geltrú (Barcelona, Spain).
Overview
More than 150 professionals, researchers and students come to that series of Conferences, known to be in the top of the Spanish HCI Conferences with the sponsoring of SCHI, the ACM and the AIPO organizations. These series of Conferences shows the work of 105 publications of 22 differents countries. Continue reading
ICDM 2015 Recap Part II: Sampling
In a previous post I summarized some of the plenary talks from the most recent ICDM held in Atlantic city. In this follow up, I will discuss some of the ideas from sessions.
In the main conference track, there were sessions spanning over many of today’s trending topics in computer science: Big Data, social network mining, clustering, spatio-temporal and multilabel learning, classification, dimensionality reduction, and online and social learning. The approaches and applications varied from session to session and talk to talk, but there was, naturally, an overarching theme of efficiently and effectively working with data.
How to Automatically Scan Multiple Files with Multiple Antiviruses
Recently, I’ve been working on a project where I needed to scan a large number of .apk files for potential malware or malicious intent. Given the fact that antiviruses produce many false positives, it would be better for me to scan the files by using more than one antivirus. During a discussion with a colleague, he mentioned the VirusTotal service. VirusTotal is a free service in which a web user can scan files and URLs to see if they are related to any kind of malicious behavior (viruses, worms, Trojans, etc.). To do so, it uses 55 different antiviruses and 61 scan engines. Using it is pretty straightforward: users upload a file and when the engines finish their analysis the results are displayed. Continue reading
