Research Engineer

About Me

Hi, I’m Roque!

I am a research engineer at New York University (NYU), I have been involved in various research and development projects throughout my career.

Most of my background and experience is in applied machine learning and natural language processing, and I also have a interest in reinforcement learning.

Research Software

BDI-Kit

BDI-Kit, a Python toolkit specifically designed to streamline biomedical data integration. BDI-Kit provides a suite of methods and intuitive APIs to facilitate the efficient harmonization of diverse datasets. By simplifying the integration process, BDI-Kit enables practitioners and researchers to create unified datasets.

This is part of the BDF project of ARPA-H.

AlphaD3M

AlphaD3M, an AutoML library implemented in Python that automatically synthesizes end-to-end pipelines for different machine learning tasks and different data types. Through an API, it allows the users to explore the input data and the derived pipelines, as well as customized the pipelines.

This tool is part of NYU’s implementation of the Data Driven Discovery project (D3M), DARPA.

Publications

Selected publications. For the complete list, please visit my Google Scholar profile.

AlphaD3M: An Open-Source AutoML Library for Multiple ML Tasks

Roque Lopez, Raoni Lourenco, Remi Rampin, Sonia Castelo, Aécio Santos, Jorge Ono, Claudio Silva, Juliana Freire

AutoML Conference, 2023

PipelineProfiler: A Visual Analytics Tool for the Exploration of AutoML Pipelines

Jorge Ono, Sonia Castelo, Roque Lopez, Enrico Bertini, Juliana Freire, Claudio Silva

IEEE Visualization Conference, 2020

Opinion Summarization Methods: Comparing and Extending Extractive and Abstractive Approaches

Roque Lopez, Thiago Pardo

Expert Systems with Applications Journal, 2017

Education

São Paulo University

MSc in Computer Science

2013 - 2015

Monograph title: Automatic Aspect-Based Opinion Summarization Methods

In this master’s project are presented investigations to generate extractive and abstractive summaries of opinions using an aspect-based approach. Besides using known methods in the area, it also was proposed two new methods for Portuguese language which got the best performance in the experiments.

San Agustin National University

BSc Computer Science

2006 - 2010

Monograph title: Medical Documents Classification based on Keywords using Semantic Information

In this monograph, it is presented a method to classify medical documents which improves the results of Naive Bayes and Rocchio algorithm. This method, in addition to considering statistical information, taking into account the semantic relatedness between the keywords of medical documents.

CV

You can find a detailed version of my complete CV, including additional information about my professional and research experience, by clicking here.