About Me
Hi, I’m Roque!
I am a research engineer at New York University (NYU), I have been involved in various research and development projects throughout my career.
Most of my background and experience is in applied machine learning and natural language processing, and I also have a interest in reinforcement learning.
Research Software
BDI-Kit, a Python toolkit specifically designed to streamline biomedical data integration. BDI-Kit provides a suite of methods and intuitive APIs to facilitate the efficient harmonization of diverse datasets. By simplifying the integration process, BDI-Kit enables practitioners and researchers to create unified datasets.
This is part of the BDF project of ARPA-H.
AlphaD3M, an AutoML library implemented in Python that automatically synthesizes end-to-end pipelines for different machine learning tasks and different data types. Through an API, it allows the users to explore the input data and the derived pipelines, as well as customized the pipelines.
This tool is part of NYU’s implementation of the Data Driven Discovery project (D3M), DARPA.
Publications
Selected publications. For the complete list, please visit my Google Scholar profile.
Roque Lopez, Raoni Lourenco, Remi Rampin, Sonia Castelo, Aécio Santos, Jorge Ono, Claudio Silva, Juliana Freire
AutoML Conference, 2023
Jorge Ono, Sonia Castelo, Roque Lopez, Enrico Bertini, Juliana Freire, Claudio Silva
IEEE Visualization Conference, 2020
Education
São Paulo University
MSc in Computer Science
2013 - 2015
Monograph title: Automatic Aspect-Based Opinion Summarization Methods
In this master’s project are presented investigations to generate extractive and abstractive summaries of opinions using an aspect-based approach. Besides using known methods in the area, it also was proposed two new methods for Portuguese language which got the best performance in the experiments.
San Agustin National University
BSc Computer Science
2006 - 2010
Monograph title: Medical Documents Classification based on Keywords using Semantic Information
In this monograph, it is presented a method to classify medical documents which improves the results of Naive Bayes and Rocchio algorithm. This method, in addition to considering statistical information, taking into account the semantic relatedness between the keywords of medical documents.
CV
You can find a detailed version of my complete CV, including additional information about my professional and research experience, by clicking here.