Publications

On optimal regression trees to detect critical intervals for multivariate functional data

Rafael Blanquero, Emilio Carrizosa, Cristina Molero-Río, Dolores Romero Morales
(2023), Computers & Operations Research, DOI: 10.1016/j.cor.2023.106152; Published online: 2023-01-13.

Keywords: Optimal randomized regression trees, Multivariate functional data, Critical intervals detection, Nonlinear programming

 

 


A bounded measure for estimating the benefit of visualization (Part I): theoretical discourse and conceptual evaluation

M. Chen and M. Sbert
(2022), Entropy, DOI: 10.3390/e24020228; Published online: 2022-01-31.

Keywords: information theory, theory of visualization, cost–benefit analysis, divergence measure, benefit of visualization, human knowledge in visualization, abstraction, deformation, volume visualization, metro map

 

 


Design space of origin-destination data visualization

M. Tennekes and M. Chen
(2021), Computer Graphics Forum, DOI: 10.1111/cgf.14310; Published online: 2021-06-29.

Keywords: 

Open-source routines can be found here

 

 


Predicting student performance using sequence classification with time-based windows

Galina Deeva, Johannes De Smedt, Cecilia Saint-Pierre, Richard Weber, Jochen De Weerdt
(2022), Expert Systems with Applications, DOI: 10.1016/j.eswa.2022.118182; Published online: 2022-12-15.

Keywords: Machine learning, Sequence mining, Feature engineering, Success prediction, Behavioral patterns

 

 


On mathematical optimization for clustering categories in contingency tables

Emilio Carrizosa, Vanesa Guerrero, Dolores Romero Morales
(2022), Advances in Data Analysis and Classification, DOI: 10.1007/s11634-022-00508-4; Published online: 2022-06-28.

Keywords: Contingency tables, Mathematical optimization, Relational constraints, Clustering

 

 


The tree based linear regression model for hierarchical categorical variables

Emilio Carrizosa, Laust Hvas Mortensen, Dolores Romero Morales, M. Remedios Sillero-Denamiel
(2022), Expert Systems with Applications, DOI: 10.1016/j.eswa.2022.117423; Published online: 2022-10-01.

Keywords: Hierarchical categorical variables, Linear regression models, Accuracy vs. model complexity, Mixed integer convex quadratic problem with linear constraints

 

 


On sparse optimal regression trees

Rafael BlanqueroEmilio Carrizosa, Cristina Molero-Río, Dolores Romero Morales.
(2021), European Journal of Operational Research, DOI: 10.1016/j.ejor.2021.12.022; Published online: 2021-12-18.

Keywords: Machine Learning, Classification and regression trees, Optimal regression trees, Sparsity, Nonlinear Programming.

 

Relevant open-source routines can be found here

 


Interpreting clusters via prototype optimization

Emilio CarrizosaKseniia Kurishchenko, Alfredo Marín, Dolores Romero Morales.
(2021), Omega, DOI: 10.1016/j.omega.2021.102543; Published online: 2021-09-23.

Keywords: Machine Learning, Interpretability, Cluster Analysis, Prototypes, Mixed-Integer Programming.

 

 


Constrained Naïve Bayes with application to unbalanced data classification

Rafael BlanqueroEmilio Carrizosa, Pepa Ramírez Cobo, M Remedios Sillero-Denamiel.
(2021), Central European Journal of Operations Research, DOI: 10.1007/s10100-021-00782-1; Published online: 2021-10-20.

Keywords: Probabilistic Classification, Constrained optimization, Parameter estimation, Efficiency measures, Naïve Bayes.

 

 


Variable selection for Naïve Bayes classification

Rafael BlanqueroEmilio Carrizosa, Pepa Ramírez Cobo, M Remedios Sillero-Denamiel.
(2021), Computers & Operations Research, DOI: 10.1016/j.cor.2021.105456; Published online: 2021-06-02.

Keywords: Clustering, Conditional Independence, Dependence measures, Heuristics, Probabilistic Classification, Cost-sensitive Classification. 

 

 


On Clustering Categories of Categorical Predictors in Generalized Linear Models

Emilio CarrizosaMarcela Galvis Restrepo, Dolores Romero Morales.(2021), Expert Systems with Applications, DOI: 10.1016/j.eswa.2021.115245; Published online: 2021-05-24.

Keywords: Statistical Learning, Interpretability, Greedy Randomized Adaptive Search Procedure, Proximity between categories. 

 

 


On sparse ensemble methods: An application to short-term predictions of the evolution of COVID-19

Sandra Benítez-PeñaEmilio Carrizosa, Vanesa Guerrero.M Dolores Jiménez-Gamero, Belén Martín-Barragán, Cristina Molero-Río, Pepa Ramírez-Cobo, Dolores Romero Morales, M Remedios Sillero-Denamiel. (2021), European Journal of Operational Research, DOI: 10.1016/j.ejor.2021.04.016; Published online: 2021-04-18.

Keywords: Machine Learning, Ensemble Method, Mathematical Optimization, Selective Sparsity, COVID-19.

 

 


Mathematical optimization in classification and regression trees

Emilio CarrizosaCristina Molero-Río, Dolores Romero Morales.(2021), TOP, DOI: 10.1007/s11750-021-00594-1; Published online: 2021-03-17.

Keywords: Classification and regression trees, Tree ensembles, Mixed-integer linear optimization, Continuous nonlinear optimization, Sparsity, Explainability. 

 

 


Optimal randomized classification trees

Rafael Blanquero, Emilio CarrizosaCristina Molero-Río, Dolores Romero Morales.(2021), Computers & Operations Research, DOI: 10.1016/j.cor.2021.105281; Published online: 2021-03-08.

Keywords: Classification and regression trees, Cost-sensitive classification, Nonlinear programming. 

 

 


A cost-sensitive constrained Lasso

Rafael Blanquero, Emilio CarrizosaPepa Ramírez-Cobo, M. Remedios Sillero-Denamiel.(2020), Advances in Data Analysis and Classification, DOI: 10.1007/s11634-020-00389-5; Published online: 2020-03-12.

Keywords: Performance constraints, Cost-sensitive learning, Sparse solutions, Sample average approximation, Heterogeneity, Lasso.

 

 


Expert-driven trace clustering with instance-level constraints

Pieter De Koninck, Klaas NelissenSeppe vanden Broucke, Bart Baesens, Monique Snoeck, Jochen De Weerdt(2020), Knowledge and Information Systems, DOI: 10.1007/s10115-021-01548-6; Published online: 2020-03-01.

Keywords: Trace clustering, Process mining, Semi-supervised learning, Constrained clustering.

 

 

Relevant open-source software can be found here and here


Sparsity in Optimal Randomized Classification Trees

Rafael Blanquero, Emilio Carrizosa Cristina Molero-Río, Dolores Romero Morales(2019), European Journal of Operational Research, Elsevier Ltd., DOI: 10.1016/j.ejor.2019.12.002; Published online: 2019-12-16

Keywords: Data mining, Optimal Classification Trees, Global and Local Sparsity, Nonlinear Programming.

 

 


Enhancing Interpretability in Factor Analysis by Means of Mathematical Optimization

, ,  & (2019), Multivariate Behavioral Research, DOI: 10.1080/00273171.2019.1677208; Published online: 2019-10-30

Keywords: Exploratory factor analysisinterpretabilityfactor rotationexplanatory variablesmathematical optimization

 


Feature Selection in Data Envelopment Analysis: A Mathematical Optimization approach