High-performance tensor contractions for GPUs A Abdelfattah, M Baboulin, V Dobrev, J Dongarra, C Earl, J Falcou, ... Procedia Computer Science 80, 108-118, 2016 | 76 | 2016 |

High-performance matrix-matrix multiplications of very small matrices I Masliah, A Abdelfattah, A Haidar, S Tomov, M Baboulin, J Falcou, ... Euro-Par 2016: Parallel Processing: 22nd International Conference on …, 2016 | 69 | 2016 |

Algorithms and optimization techniques for high-performance matrix-matrix multiplications of very small matrices I Masliah, A Abdelfattah, A Haidar, S Tomov, M Baboulin, J Falcou, ... Parallel Computing 81, 1-21, 2019 | 24 | 2019 |

Designing efficient SIMD algorithms for direct connected component labeling A Hennequin, I Masliah, L Lacassagne Proceedings of the 5th Workshop on Programming Models for SIMD/Vector …, 2019 | 13 | 2019 |

Data layout and simd abstraction layers: decoupling interfaces from implementations S Jubertie, I Masliah, J Falcou 2018 International Conference on High Performance Computing & Simulation …, 2018 | 11 | 2018 |

Metaprogramming dense linear algebra solvers applications to multi and many-core architectures I Masliah, M Baboulin, J Falcou 2015 IEEE Trustcom/BigDataSE/ISPA 3, 69-76, 2015 | 10 | 2015 |

A new real-time embedded video denoising algorithm A Petreto, T Romera, F Lemaitre, I Masliah, B Gaillard, M Bouyer, ... 2019 Conference on Design and Architectures for Signal and Image Processing …, 2019 | 9 | 2019 |

Towards a high-performance tensor algebra package for accelerators M Baboulin, V Dobrev, J Dongarra, C Earl, J Falcou, A Haidar, I Karlin, ... Smoky Mountains Computational Sciences and Engineering Conference (SMC 2015), 2015 | 7 | 2015 |

Meta-programming and Multi-stage Programming for GPGPUs I Masliah, M Baboulin, J Falcou 2016 IEEE 10th International Symposium on Embedded Multicore/Many-core …, 2016 | 6 | 2016 |

Small Tensor Operations on Advanced Architectures for High-order Applications A Abdelfattah, M Baboulin, V Dobrev, J Dongarra, A Haidar, I Karlin, ... Technical Report. Technical Report UT-EECS-17-749, 2017 | 4 | 2017 |

Achieving high-performance with a sparse direct solver on Intel KNL E Agullo, A Buttari, M Byckling, A Guermouche, I Masliah Inria Bordeaux Sud-Ouest; CNRS-IRIT; Intel corporation; Université Bordeaux, 2017 | 3 | 2017 |

Débruitage temps réel embarqué pour vidéos fortement bruitées A Petreto, T Romera, F Lemaitre, I Masliah, B Gaillard, M Bouyer, ... COMPAS 2019, 2019 | 1 | 2019 |

Étiquetage et analyse en composantes connexes sur GPUs A Hennequin, L Lacassagne, I Masliah COMPAS, 2019 | 1 | 2019 |

Automatic code generation methods applied to numerical linear algebra in high performance computing I Masliah Université Paris Saclay (COmUE), 2016 | | 2016 |