skip to main content

W-operator learning using linear models for both gray-level and binary inputs

Montagner, Igor Dos Santos

Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Matemática e Estatística 2017-06-12

Acesso online. A biblioteca também possui exemplares impressos.

  • Título:
    W-operator learning using linear models for both gray-level and binary inputs
  • Autor: Montagner, Igor Dos Santos
  • Orientador: Hirata Junior, Roberto; Hirata, Nina Sumiko Tomita
  • Assuntos: Aprendizado De Máquina; Máquinas De Suporte Vetorial; Processamento De Imagens; Projeto Automático De W-Operadores; Image Processing; Linear Classification Methods; Machine Learning; Support Vector Machines; W-Operator Learning
  • Notas: Tese (Doutorado)
  • Descrição: Image Processing techniques can be used to solve a broad range of problems, such as medical imaging, document processing and object segmentation. Image operators are usually built by combining basic image operators and tuning their parameters. This requires both experience in Image Processing and trial-and-error to get the best combination of parameters. An alternative approach to design image operators is to estimate them from pairs of training images containing examples of the expected input and their processed versions. By restricting the learned operators to those that are translation invariant and locally defined ($W$-operators) we can apply Machine Learning techniques to estimate image transformations. The shape that defines which neighbors are used is called a window. $W$-operators trained with large windows usually overfit due to the lack sufficient of training data. This issue is even more present when training operators with gray-level inputs. Although approaches such as the two-level design, which combines multiple operators trained on smaller windows, partly mitigates these problems, they also require more complicated parameter determination to achieve good results. In this work we present techniques that increase the window sizes we can use and decrease the number of manually defined parameters in $W$-operator learning. The first one, KA, is based on Support Vector Machines and employs kernel approximations to estimate image transformations. We also present adequate kernels for processing binary and gray-level images. The second technique, NILC, automatically finds small subsets of operators that can be successfully combined using the two-level approach. Both methods achieve competitive results with methods from the literature in two different application domains. The first one is a binary document processing problem common in Optical Music Recognition, while the second is a segmentation problem in gray-level images. The same techniques were applied without modification in both domains.
  • DOI: 10.11606/T.45.2017.tde-21082017-111455
  • Editor: Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Matemática e Estatística
  • Data de criação/publicação: 2017-06-12
  • Formato: Adobe PDF
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.