Idioma:

W-operator learning using linear models for both gray-level and binary inputs

Montagner, Igor Dos Santos

Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Matemática e Estatística 2017-06-12

Acesso online. A biblioteca também possui exemplares impressos.

Enviar para

Título:
W-operator learning using linear models for both gray-level and binary inputs
Autor: Montagner, Igor Dos Santos
Orientador: Hirata Junior, Roberto; Hirata, Nina Sumiko Tomita
Assuntos: Aprendizado De Máquina; Máquinas De Suporte Vetorial; Processamento De Imagens; Projeto Automático De W-Operadores; Image Processing; Linear Classification Methods; Machine Learning; Support Vector Machines; W-Operator Learning
Notas: Tese (Doutorado)
Descrição: Image Processing techniques can be used to solve a broad range of problems, such as medical imaging, document processing and object segmentation. Image operators are usually built by combining basic image operators and tuning their parameters. This requires both experience in Image Processing and trial-and-error to get the best combination of parameters. An alternative approach to design image operators is to estimate them from pairs of training images containing examples of the expected input and their processed versions. By restricting the learned operators to those that are translation invariant and locally defined ($W$-operators) we can apply Machine Learning techniques to estimate image transformations. The shape that defines which neighbors are used is called a window. $W$-operators trained with large windows usually overfit due to the lack sufficient of training data. This issue is even more present when training operators with gray-level inputs. Although approaches such as the two-level design, which combines multiple operators trained on smaller windows, partly mitigates these problems, they also require more complicated parameter determination to achieve good results. In this work we present techniques that increase the window sizes we can use and decrease the number of manually defined parameters in $W$-operator learning. The first one, KA, is based on Support Vector Machines and employs kernel approximations to estimate image transformations. We also present adequate kernels for processing binary and gray-level images. The second technique, NILC, automatically finds small subsets of operators that can be successfully combined using the two-level approach. Both methods achieve competitive results with methods from the literature in two different application domains. The first one is a binary document processing problem common in Optical Music Recognition, while the second is a segmentation problem in gray-level images. The same techniques were applied without modification in both domains.
DOI: 10.11606/T.45.2017.tde-21082017-111455
Editor: Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Matemática e Estatística
Data de criação/publicação: 2017-06-12
Formato: Adobe PDF
Idioma: Inglês

Links

Voltar para lista de resultados

Realização: Logos de Redes Sociais:

W-operator learning using linear models for both gray-level and binary inputs

Montagner, Igor Dos Santos

Biblioteca Digital de Teses e Dissertações da USP; Universidade de São Paulo; Instituto de Matemática e Estatística 2017-06-12

Buscando em bases de dados remotas. Favor aguardar.