mgr inż. Szymon Zaporowski | Politechnika Gdańska

mgr inż. Szymon Zaporowski

Contact:

email:
szyzapor@pg.edu.pl
website:
https://mostwiedzy.pl/szymon-zaporowski,768047-1

Positions held:

Assistant

workplace:
Department of Multimedia Systems (Katedra Systemów Multimedialnych)
Building A of the Faculty of Electronics, Telecommunications and Informatics, EA 726

IT Specialist

workplace:
Department of Multimedia Systems (Katedra Systemów Multimedialnych)
Building A of the Faculty of Electronics, Telecommunications and Informatics, EA 726
phone:
58-348-63-32

Publications:

  1. Identifying different vehicle types can help manage traffic more efficiently, reduce congestion, and improve public safety. This study aims to create a classification model that can recognize vehicle types based on the sound of passing vehicles. To achieve this, a database of raw audio files containing 1763 samples from two sources was assembled. The time-domain signals were converted to a time-frequency representation using the...

    Full text available for download on the portal

  2. This article presents a case study on the development of a biometric voice verification system for an intercom solution, utilizing the DeepSpeaker neural network architecture. Despite the variety of solutions available in the literature, there is a noted lack of evaluations for "text-independent" systems under real conditions and with varying distances between the speaker and the microphone. This article aims to bridge this gap....

    Full text available for download on the portal

  3. The article presents preliminary experiments investigating the impact of accent on the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved voice cloning of selected individuals...

    - Year: 2024

    Full text available for download on the portal

  4. Emotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...

    Full text available for download on the portal

  5. The vulnerability of the speaker identity verification system to attacks using voice cloning was examined. The research project involved creating a model for verifying the speaker’s identity based on voice biometrics and then testing its resistance to potential attacks using voice cloning. The Deep Speaker Neural Speaker Embedding System was trained, and the Real-Time Voice Cloning system was employed based on the SV2TTS, Tacotron,...

    Full text available for download on the portal

Data source: MOST Wiedzy portal

Projects: