Contact:
- email:
- szyzapor@pg.edu.pl
Positions held:
Assistant
- workplace:
- Katedra Systemów Multimedialnych
Building A, Wydział Elektroniki, Telekomunikacji i Informatyki, EA 726
IT specialist
- workplace:
- Katedra Systemów Multimedialnych
Building A, Wydział Elektroniki, Telekomunikacji i Informatyki, EA 726
- phone:
- 58-348-63-32

Publications:
-
Publication
- D. Kobiela
- M. Hajdasz
- M. Erezman
- K. Nurzyńska
- S. Zaporowski
- A. Kurowski
- P. Weichbroth
- Year 2025
Identifying different vehicle types can help manage traffic more efficiently, reduce congestion, and improve public safety. This study aims to create a classification model that can recognize vehicle types based on the sound of passing vehicles. To achieve this, a database of raw audio files containing 1763 samples from two sources was assembled. The time-domain signals were converted to a time-frequency representation using the...
Full text available for download in the portal
-
Publication
- Year 2024
This article presents a case study on the development of a biometric voice verification system for an intercom solution, utilizing the DeepSpeaker neural network architecture. Despite the variety of solutions available in the literature, there is a noted lack of evaluations for "text-independent" systems under real conditions and with varying distances between the speaker and the microphone. This article aims to bridge this gap....
Full text available for download in the portal
-
Publication
- Year 2024
The article presents preliminary experiments investigating the impact of accent on the performance of the Whisper automatic speech recognition (ASR) system, specifically for the Polish language and medical data. The literature review revealed a scarcity of studies on the influence of accents on speech recognition systems in Polish, especially concerning medical terminology. The experiments involved voice cloning of selected individuals...
Full text available for download in the portal
-
Publication
Emotion recognition is a crucial aspect of human communication, with applications in fields such as psychology, education, and healthcare. Identifying emotions accurately is challenging, as people use a variety of signals to express and perceive emotions. In this study, we address the problem of multimodal emotion recognition using both audio and video signals, to develop a robust and reliable system that can recognize emotions...
Full text available for download in the portal
-
Publication
- Electronics - Year 2023
The vulnerability of the speaker identity verification system to attacks using voice cloning was examined. The research project assumed creating a model for verifying the speaker’s identity based on voice biometrics and then testing its resistance to potential attacks using voice cloning. The Deep Speaker Neural Speaker Embedding System was trained, and the Real-Time Voice Cloning system was employed based on the SV2TTS, Tacotron,...
Full text available for download in the portal
Projects:
-
Project
Project manager: prof. dr hab. inż. Andrzej Czyżewski
Funding programme: INFOSTRATEG
Project carried out at Katedra Systemów Multimedialnych under agreement INFOSTRATEG4/0003/2022 dated 2023-05-04
-
Project
Project manager: dr hab. inż. Piotr Szczuko
Funding programme: Program Operacyjny Inteligentny Rozwój
Project carried out at Katedra Systemów Multimedialnych under agreement POIR.04.01.04-00-0075/19 dated 2019-09-24