

    Please use this identifier to cite or link to this item: http://nthur.lib.nthu.edu.tw/dspace/handle/987654321/86663


    Title: Improving Linear Scaling for Query by Singing/Humming (改進線性伸縮以用於哼唱選歌)
    Authors: 饒彥章
    Jao, Yen-Chang
    Description: GH02101062630
    Master's degree
    Department of Computer Science
    Date: 2014
    Keywords: music retrieval; query by singing/humming; linear scaling; golden section search; sorted error vector
    Abstract: This thesis proposes an integrated framework that improves both the efficiency and the effectiveness of a query by singing/humming (QBSH) system. The framework combines three methods. Method 1 uses golden section search to reduce the matching time of the traditional linear scaling (LS) algorithm. Method 2 assigns different weights to rests in the pitch vectors (of both database songs and queries) so that rests have less effect on the computed distance. Method 3 uses a sorted error vector to ignore the distance values that are overly large and computes the matching distance from the remaining values only. This reduces the effect of short-lived pitch deviations caused by the singer being out of tune or by pitch-tracking errors.
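The abstract does not say how the golden section search is set up, so the following is only a minimal sketch of the idea behind Method 1: instead of trying many scaling factors exhaustively, search for the factor that minimizes the LS distance. The function names, the 0.5 to 2.0 search range, the tolerance, the mean-based key transposition, and the assumption that the distance is roughly unimodal in the scaling factor are all illustrative choices, not details taken from the thesis.

```python
import math

INV_PHI = (math.sqrt(5) - 1) / 2  # 1 / golden ratio, about 0.618


def resample(pitch, length):
    """Stretch or compress a pitch vector to `length` samples (nearest index)."""
    n = len(pitch)
    if length == 1:
        return [pitch[0]]
    return [pitch[round(i * (n - 1) / (length - 1))] for i in range(length)]


def ls_distance(query, song, scale):
    """Mean absolute pitch difference after scaling the query by `scale`.

    The scaled query is compared with the start of the song; the mean pitch
    difference is removed first as a simple key transposition (an assumption).
    """
    length = max(2, min(len(song), round(len(query) * scale)))
    q = resample(query, length)
    s = song[:length]
    shift = sum(s) / length - sum(q) / length
    return sum(abs(a + shift - b) for a, b in zip(q, s)) / length


def golden_section_min(f, lo, hi, tol=0.01):
    """Locate a minimizer of f on [lo, hi], assuming f is unimodal there."""
    c = hi - INV_PHI * (hi - lo)
    d = lo + INV_PHI * (hi - lo)
    fc, fd = f(c), f(d)
    while hi - lo > tol:
        if fc < fd:
            # Minimum lies in [lo, d]; reuse c as the new upper probe.
            hi, d, fd = d, c, fc
            c = hi - INV_PHI * (hi - lo)
            fc = f(c)
        else:
            # Minimum lies in [c, hi]; reuse d as the new lower probe.
            lo, c, fc = c, d, fd
            d = lo + INV_PHI * (hi - lo)
            fd = f(d)
    return (lo + hi) / 2


def best_scale(query, song, lo=0.5, hi=2.0, tol=0.01):
    """Return (scale, distance) found by golden section search."""
    scale = golden_section_min(lambda s: ls_distance(query, song, s), lo, hi, tol)
    return scale, ls_distance(query, song, scale)
```

Each iteration needs only one new distance evaluation, which is where a saving over a dense grid of scaling factors would come from. If the distance curve is not unimodal for a given query, this search can settle on a local minimum.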
    The proposed framework improves on the baseline LS-based QBSH system in both computation time (via Method 1) and recognition accuracy (via Methods 2 and 3). In our experiments on the MIR-QBSH dataset, it achieves an error reduction rate of 21.4% and a 49.3% decrease in matching time.
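As a rough illustration of how Methods 2 and 3 could fit together, the sketch below down-weights errors that involve a rest, sorts the per-frame errors, and averages only the smallest ones. The encoding of a rest as pitch 0, the weight of 0.5, and the 10% drop ratio are assumptions made for this example; the abstract does not give the actual values or the exact weighting scheme.

```python
def robust_distance(query, song, rest_weight=0.5, drop_ratio=0.1, rest=0):
    """Rest-weighted, trimmed mean absolute pitch error.

    query, song : equal-length pitch vectors (already scaled and transposed)
    rest_weight : weight for frames where either side is a rest (assumed)
    drop_ratio  : fraction of the largest errors to ignore (assumed)
    rest        : value that marks a rest in the pitch vectors (assumed)
    """
    if not query or not song:
        raise ValueError("pitch vectors must be non-empty")

    errors = []
    for q, s in zip(query, song):
        weight = rest_weight if (q == rest or s == rest) else 1.0
        errors.append(weight * abs(q - s))

    # Sorted error vector: ascending order puts the outliers at the end.
    errors.sort()
    keep = max(1, len(errors) - int(len(errors) * drop_ratio))
    kept = errors[:keep]
    return sum(kept) / len(kept)
```

With a ten-frame query in which one frame is badly off, a drop ratio of 0.1 discards exactly that frame, so a brief pitch-tracking glitch no longer dominates the distance.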
    URI: http://nthur.lib.nthu.edu.tw/dspace/handle/987654321/86663
    Source: http://thesis.nthu.edu.tw/cgi-bin/gs/hugsweb.cgi?o=dnthucdr&i=sGH02101062630.id
    Appears in Collections: [Department of Computer Science] Theses and Dissertations

    Files in This Item:

    File                 Size    Format
    GH02101062630.pdf    78Kb    Adobe PDF


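    All items in NTHUR are protected by copyright and are provided for academic research and educational use only; please respect the rights of the copyright holders. For commercial or for-profit use, please obtain authorization from the copyright holder first.
    If you find that content held in this repository infringes a copyright holder's rights, please notify the site administrator (smluo@lib.nthu.edu.tw), who will promptly take remedial measures such as removing the content.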
