日本欧洲视频一区_国模极品一区二区三区_国产熟女一区二区三区五月婷_亚洲AV成人精品日韩一区18p

代寫(xiě)COM6511、代做Python設(shè)計(jì)程序

時(shí)間:2024-04-30  來(lái)源:  作者: 我要糾錯(cuò)



COM4511/COM6511 Speech Technology - Practical Exercise -
Keyword Search
Anton Ragni
Note that for any module assignment full marks will only be obtained for outstanding performance that
goes well beyond the questions asked. The marks allocated for each assignment are 20%. The marks will be
assigned according to the following general criteria. For every assignment handed in:
1. Fulfilling the basic requirements (5%)
Full marks will be given to fulfilling the work as described, in source code and results given.
2. Submitting high quality documentation (5%)
Full marks will be given to a write-up that is at the highest standard of technical writing and illustration.
3. Showing good reasoning (5%) Full marks will be given if the experiments and the outcomes are explained to the best standard.
4. Going beyond what was asked (5%)
Full marks will be given for interesting ideas on how to extend work that are well motivated and
described.
1 Background
The aim of this task is to build and investigate the simplest form of a keyword search (KWS) system allowing to find information
in large volumes of spoken data. Figure below shows an example of a typical KWS system which consists of an index and
a search module. The index provides a compact representation of spoken data. Given a set of keywords, the search module
Search Results
Index
Key− words
queries the index to retrieve all possible occurrences ranked according to likelihood. The quality of a KWS is assessed based
on how accurately it can retrieve all true occurrences of keywords.
A number of index representations have been proposed and examined for KWS. Most popular representations are derived
from the output of an automatic speech recognition (ASR) system. Various forms of output have been examined. These differ
in terms of the amount of information retained regarding the content of spoken data. The simplest form is the most likely word
sequence or 1-best. Additional information such as start and end times, and recognition confidence may also be provided for
each word. Given a collection of 1-best sequences, the following index can be constructed
w1 (f1,1, s1,1, e1,1) . . . (f1,n1 , s1,n1 , e1,n1 )
w2 (f1,1, s1,1, e1,1) . . . (f1,n1 , s1,n1 , e1,n1 )

wN (fN,1, sN,1, eN,1) . . . (fN,nN , sN,nN , eN,nN )
(1)
where wi is a word, ni is the number of times word wi occurs, fi,j is a file where word wi occurs for the j-th time, si,j and ei,j
is the start and end time. Searching such index for single word keywords can be as simple as finding the correct row (e.g. k)
and returning all possible tuples (fk,1, sk,1, ek,1), . . ., (fk,nk , sk,nk , ek,nk ).
The search module is expected to retrieve all possible keyword occurrences. If ASR makes no mistakes such module
can be created rather trivially. To account for possible retrieval errors, the search module provides each potential occurrence
with a relevance score. Relevance scores reflect confidence in a given occurrence being relevant. Occurrences with extremely
low relevance scores may be eliminated. If these scores are accurate each eliminated occurrence will decrease the number of
false alarms. If not then the number of misses will increase. What exactly an extremely low score is may not be very easy
to determine. Multiple factors may affect a relevance score: confidence score, duration, word confusability, word context,
keyword length. Therefore, simple relevance scores, such as those based on confidence scores, may have a wide dynamic range
and may be incomparable across different keywords. In order to ensure that relevance scores are comparable among different
keywords they need to be calibrated. A simple calibration scheme is called sum-to-one (STO) normalisation
(2)
where ri,j is an original relevance score for the j-th occurrence of the i-th keyword, γ is a scale enabling to either sharpen or
flatten the distribution of relevance scores. More complex schemes have also been examined. Given a set of occurrences with
associated relevance scores, there are several options available for eliminating spurious occurrences. One popular approach
is thresholding. Given a global or keyword specific threshold any occurrence falling under is eliminated. Simple calibration
schemes such as STO require thresholds to be estimated on a development set and adjusted to different collection sizes. More
complex approaches such as Keyword Specific Thresholding (KST) yield a fixed threshold across different keywords and
collection sizes.
Accuracy of KWS systems can be assessed in multiple ways. Standard approaches include precision (proportion of relevant retrieved occurrences among all retrieved occurrences) and recall (proportion of relevant retrieved occurrences among all
relevant occurrences), mean average precision and term weighted value. A collection of precision and recall values computed
for different thresholds yields a precision-recall (PR) curve. The area under PR curve (AUC) provides a threshold independent summative statistics for comparing different retrieval approaches. The mean average precision (mAP) is another popular,
threshold-independent, precision based metric. Consider a KWS system returning 3 correct and 4 incorrect occurrences arranged according to relevance score as follows: ✓ , ✗ , ✗ , ✓ , ✓ , ✗ , ✗ , where ✓ stands for correct occurrence and ✗ stands
for incorrect occurrence. The average precision at each rank (from 1 to 7) is 1

7 . If the number of true correct
occurrences is 3, the mean average precision for this keyword 0.7. A collection-level mAP can be computed by averaging
keyword specific mAPs. Once a KWS system operates at a reasonable AUC or mAP level it is possible to use term weighted
value (TWV) to assess accuracy of thresholding. The TWV is defined by
 
(3)
where k ∈ K is a keyword, Pmiss and Pfa are probabilities of miss and false alarm, β is a penalty assigned to false alarms.
These probabilities can be computed by
Pmiss(k, θ) = Nmiss(k, θ)
Ncorrect(k) (4)
Pfa(k, θ) = Nfa(k, θ)
Ntrial(k) (5)
where N<event> is a number of events. The number of trials is given by
Ntrial(k) = T − Ncorrect(k) (6)
where T is the duration of speech in seconds.
2 Objective
Given a collection of 1-bests, write a code that retrieves all possible occurrences of keyword list provided. Describe the search
process including index format, handling of multi-word keywords, criterion for matching, relevance score calibration and
threshold setting methodology. Write a code to assess retrieval performance using reference transcriptions according to AUC,
mAP and TWV criteria using β = 20. Comment on the difference between these criteria including the impact of parameter β.
Start and end times of hypothesised occurrences must be within 0.5 seconds of true occurrences to be considered for matching.
2
3 Marking scheme
Two critical elements are assessed: retrieval (65%) and assessment (35%). Note: Even if you cannot complete this task as a
whole you can certainly provide a description of what you were planning to accomplish.
1. Retrieval
1.1 Index Write a code that can take provided CTM files (and any other file you deem relevant) and create indices in
your own format. For example, if Python language is used then the execution of your code may look like
python index.py dev.ctm dev.index
where dev.ctm is an CTM file and dev.index is an index.
Marks are distributed based on handling of multi-word keywords
• Efficient handling of single-word keywords
• No ability to handle multi-word keywords
• Inefficient ability to handle multi-word keywords
• Or efficient ability to handle multi-word keywords
1.2 Search Write a code that can take the provided keyword file and index file (and any other file you deem relevant)
and produce a list of occurrences for each provided keyword. For example, if Python language is used then the
execution of your code may look like
python search.py dev.index keywords dev.occ
where dev.index is an index, keywords is a list of keywords, dev.occ is a list of occurrences for each
keyword.
Marks are distributed based on handling of multi-word keywords
• Efficient handling of single-word keywords
• No ability to handle multi-word keywords
• Inefficient ability to handle multi-word keywords
• Or efficient ability to handle multi-word keywords
1.3 Description Provide a technical description of the following elements
• Index file format
• Handling multi-word keywords
• Criterion for matching keywords to possible occurrences
• Search process
• Score calibration
• Threshold setting
2. Assessment Write a code that can take the provided keyword file, the list of found keyword occurrences and the corresponding reference transcript file in STM format and compute the metrics described in the Background section. For
instance, if Python language is used then the execution of your code may look like
python <metric>.py keywords dev.occ dev.stm
where <metric> is one of precision-recall, mAP and TWV, keywords is the provided keyword file, dev.occ is the
list of found keyword occurrences and dev.stm is the reference transcript file.
Hint: In order to simplify assessment consider converting reference transcript from STM file format to CTM file format.
Using indexing and search code above obtain a list of true occurrences. The list of found keyword occurrences then can
be assessed more easily by comparing it with the list of true occurrences rather than the reference transcript file in STM
file format.
2.1 Implementation
• AUC Integrate an existing implementation of AUC computation into your code. For example, for Python
language such implementation is available in sklearn package.
• mAP Write your own implementation or integrate any freely available.
3
• TWV Write your own implementation or integrate any freely available.
2.2 Description
• AUC Plot precision-recall curve. Report AUC value . Discuss performance in the high precision and low
recall area. Discuss performance in the high recall and low precision area. Suggest which keyword search
applications might be interested in a good performance specifically in those two areas (either high precision
and low recall, or high recall and low precision).
• mAP Report mAP value. Report mAP value for each keyword length (1-word, 2-words, etc.). Compare and
discuss differences in mAP values.
• TWV Report TWV value. Report TWV value for each keyword length (1-word, 2-word, etc.). Compare and
discuss differences in TWV values. Plot TWV values for a range of threshold values. Report maximum TWV
value or MTWV. Report actual TWV value or ATWV obtained with a method used for threshold selection.
• Comparison Describe the use of AUC, mAP and TWV in the development of your KWS approach. Compare
these metrics and discuss their advantages and disadvantages.
4 Hand-in procedure
All outcomes, however complete, are to be submitted jointly in a form of a package file (zip/tar/gzip) that includes
directories for each task which contain the associated required files. Submission will be performed via MOLE.
5 Resources
Three resources are provided for this task:
• 1-best transcripts in NIST CTM file format (dev.ctm,eval.ctm). The CTM file format consists of multiple records
of the following form
<F> <H> <T> <D> <W> <C>
where <F> is an audio file name, <H> is a channel, <T> is a start time in seconds, <D> is a duration in seconds, <W> is a
word, <C> is a confidence score. Each record corresponds to one recognised word. Any blank lines or lines starting with
;; are ignored. An excerpt from a CTM file is shown below
7654 A 11.34 0.2 YES 0.5
7654 A 12.00 0.34 YOU 0.7
7654 A 13.30 0.5 CAN 0.1
• Reference transcript in NIST STM file format (dev.stm, eval.stm). The STM file format consists of multiple records
of the following form
<F> <H> <S> <T> <E> <L> <W>...<W>
where <S> is a speaker, <E> is an end time, <L> topic, <W>...<W> is a word sequence. Each record corresponds to
one manually transcribed segment of audio file. An excerpt from a STM file is shown below
2345 A 2345-a 0.10 2.03 <soap> uh huh yes i thought
2345 A 2345-b 2.10 3.04 <soap> dog walking is a very
2345 A 2345-a 3.50 4.59 <soap> yes but it’s worth it
Note that exact start and end times for each word are not available. Use uniform segmentation as an approximation. The
duration of speech in dev.stm and eval.stm is estimated to be 57474.2 and 25694.3 seconds.
• Keyword list keywords. Each keyword contains one or more words as shown below
請(qǐng)加QQ:99515681  郵箱:99515681@qq.com   WX:codinghelp










 

標(biāo)簽:

掃一掃在手機(jī)打開(kāi)當(dāng)前頁(yè)
  • 上一篇:ACS341代做、代寫(xiě)MATLAB設(shè)計(jì)程序
  • 下一篇:COMP 315代做、代寫(xiě)Java/c++編程語(yǔ)言
  • 無(wú)相關(guān)信息
    昆明生活資訊

    昆明圖文信息
    蝴蝶泉(4A)-大理旅游
    蝴蝶泉(4A)-大理旅游
    油炸竹蟲(chóng)
    油炸竹蟲(chóng)
    酸筍煮魚(yú)(雞)
    酸筍煮魚(yú)(雞)
    竹筒飯
    竹筒飯
    香茅草烤魚(yú)
    香茅草烤魚(yú)
    檸檬烤魚(yú)
    檸檬烤魚(yú)
    昆明西山國(guó)家級(jí)風(fēng)景名勝區(qū)
    昆明西山國(guó)家級(jí)風(fēng)景名勝區(qū)
    昆明旅游索道攻略
    昆明旅游索道攻略
  • 短信驗(yàn)證碼平臺(tái) 理財(cái) WPS下載

    關(guān)于我們 | 打賞支持 | 廣告服務(wù) | 聯(lián)系我們 | 網(wǎng)站地圖 | 免責(zé)聲明 | 幫助中心 | 友情鏈接 |

    Copyright © 2025 kmw.cc Inc. All Rights Reserved. 昆明網(wǎng) 版權(quán)所有
    ICP備06013414號(hào)-3 公安備 42010502001045

    日本欧洲视频一区_国模极品一区二区三区_国产熟女一区二区三区五月婷_亚洲AV成人精品日韩一区18p

              9000px;">

                        国产美女免费网站| 神马久久久久久久| av资源免费看| 中文字幕一区二区三区四区视频 | 精品成人无码一区二区三区| 97在线公开视频| 亚洲天天综合网| 在线免费看毛片| 无码人妻精品一区二区三区蜜桃91 | 色啦啦av综合| 男人天堂av网| 免费一级肉体全黄毛片| 极品尤物一区二区| 韩国av中文字幕| 国产女人18毛片水真多| 国产99久久久| 成人乱码一区二区三区| 97超视频在线观看| 69视频在线观看免费| 亚洲精品成人无码| 中文字幕在线视频第一页| 亚洲成熟少妇视频在线观看| 无码人妻精品一区二| 少妇无码一区二区三区| 日韩中文字幕观看| 日韩欧美国产片| 婷婷在线观看视频| 午夜久久久精品| 一区二区在线免费看| 中日韩在线观看视频| 在线免费看av片| 亚洲日本精品视频| 95视频在线观看| www.com.av| 国产福利小视频| 黄色工厂在线观看| 蜜臀av免费观看| 潘金莲激情呻吟欲求不满视频| 日本在线观看免费视频| 四虎国产精品免费| 亚洲AV无码一区二区三区性| 中文字幕一二三| 99久久99久久精品免费| www.激情五月| 国产又大又黑又粗免费视频| 久久久久久国产精品日本| 欧美大喷水吹潮合集在线观看| 日韩精品人妻中文字幕有码| 无码人妻丰满熟妇区五十路| 中文字幕狠狠干| jizz欧美性20| 国产在线观看中文字幕| 亚洲一区二区观看| 国产富婆一级全黄大片| 激情五月俺来也| 日本免费网站在线观看| 午夜写真片福利电影网| 亚洲日本韩国在线| 国产乱淫a∨片免费观看| 久久久香蕉视频| 日韩欧美综合视频| 亚洲国产精品自拍视频| 国产5g成人5g天天爽| 精品无码一区二区三区| 日本熟妇一区二区三区| 中文字幕在线观看成人| av丝袜天堂网| 久久久久久久久久毛片| 色wwwwww| 99国产精品欲| 久久精品欧美一区二区| 无码人妻精品一区二区三区9厂 | 婷婷丁香花五月天| 97人妻精品一区二区三区动漫 | 销魂美女一区二区| www.黄色av| 免费一级特黄特色大片| 在线观看免费av网址| 国产福利视频网站| 日韩国产亚洲欧美| 91嫩草丨国产丨精品| 精品人妻无码一区二区三区蜜桃一 | 欧美日韩a v| 中文字幕一区二区人妻痴汉电车| 国产97免费视频| 日韩欧美中文在线视频| 91麻豆视频在线观看| 久久久久久久9999| 最新中文字幕一区| 精品人妻一区二区三区日产乱码| 性色av无码久久一区二区三区| 超碰在线观看99| 人妻无码中文字幕| av无码av天天av天天爽| 日本黄色www| av小说天堂网| 日本亚洲色大成网站www久久| 亚洲视屏在线观看| 欧美国产在线一区| 99久久亚洲精品日本无码| 欧美熟女一区二区| 91精品一区二区三区蜜桃| 欧美特级一级片| 99久久国产精| 天天综合天天综合| 国精产品久拍自产在线网站| 一区二区三区在线观看av| 国产美女久久久久久| 在线观看日本www| 久久久久久久亚洲| av免费观看网址| 日韩影院一区二区| 国产三级av在线播放| 中文字幕xxxx| 欧美一级性视频| 国产高清免费在线观看| 天天做天天干天天操| 黄色一级视频免费| 亚洲特级黄色片| 色哟哟网站在线观看| 国产人妖一区二区| 亚洲精品视频91| 日韩Av无码精品| 国产精品一区二区小说| 亚洲黄色精品视频| 日韩在线视频免费播放| 精品国产乱码久久久久久1区二区 精品国产鲁一鲁一区二区三区 | 成人网站免费观看| 中文字幕第69页| 人妻少妇精品无码专区 | 国产性xxxx高清| 亚洲熟女毛茸茸| 午夜免费看毛片| 欧美三级韩国三级日本三斤在线观看 | 国产激情第一页| 亚洲天堂网在线视频| 天天爽天天爽天天爽| 久久一区二区电影| 国产精品热久久| 一级欧美一级日韩| 在线观看中文字幕av| 日韩av片免费观看| 蜜桃色一区二区三区| 国产美女免费无遮挡| 99九九99九九九99九他书对| 亚洲av无码成人精品区| 日韩乱码人妻无码中文字幕| 久草视频在线免费| 国产精品探花在线播放| 91激情视频在线| 中文字幕五月天| 午夜在线观看一区| 日韩字幕在线观看| 日本黄色激情视频| 欧美成人福利在线观看| 精品国产人妻一区二区三区| 国产精品国产三级国产专区52 | 男女男精品视频站| 精品肉丝脚一区二区三区| 国产精品久久久久久久免费看 | 一级特级黄色片| 中文字幕av网址| 在线观看av大片| 午夜视频在线网站| 天天干视频在线观看| 日韩激情小视频| 色综合天天色综合| 日韩欧美一级大片| 日本一区二区免费视频| 欧美一区免费看| 日韩av在线播| 日本一区二区三区在线免费观看| 欧美丰满少妇人妻精品| 欧美日韩人妻精品一区二区三区 | 中文字幕a在线观看| 伊人色在线观看| 中文字幕在线一| 亚洲日本在线播放| 97免费观看视频| 成年人网站免费在线观看| h色网站在线观看| 国产黄色一区二区三区| 国产毛片久久久久久久| 国产伦精品一区二区三区妓女下载| 国产5g成人5g天天爽| 国产毛片毛片毛片毛片毛片毛片 | 一区二区三区少妇| 亚洲精品97久久中文字幕无码| 亚洲国产精品欧美久久| 亚洲天堂视频一区| 波多野结衣一区二区三区四区| 国产91久久久| 精品人妻一区二区三区免费看| 毛片网站免费观看| 日韩 国产 一区| 亚洲第一精品网站| 中文字幕在线观看第三页| 亚洲天堂成人av| 国产黄色高清视频| 久久精品国产99久久99久久久|