Page 109 - profile-ok

P. 109

研究人員｜ Research Faculty

DEGREES RECEIVED: ACADEMIC HONORS AND AWARDS:
葉彌妍 Mi-Yen Yeh ● Ph.D., EE, National Taiwan University, Taiwan (2002/09 - 2009/01) Distinguished Postdoctoral Fellowship, Academia Sinica, Taiwan
● B.S., EE, National Taiwan University, Taiwan (1998/09 - 2002/06) (2009)

PROFESSIONAL EXPERIENCE:
助研究員 Assistant Research Fellow ● Postdoctoral Fellow, CITI, Academia Sinica, Taiwan (2009/02 -
Ph.D., Electrical Engineering, National Taiwan University 2009/09)
● Visiting Scientist, IBM T. J. Watson Research Center, NY, USA (2007/09
Tel: +886-2-2788-3799 ext. 1412 - 2008/07)
Fax: +886-2-2782-4814
Email: miyen@iis.sinica.edu.tw
http://www.iis.sinica.edu.tw/pages/miyen

代表著作 Publications

1. Bi-Ru Dai, Jen-Wei Huang, Mi-Yen Yeh, and Ming-Syan Chen,
”Clustering on Demand for Multiple Data Streams”, In Proceedings
研究簡介 Research Description of the IEEE 4th International Conference on Data Mining (ICDM),
November 1-4, 2004.
我的研究興趣主要是在資料探勘和資料庫方面的相關議 My research interests lie in the area of data mining and databases. Particularly, I have 2. Mi-Yen Yeh, Bi-Ru Dai and Ming-Syan Chen, “COMET: Event-Driv-
題。過去幾年的研究主要是針對多重數值資料流環境， been focusing on topics in a multiple-data-stream environment, including stream en Clustering over Multiple Evolving Streams,” In Proceedings of the
10th Pacific-Asia Conf. on Knowledge Discovery and Data Mining
探討資料摘要、資料叢集、以及相似性查詢處理。資料 summarization, clustering, and similarity query processing. The most important (PAKDD), April 9-12, 2006.
流的特性是持續收集、快速匯入、且大量累積，該如何 characteristics of data streams are continuously arriving, of huge volumes, and with 3. Bi-Ru Dai, Jen-Wei Huang, Mi-Yen Yeh, and Ming-Syan Chen,
設計即時、有限次數檢視、以及限量資料儲存的演算法 fast coming speeds. To accommodate these properties of data streams with limited “Adaptive Clustering for Multiple Evolving Streams,” IEEE Transac-
便成為重要的議題。 computing resources and bounded storage, it is important to design approximate tions on Knowledge and Data Engineering, Vol. 18, No. 9, pp. 1166-
but real-time algorithms. 1180, September 2006.
另外，在多數應用中，資料流的取得常是獨立且分佈在
不同的位置。對於相似性查詢，若先將所有資料集中再 In addition, I am interested in similarity query processing among distributed 4. Mi-Yen Yeh, Bi-Ru Dai, and Ming-Syan Chen, “Clustering over Mul-
tiple Evolving Streams by Events and Correlations,” IEEE Transac-
處理並非一個有效率的方法。因此，探討如何直接在分 streams. In many real-world applications, data streams are collected independently tions on Knowledge and Data Engineering, Vol. 19, No. 10, pp. 1349-
散式環境底下，有效地節省頻寬使用來傳送資料流摘要 in a decentralized manner. For example, streams of climate measurements such as 1362, October 2007.
以獲得查詢的答案，是另一個值得研究的問題。 temperatures are collected from observation stations located over a wide area. It 5. Mi-Yen Yeh, Kun-Lung Wu, Philip S. Yu, and Ming-Syan Chen,
is inefficient to gather all of the distributed streams to a central site before doing “LEEWAVE: Level-Wise Distribution of Wavelet Coefficients for
最後，如何處理含有不確定性的資料，亦是一個重要研 any query processing especially when the available network bandwidth is limited. Processing kNN Queries over Distributed Streams,” In Proceedings of
究議題。受到儀器誤差影響，或是為保護隱私而受到人 Hence, there is a need to develop a bandwidth-efficient approach to processing the 34th International Conference on Very Large Data Bases (VLDB),
為刻意模糊，我們所獲得的資料並非就是所知的定值， queries among distributed streams. August 24-30, 2008.
而應該是一個可能值的機率分佈。由於資料的不確定 6. Mi-Yen Yeh, Kun-Lung Wu, Philip S. Yu, and Ming-Syan Chen,
性，傳統的資料模型和相似度定義將不再適用。因此需 Last but not least, how to do similarity query processing on data with uncertainty “PROUD: A Probabilistic Approach to Processing Similarity Queries
要藉由機率和統計理論輔助，建立不確定性物體之間距 is also a topic that I concern about. Uncertainty of data comes from various sources over Uncertain Data Streams,” In Proceedings of the 12th Internation-
離的模型，定義新的資料相似性查詢模式並設計對應的 such as noise interference of equipments or man-made data perturbation for pri- al Conference on Extending Database Technology (EDBT), March
23-26, 2009.
處理方法。 vacy preservation. In contrast to data with certainty, the data are now regarded as
random variables with probability distributions. Hence, new probabilistic models 7. Su-Chen Lin, Mi-Yen Yeh, and Ming-Syan Chen, “Subsequence
and methods for similarity queries among uncertain data should be designed. Matching of Stream Synopses under the Time Warping Distance,” In
Proceedings of the Pacific-Asia Conference on Knowledge Discovery
and Data Mining (PAKDD), June 21-24, 2010.

108 109

104 105 106 107 108 109 110 111 112 113 114