應用個人化模式於偵測信用卡盜刷之研究

近年來，隨著社會風氣的改變，消費大眾的行為也日益不同，以現金交易的付款方式，逐漸地被信用卡所取代，可見信用卡的重要性日漸提高。然而，隨著這些改變，伴隨而來的卻是層出不窮的信用卡盜刷問題。在過去有關信用卡盜刷偵測的研究方法中，大都是利用他人大量的歷史消費資料來建立模型，去偵測某位使用者的盜刷情況。本研究則採用了一種有別於過去的個人化方法(Personalized Approach)來偵測盜刷。此法可在消費者擁有少數的真實交易資料時，或是在消費者尚未申請信用卡前，即可建立個人化模型來預防盜刷。個人化方法雖然提供了一種不錯的解決方案，但仍有許多問題亟待解決。舉例如下：(1)在蒐集消費資料時，消費者多半不願花太多時間去填答問項，導致蒐集之個人消費資料量過少。(2)由於動態的消費者行為或是消費者可能不願用心填答而產生資料矛盾的情形。因此，本研究的主要重點在探討矛盾情況對預測準確率的影響程度，且在有限的交易資料下，觀察資料分佈的情況對預測準確率的影響，並設法提高盜刷的預測準確率。另外本研究也利用支持向量機、倒傳遞網路、以及二元支持向量系統來建立一個有效的信用卡盜刷偵測模型。研究成果顯示，支持向量機與倒傳遞類神經均可得到不錯的訓練結果，但在預測未來資料上，較高的自我訓練結果反而有較差的預測未來能力。除此之外，本研究也運用許多技巧來提高預測的結果，如過量取樣(Oversampling)、階層式(Hierarchical)SVM、投票多數法(Majority Voting)…等。結果顯示，上述幾種方法均可達成高的異常偵測率。另外，在工具的比較上，三種工具得到的結果差異不大，但BSVS是最簡易操作的工具。

關鍵字

信用卡盜刷；個人化；支持向量機；倒傳遞網路；二元支持向量系統

並列摘要

Credit cards are a popular tool for transactions in many countries lately. However, credit card frauds have occurred frequently. How to detect credit card frauds, therefore, has become a key issue in recent years. Many previous studies proposed models which were constructed from the past real transaction data of many others to detect new transactions of a certain individual. In contrast to those traditional approaches, this study employs a personalized approach to solve the problem of credit card fraud. The personalized approach proposes to prevent fraud before the consumer uses a credit card or when the collected data are few. This new approach is promising. However, there are still some problems which have to be solved. For examples, 1) consumers are not willing to spend too much time to answer questions so that the collected data are few, 2) the dynamic consumer behavior may cause data overlapping. To improve the problems mentioned above, this research employs the personalized approach to address the credit card fraud problem. The main purpose of this study is to investigate the influences of data distribution on the prediction accuracy. Support vector machine (SVM), back propagation network (BPN), and binary support vector system (BSVS) are used to construct detection models for credit card fraud. The experimental results show that SVM and BPN can obtain good training results. However, both techniques fail to predict future data accurately for those cases with high training results. Besides, the classification results of these three classifiers are comparable. Compared to the other two techniques, BSVS is the easiest tool to use. This study also employs several techniques, such as hierarchical SVM, majority voting, and over-sampling, to improve true negative rates. Results from the experiments indicate that these techniques can increase true negative rates effectively.

並列關鍵字

credit card fraud ； personalized approach ； support vector machine ； back propagation network

參考文獻

陳來成（民90）。應用資料探勘技術建立商業預測模型-以信用卡為例。私立元智大學資訊管理研究所碩士論文。桃園縣。

Chen, T. S., Chen, R.C., Tsai, T.H., Li, S. Y., Liang, X., & Lin, C. C. (2005, October). Classification of Microarray Gene Expression Data Using a New Binary Support Vector System. Paper presented at the International Conference on Neural Networks and Brain , Beijing, China.

Chan, P. K., Fan, W., Prodromidis, A. L., & Stolfo, S. J. (1999). Distributed data mining in credit card fraud detection. IEEE Intelligence Systems, 14, 67-74.

Chen, R. C., Chen, J., Chen, T. S., Hsieh, C. H., Chen, T. Y., & Wu, K. Y. (2005). Building an intrusion detection system based on support vector machine and genetic algorithm. Lecture Notes in Computer Science (LNCS), 3498, 409-414.

Chen, R. C., Chen, T. S., & Lin, C. C. (2006). A new binary support vector system for increasing detection rate of credit card fraud. International Journal of Pattern Recognition and Artificial Intelligence, 20(2), 227-239.

國際替代計量

應用個人化模式於偵測信用卡盜刷之研究

未授權

主題瀏覽