透過您的圖書館登入
IP:3.12.36.30
  • 學位論文

利用機率模型與機器學習,向量表示詞語並預測股價方向

Using Probability Models and Machine Learning to Represent Words in the Prediction of Stock Price Directions

指導教授 : 呂育道
若您是本文的作者,可授權文章由華藝線上圖書館中協助推廣。

摘要


本論文主要討論以新聞標題作為依據預測股價的漲跌方向,對於以消息面來做為依據的投資策略,新聞是很重要的一個消息來源,於是我們嘗試利用自然語言處理領域中有名的Doc2vec將新聞標題以向量的方式解讀,一個以神經網路與機率作為基本架構的模型來表示新聞標題,再使用經典機器學習模型預測特定股價的漲跌,預測準確率最高可達70%,以期望用於輔助投資策略的決策。

關鍵字

漲跌預測 股價 自然語言 機器學習 新聞

並列摘要


This thesis mainly discusses how to use news headlines as features to predict stock price directions. It is well-known that some investors believe in news analysis strategy, which is highly depending on the news. Thus we will use the popular method Doc2vec, which uses vectors to represent words based on neural networks and probability, to represent each headline as a vector. Then we use the classical machine learning model to predict individual stock price directions. Our method best perform 70% accuracy for stock price directions prediction and expect to help investors making the right strategy.

參考文獻


Bollen, J., Mao, H., & Zeng, X. (2011). Twitter mood predicts the stock market. Journal of Computational Science, 2(1), 1–8.
Chang, C. C., & Lin, C. J. (2011). LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology (TIST), 2(3)1–27.
https://github.com/ldkrsi/jieba-zh_TW
Radinsky, K., Davidovich, S., & Markovitch, S. (2012, April). Learning causality for news events prediction. In Proceedings of the 21st International Conference on World Wide Web, 909–918. Association for Computing Machinery.
Le, Q., & Mikolov, T. (2014, January). Distributed representations of sentences and documents. In Proceedings of the International Conference on Machine Learning, 1188–1196.

延伸閱讀