Advertisement
Help Keep Boards Alive. Support us by going ad free today. See here: https://subscriptions.boards.ie/.
If we do not hit our goal we will be forced to close the site.

Current status: https://keepboardsalive.com/

Annual subs are best for most impact. If you are still undecided on going Ad Free - you can also donate using the Paypal Donate option. All contribution helps. Thank you.
https://www.boards.ie/group/1878-subscribers-forum

Private Group for paid up members of Boards.ie. Join the club.

Data Mining and Statistical Analysis - Bayes' Theorom

  • 01-11-2019 12:45AM
    #1
    Moderators, Computer Games Moderators, Technology & Internet Moderators Posts: 19,242 Mod ✭✭✭✭


    Catchy thread title, but I'm just wondering if anyone else has ever created an application or script to gather data and then conduct a statistical analysis? I'm using Bayes' theorem, which is quite an easy or simple concept to grasp.

    I'm developing a script in Java to gather some horse racing data, such as odds, previous odds and other historical data points. The longest part of my little project was finding somewhere I could source the data, as some book makers have a tendency to shied this data from such endeavours.

    Once I determine the relevant data points, I can then conduct the appropriate analysis, then spit the results out in to a csv for clarity.

    Anyone else every do anything similar, even as an exercise in keeping skills fresh or learning new skills?


Comments

  • Closed Accounts Posts: 22,461 ✭✭✭✭beauf


    I've got into the habit of analyzing problems by looking for patterns in data.
    Then throwing it into some sort of visualization to see patterns.


  • Moderators, Computer Games Moderators Posts: 4,282 Mod ✭✭✭✭deconduo


    Would generally recommend python for statistical analysis and research. It has a huge wealth of libraries, and jupyter notebooks are great for this kind of thing.

    There's a pretty big community with lots of blogs and content out there too.


  • Registered Users, Registered Users 2 Posts: 1,847 ✭✭✭rock22


    You might look here for relevant courses.

    I did a course on Getting started on data science course.
    We used, (were introduced), to "R" which could be downloaded free of charge here .


  • Moderators, Computer Games Moderators, Technology & Internet Moderators Posts: 19,242 Mod ✭✭✭✭L.Jenkins


    deconduo wrote: »
    Would generally recommend python for statistical analysis and research. It has a huge wealth of libraries, and jupyter notebooks are great for this kind of thing.

    There's a pretty big community with lots of blogs and content out there too.

    Thanks for the info. A little bit of over kill at the moment for what I want to do. :D:D


  • Registered Users, Registered Users 2 Posts: 7,080 ✭✭✭Talisman


    It sounds like you're describing my college project that's due in 5/6 weeks! We're using R which is pretty trivial to code and generates nice graphs. For me sourcing and scrubing the data is where the work lies.


  • Advertisement
  • Moderators, Computer Games Moderators, Technology & Internet Moderators Posts: 19,242 Mod ✭✭✭✭L.Jenkins


    I see R popping up an awful, so I might have a craic at it.


  • Registered Users, Registered Users 2 Posts: 45 ShaneODub


    L.Jenkins wrote: »
    I see R popping up an awful, so I might have a craic at it.

    I used the rvest library in R the other day to scrape data tables from the web, with a bit of googling and trouble-shooting.


Advertisement