to be honest...
starting from the beginning and doing this in C++ will be a pain...
What you'll be doing is writing a script
connect to page, get data, process data, record data.
so use a scripting language!
PHP is good for this as there is a build in method for opening a web page as if it were a regular file, so you just open a web address and read the raw HTML as if it were text from a file. the code style and syntax is very close to C so you should be comfortable.
and there are again built in and easy methods for storing data, either to text files, posted to a different website, or pushed into a database, or emailed to you.
if you go the C route you'll need to create a network interface, then make sure if opens the correct port, then you'll need to work on your OSI layer seven code, to make sure it asks for an accepts HTML in a proper and standard way. then yu;re going to need to work with database connectors etc.
I disagree with the idea that if there is not an API that you should just find a different site, ideally, yes an API is grand.
but when you're talking about downloading and transforming plain text or XML from a site, it's not a huge bother to use search and trim commands to filter out the bulk of the HTML that's useless to your program leaving you only with the text data that you want.
additionally, if there is no API it's going to be a lot easier to change a plain text script in the future than it will be to get the code from a program, and recompile.