With my current university project, I would like to use the tools that I have been working hard on for the past couple of months. I am going to use the experience to improve my skills and confidence in programming (as well as in my toolkit) by tackling tough problems.
As of now, the project seems to be heading in the direction of text processing, data fetching, statistics to analyse all that information and, ultimately, visualization: finding a meaningful way to present the data.
It's for that end part of the project, visualization, that I want to go through it all. Remember my post about Data Artists and Visualization? Yeah, I get to experiment with it all on my own.
However, I am aware that the time I invest in this project will come at the price of other projects that I wish to complete this year. But all in all, I think this is a brilliant opportunity to push my boundaries.
What do I seek to learn from the project?
I don't think I will get the opportunity to work directly with SageMath (it just had a major release btw, check it out ASAP), but I will definitely be writing a lot more Python scripts in Anaconda.
I don't know exactly which libraries I will be using for the project, but so far these are the ones that seem highly relevant -
* Pandas
* NumPy
* Requests
* Beautiful Soup
* NLTK (Natural Language Toolkit)
* lxml
* Bokeh
* Matplotlib
Apart from these, I have no idea what other stuff I might have to look into.
I doubt very much that I will get the opportunity to use Emacs or other programming languages in any way, but I suppose I am going to learn to handle real-world programming problems and get to be creative with the way I choose to visualize them.
Through the project, I aim to learn text processing, i.e. how to use the text processing tools in the Python standard library, including regular expressions, along with NLTK, which should be immensely helpful in working with human languages. The project will involve analysing the sentiment of text downloaded through various web-site data clients. No idea how to do that as of now btw ;P Will learn, though.
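To give a rough idea of the text processing side, here is a minimal sketch that uses only the standard library (`re` and `collections.Counter`). It is not how NLTK does sentiment analysis; the tiny word lists and the names `tokenize` and `sentiment_score` are made up purely for illustration, and a real project would use a proper lexicon or a trained model.

```python
import re
from collections import Counter

# Hypothetical toy lexicon, for illustration only.
# Real sentiment lexicons are far larger and more nuanced.
POSITIVE = {"good", "great", "brilliant", "love"}
NEGATIVE = {"bad", "glitchy", "awful", "hate"}

# A "word" here is a run of lowercase letters and apostrophes.
WORD_RE = re.compile(r"[a-z']+")


def tokenize(text):
    """Lowercase the text and return its words as a list."""
    return WORD_RE.findall(text.lower())


def sentiment_score(text):
    """Count positive words minus negative words in the text."""
    counts = Counter(tokenize(text))
    positive = sum(counts[word] for word in POSITIVE)
    negative = sum(counts[word] for word in NEGATIVE)
    return positive - negative


if __name__ == "__main__":
    samples = [
        "This is a brilliant opportunity, I love it!",
        "Chrome is so glitchy, it's awful.",
    ]
    for sample in samples:
        print(sentiment_score(sample), sample)
```

A score above zero leans positive and one below zero leans negative. Counting words like this ignores negation ("not good") and context, which is exactly the kind of thing I expect NLTK to help with.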
I also get to learn how to connect to web-sites, something I have been exploring recently, and how to convert the fetched data into a form that the other libraries in the project accept. Ingenious solutions will be necessary, so it's a good challenge.
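As a small sketch of what "changing the data into an accepted form" could look like, the snippet below turns a piece of HTML into a list of plain dictionaries, one per link. I am assuming a hard-coded sample page here (in the real project the HTML would come from something like `requests.get(url).text`), and I am using the standard library's `html.parser` just to keep the example self-contained; Beautiful Soup or lxml would probably do this in fewer lines. `LinkCollector`, `extract_links` and `SAMPLE_HTML` are names I made up for the example.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect every <a href="..."> as a {"text": ..., "url": ...} record."""

    def __init__(self):
        super().__init__()
        self.records = []
        self._href = None   # href of the link currently being read, if any
        self._text = []     # text fragments seen inside that link

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            text = "".join(self._text).strip()
            self.records.append({"text": text, "url": self._href})
            self._href = None


def extract_links(html):
    """Return a list of link records found in the given HTML string."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.records


# Hypothetical sample page standing in for a downloaded one.
SAMPLE_HTML = """
<html><body>
  <p>Read <a href="https://example.com/a">the first post</a> and
  <a href="https://example.com/b">the second post</a>.</p>
</body></html>
"""

if __name__ == "__main__":
    for record in extract_links(SAMPLE_HTML):
        print(record)
```

A list of dictionaries like this is a handy in-between format: it can be passed straight to `pandas.DataFrame(records)` to get a table with `text` and `url` columns.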
The total lines of code (LOC) might be more than in any other program I have ever worked on. So I might practise a multi-file approach to building large programs. Needless to say, this is going to be great field-testing for my experience with Git, versioning the various stages of progress.
Again, this is not directly related to Maths or SageMath, but I do have a couple of ideas at the back of my head which might be quite useful if I master the art of text processing. Of course, once I do it in Python, it can also be done in Emacs, and then again on Linux once I switch to Unix systems.
So NLTK + text processing is my main focus for now.
Besides, I intend to present my work via interactive graphs - something along the lines of the videos below. Bokeh is an alternative to D3.js, which is pretty much the most famous visualization library out there. The plus point of Bokeh for me is that I get to use Python (not JavaScript, which I have no experience with); I intend to explore the entire computer world via Python first and only later move on to specific tools as necessity dictates. And since Bokeh is a web-browser-based technology, I am going to stay in touch with the Internet and how web-pages are structured.
It will take a lot of work to get all of this done, but I aim to work hard and finish it in a month!
One thing that I am sure of now - I won't be using Chrome any more, it's so glitchy. At least on my Windows 7 + i5 laptop. It might be different on Linux and on my desktop, but there is another factor that's moving me closer to Mozilla Firefox - it's Rust.
Nope, not the one from True Detective, but rather a programming language which is getting quite a lot of attention these days. I find it better than Google's Go language, so that's that for me. Anyway, I would like to leave the rest in the expert hands of Rust Cohle - Have Fun!!