If you want to work in data science in finance, there are a few things you should probably know: firstly, that advantages are increasingly conferred in real time by huge unstructured data sets derived from social media, and secondly that while the building blocks of data strategy (data storage and administration) are increasingly, commoditized there are still interesting roles for data scientists at the top of the "data stack."
"Data engineering teams are really shifting their focus from low level database storage and administration and are instead focusing much more on the high value-add part of the data chain," said Tom Taylor, head of alpha technology at investment business Man Numeric (part of Man Group), speaking at this week's AI & Data Science in Trading conference.
Alternative data like credit card transactions and brand sentiment tracking has now become the norm in finance, said Taylor, and there's so much data around that to be successful funds don't just need a team of researchers analyzing its meaning, but an "industrial scale data onboarding and data science capability."
Some funds, like Two Sigma, have outsourced this process of cleaning data so that it can be onboarded into their systems easily, and companies like Crux Informatics (used by Two Sigma) are emerging as specialists in so-called 'data wrangling' - ingesting, cleaning and structuring data sets.
Important as it is, however, data wrangling isn't where the most appealing data jobs are. If you want to work in some of the most interesting and highest value-adding data jobs in finance, Taylor suggests you should position yourself towards the top of the "data stack."
At Man Numeric, Taylor said the data stack looks like this. The highest value positions are at the top, the lowest are at the bottom.
'Data stack,' Man Numeric:
Source: Man Numeric
While you might be able to get a data job in finance if you're an expert in SQL, kafka or kubernetes, therefore, you won't get the best data jobs just by knowing about data storage and computation packages. Nor will knowing about machine learning or open source Python libraries (in the front office Man is a Python house) be the deciding factors. - The best data science jobs on the buy-side now go to people who can make data available to the rest of the organization, said Taylor.
The future is about increasing the "velocity of data" in firms, and this means enabling "self-service data," he said. While data ingestion and storage are important, they're at the commoditized end of the stack. The real focus is now on allowing, "users from across the organization to manipulate and analyze data."
This means that the new most valuable data scientists are those who can build custom dashboards or visualizations that allow colleagues to access data directly. - Data science teams have an "empowering and enabling role." It's not just data science teams who will do data science. Taylor said that data fluency across organizations is increasing, both as existing staff add to their skills and as new data-confident graduates are hired.
If you want a long term, remunerative career in data science in finance, you therefore need to be pitching yourself at the top of the chart above. "If you can build self-service tools, you will be ready for the next few years," Taylor said.
Have a confidential story, tip, or comment you’d like to share? Contact: firstname.lastname@example.org in the first instance. Whatsapp/Signal/Telegram also available. Bear with us if you leave a comment at the bottom of this article: all our comments are moderated by human beings. Sometimes these humans might be asleep, or away from their desks, so it may take a while for your comment to appear. Eventually it will – unless it’s offensive or libelous (in which case it won’t.)
Photo by Nicolás Pinilla on Unsplash