Artem Ryzhikov
Verified Expert in Engineering
Machine Learning Developer
Artem拥有机器学习(ML)博士学位,在数据结构方面有7年的经验,在ML研究方面有6年的经验, two years of working at tech startups, and four years of team management. As a senior ML engineer, 他为一款手机应用建立了一个CV算法,这款应用在一周内以1个应用排名app Store第二.5 million downloads. Artem has also worked on various other projects, including recommendation and search systems, text processing, and conventional data science.
Portfolio
Experience
Availability
Preferred Environment
Linux, Vim Text Editor, Jupyter Notebook, Git
The most amazing...
...我开发的项目是一款手机应用,它在一周内成为app Store第二好的手机应用. It was sold to Snapchat along with a team.
Work Experience
Research Fellow
National Research University Higher School of Economics
- 学习如何进行异常检测中的机器学习(ML)研究, time series, and domain adaptation algorithms. Designed and implemented new algorithms. Compared them with the existing methods.
- Wrote scientific articles about my new ML algorithms. 在顶级机器学习期刊上发表了六篇影响机器学习的文章,并成为大型强子对撞机(LHCb)合作项目的成员.
- Conducted ML lectures and seminars. Co-authored several ML, deep learning (DL), 以及Coursera和大学的生成对抗网络(GAN)课程. 通过向广大听众发表公开演讲,培养了沟通技巧.
- 设计并实现了一种新的模型无关异常增强技术,使表格和图像数据的ROC AUC增强达到0.08 higher than state-of-the-art methods in four out of six datasets. It was published in a top ML journal.
- 设计了一种新的表格和图像数据的异常检测算法,使ROC AUC boost达到0.1 higher than state-of-the-art methods in five out of six datasets. It was published in the Journal of Machine Learning Research.
- 设计并实现了一种新的多变量数据深度学习变化点检测算法. 在6个基准测试中,该算法将变化点得分提高到现有最先进模型的8倍.
- Applied Bayesian sparsification of classification models, making them 16 times faster with no quality decrease. Deployed the C++ model to the LHCb pipeline. The paper was published in conference proceedings.
- 设计并实现了一种领域自适应技术,在合成数据上有效拟合深度学习模型,避免过拟合. The results were published in conference proceedings.
- 设计并实现了一种领域自适应技术,用于在训练数据集中存在的一小部分领域上有效地拟合深度学习模型. The results were published in conference proceedings.
- Managed three research projects with up to six people in a team. 研究内容包括设计一种基于bert的半监督主题建模算法.
Senior Machine Learning Engineer
Snapchat
- 设计并测试了语义分割算法,从自拍照的人身上分割背景. Conducted experiments with postprocessing. 用平均相交-超并度(IoU)评价的分割质量从0得到了提高.93 to 0.98.
- Worked on neural networks speedup and quantization. The existing segmentation models were sped up nine times.
- 实现了该算法的实时版本,以超过30帧/秒的速度对视频进行分割。.
- 应用头发着色使用生成模型和其他花式过滤器和面具在c++上实现.
- Helped with the deployment of a neural network to mobile devices. The app reached second place in the App Store in a week with 1.5 million downloads. It was sold to Snapchat along with a team.
Data Scientist | Scala Developer
Double Data
- 在三个开发人员的小团队中,用三个月的时间实现了第一个人员搜索引擎. Reached search quality with 54.5% recall at 99.并且获得了使用JUnit和Mockito进行测试驱动开发的大量经验.
- Implemented and optimized Spark jobs for over 200TB data processing. Configured Spark for efficient batch processing.
- Conducted data analytics tasks for the business. Learned to present and visualize the results.
- 训练和评估信用评分的ML模型,达到40%的不良率.
Data Scientist | Data Engineer
VK group
- 对现有协同过滤(CF)推荐算法进行优化,使推荐速度提高5倍, which led to a 45% click-through rate (CTR) increase. The product was sold to VK Group along with a team.
- 针对冷启动问题,设计并实现了一种有效的用户分割算法, which led to a 70+% CTR increase, making it just 0.09% smaller than CF.
- 实现了新的有效推荐、情感分析和主题建模算法.
- 使得几乎所有的推荐算法都是个性化的、实时的,每次用户交互后都会更新.
- Refactored the entire legacy Scala code of the recommendation engine, which was around 4,000 lines long.
- Applied data science analytics for the business and management.
- Designed a distributed data storage architecture and data flow. Sped up data loading three times.
- Managed a machine learning research team. 为推荐算法测试构建沙盒、ab测试和CI.
Experience
PyTorch ARD
http://github.com/HolyBayes/pytorch_ard该项目基于一篇“变分Dropout Sparsifies Deep Neural Networks”的论文.
PyTorch Implementation of TIRE
http://github.com/HolyBayes/TIRE_pytorch更多信息可以在2020年预印本“使用具有时不变表示的自编码器进行时间序列数据中的变化点检测”中找到."
PyTorch Implementation of KL-CPD
http://github.com/HolyBayes/klcpd更多信息可以在2019年的论文“使用辅助深度生成模型的核变化点检测”中找到."
EM Algorithm with Automatic Relevance Determination
http://github.com/HolyBayes/ard-emEducation
PhD Degree in Computer Science
National Research University HSE - Moscow, Russia
Master's Degree in Computer Science
National Research University HSE - Moscow, Russia
Bachelor's Degree in Physics
Novosibirsk State University - Novosibirsk, Russia
Certifications
Yandex School of Data Analysis
Yandex
Skills
Libraries/APIs
PyTorch, Pandas, NumPy, Scikit-learn, OpenCV, TensorFlow, PySpark
Tools
Git, RabbitMQ, DataStax
Languages
Python, Scala, SQL, C++
Paradigms
Data Science, Anomaly Detection
Storage
Elasticsearch, PostgreSQL, Redis, Cassandra
Frameworks
Spark, Flask
Platforms
Linux, Amazon Web Services (AWS)
Other
Machine Learning, Deep Learning, Computer Vision, Experimental Design, Algorithms, Generative Adversarial Networks (GANs), Bayesian Inference & Modeling, Time Series Analysis, Data Structures, Recommendation Systems, Natural Language Processing (NLP), Domain Adaptation, Search, Social Networks, GPT, Generative Pre-trained Transformers (GPT)
How to Work with Toptal
在数小时内,而不是数周或数月,我们的网络将为您直接匹配全球行业专家.
Share your needs
Choose your talent
Start your risk-free talent trial
Top talent is in high demand.
Start hiring