職位描述
Responsibilities:
We are currently recruiting for a motivated and passionate Data Engineer who is responsible for developing, optimizing and operating our data pipeline and applications, which are used to help implement innovation data platform process applications and give smart insights. In this role you will establish scalable, efficient, automated processes for large scale data pipelines, as well as building insightful applications based on open-ended pool of data.
Job Requirements:
* Build the infrastructure to gather and process large volumes of raw data into data suitable for analysis
* Design and implement near real-time data applications on top of huge amount of data
* Enhance data product like query/monitoring/analysis to ease common users work in big data environment
* Support operational monitoring requests from business team
* Track risk operation issues and work with partners on investigation
Basic Qualifications
* Bachelor’s degree with 3 years working experience in relevant field
* Strong programming skills in Java, Scala or Python; hands on with one of the scripting languages: Bash or Perl
* Experience in custom data pipeline design, implementation and maintenance
* Experience in big data programming (Hadoop/Spark/Kafka/Storm)
* Experience in database usage (MySQL/MongoDB/Redis)
* Experience in web development and machine learning application is a big plus
* Knowledge of AWS Data Stack using S3, EMR, Data Pipeline,Lambda,Kinesis
* Good verbal and written communication skills in English
------------數據開發工程師---------------
工作職責:
我們目前正在招聘一位積極和熱情的數據工程師,負責開發,優化和運營我們的數據處理平臺和應用程序,用于幫助實施創新數據平臺流程應用程序并提供明智的見解。在此角色中,您將為大型數據處理管道的平臺建立可擴展,高效,自動化的流程,并建立基于開放數據池的有見地的應用程序。
工作要求:
*構建基礎設施,將大量原始數據收集并處理成適合分析的數據
*在大量數據之上設計和實現近實時數據應用
*加強數據產品,如查詢/監控/分析,以減輕普通用戶在大數據環境中的工作
*支持業務團隊的運營監控要求
*跟蹤風險操作問題,并與合作伙伴進行調查
基本資格
*本科以上學歷,3年以上的相關工作經驗
*熟練使用Java 或Scala或Python編程語言;同時擅長使用Bash或Perl
*具有定制數據處理管道設計,實施和維護方面的經驗
*擁有基于大數據平臺的編程經驗(Hadoop / Spark / Kafka / Storm)
*豐富的數據庫使用經驗(MySQL / MongoDB / Redis)
*最好具有網頁開發和機器學習應用的經驗
*了解使用基于AWS的S3,EMR,Data pipeline,Lambda,Kinesis
*良好的英語口語和書面溝通能力
企業介紹
PatSnap is a disruptive market leading provider of intellectual property
analytics, for analysing technology trends, accelerating innovation, market
planning, competitor intelligence and maximising returns on existing and new
IP assets. It is used by over 3000 organisations globally including Nasa, GE,
Lego, Vodafone, Ferrari, Siemens, Xiaomi and China Mobile. The company is
backed by world class venture capital firms such as Sequoia, Summit
Partners, Shunwei and Vertex Ventures. With an impressive revenue growth
rate of 1078% from 2014 to 2016, PatSnap was ranked 44 on “Deloitte
Technology Fast 500”.
智慧芽是一家全球領先的知識產權信息服務(SaaS)提供商,基于專利大數據,
幫助分客戶析和了解最新技術發展趨勢并加速創新、獲取競爭對手情報、科學
進行市場布局以及實現知識產權價值最大化,提高企業核心競爭力。目前全球
已有超過3000 多機構和企業成為智慧芽的客戶,如美國宇航局、通用、樂高、
沃達豐、法拉利、西門子、小米、中國移動等。智慧芽得到了包括紅杉、頂峰
投資、順為、淡馬錫祥峰基金等世界頂級風險投資機構的青睞和投資。2014~
2016 年,智慧芽的營業收入以超過1078%的增長率快速發展,被評為德勤亞太
區高科技高成長500 強企業,并獲得第44 位的優質排名。