Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
kb:bigdata [2020/06/29 06:43]
yehuda
kb:bigdata [2021/03/07 15:21] (current)
yehuda [Ingestion]
Line 13: Line 13:
   * [[https://www.cloudera.com/documentation/enterprise/5-9-x/topics/admin_hos_tuning.html|Cloudera tune]]   * [[https://www.cloudera.com/documentation/enterprise/5-9-x/topics/admin_hos_tuning.html|Cloudera tune]]
  
 +==== Tools ====
  
-===== Performance Tools ===== 
   * [[https://unraveldata.com/|Unravel]]   * [[https://unraveldata.com/|Unravel]]
   * [[https://github.com/linkedin/dr-elephant|Dr. Elephant]]   * [[https://github.com/linkedin/dr-elephant|Dr. Elephant]]
   * [[https://www.pepperdata.com/| Dr. Elephant Enterprise]]   * [[https://www.pepperdata.com/| Dr. Elephant Enterprise]]
 +
 +===== Ingestion =====
 +
 +  * [[https://gobblin.apache.org/|Apache Gobblin]]
 +    * [[https://www.youtube.com/watch?v=BQ7aONetKl4|Youtube:Stream and Batch Data Integration at LinkedIn scale using Apache Gobblin]]
 +    * [[https://engineering.linkedin.com/blog/2021/data-integration-library|Linkedin blog: data-integration-library]]
 +    * [[https://gobblin.readthedocs.io/en/latest/miscellaneous/Exactly-Once-Support/#achieving-exactly-once-delivery-with-commitstepstore|Gobblin Exactly-Once-Support readthedocs.io]]
 +    * [[https://www.youtube.com/watch?v=fHFNZlWCpKA|Youtube:Gobblin как ETL-фреймворк / Иван Ахлестин (Rambler&Co)]]
 +    * [[https://cwiki.apache.org/confluence/display/GOBBLIN/Gobblin+as+a+Service|Gobblin as a Service]]
 +    * [[https://gobblin.apache.org/docs/user-guide/Gobblin-CLI/|user-guide Gobblin-CLI]]
 +
 +
 +===== Workflow =====
 +
 +  * [[https://azkaban.github.io|Azkaban]]
 +
 +===== MDM =====
 +  * DataHub: A Generalized Metadata Search & Discovery Tool (ex WhereHows)
 +    * [[https://github.com/linkedin/datahub| Linkedin Datahub (ex WhereHows)]]
 +    * [[https://engineering.linkedin.com/wherehows | Linkedin wherehows]]
  
 ===== OLAP & OLTP ===== ===== OLAP & OLTP =====
Line 98: Line 118:
 ===== Other url ===== ===== Other url =====
   * [[https://streever.atlassian.net/wiki/spaces/HADOOP/pages/9961474/Hive+JDBC+Extended+Connection+URL+Examples| Hadoop]]   * [[https://streever.atlassian.net/wiki/spaces/HADOOP/pages/9961474/Hive+JDBC+Extended+Connection+URL+Examples| Hadoop]]
 +  * [[https://cdap.io/|CDAP]]
kb/bigdata.1593412999.txt.gz · Last modified: 2020/06/29 06:43 by yehuda
Back to top
Driven by DokuWiki Recent changes RSS feed Valid CSS Valid XHTML 1.0