1 2	原则是待切分的大故事是否满足INVEST*原则（“较小的”这一点可以除外）？故事大小是团队速率的 1/10 到 1/6 吗？

第二步运用切分模式

按工作流程步骤切分
延迟性能优化
按简单/复杂切分
按主要工作切分
按不同界面切分
按不同类型的数据切分
按不同的业务规则切分
按操作切分

第三步评估切分
通过对切分提出以下问题进行评估：

新故事的大小大致相等吗？
每个故事大概是团队速率的1/10到1/6吗？
每个故事都满足INVEST原则吗？
有可以降低优先级或删除掉的故事吗？
有没有明显的故事先开始，从而可以获得早期的价值、认知或风险降低等？

注：INVEST - 故事应该是：
独立的
可商谈的
有价值的
可估算的
较小的
可测试的

参考：

2016-05-16

技术

[120]学习《Thoughtworks技术雷达201604》

笔记如下：

技术雷达从两个维度对技术进行评估，一个维度是技术归类的四个象限，包括：技术、平台、工具、语言及框架，另一个维度是反映所持有的态度的四个环，依次为：采用、试验、评估、暂缓。

采用：我们强烈主张业界采用这些技术。如果适用我们的项目，我们会采用他们。
试验：值得追求。重要的是理解如何建立这种能力。企业应该在风险可控的项目中尝试该项技术。
评估：为了确认它将如何影响你所在的企业，值得做一番探究。
暂缓：谨慎推行。

在技术理念上需要关注：

Products over projects
BFF - Backend for frontends
Data Lake
Event Storming
QA in production
Reactive architectures

平台部分关注：
1
2
3
Docker
Apache Mesos
Kubernets
工具部分关注：
1
2
3
Consul
Apache Kafka
Zipkin

语言和框架部分关注：

ES6
React.js
Spring Boot
Swift
Dagger
Dapper
Ember.js
Reactive Native

暂缓部分关注：

1 2	1. Application Servers 2. Jenkins as a deployment pipeline

<完>

2016-04-21

技术

[118]HDFS Snapshot

参考：
1.http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HdfsSnapshots.html
2.http://debugo.com/hdfs-snapshot/
3.http://blog.csdn.net/linlinv3/article/details/44622203

2016-04-18

技术

[117]学习《Real-Time Event Streaming What Are Your Options》

一个典型的流式架构(a typical streaming architecture)

这是一张图片

流式架构三组件

A producer: 与数据源相连的软件系统。生产者从数据源采集、转换、过滤、聚合、增强之后发布事件数据到流式系统中。
The streaming system: 接受生产者发布的数据，持久化这些数据，然后可靠的将数据分发给消费者。
Consumers: 从流中订阅数据，并操作，或者分析这些数据。

技术选型(Options)

Producers:

Apache Flume
StreamSets Data Collector

Streaming System

Apache Kafka
MapR Streams

Comsumers(Processing)

Spark Streaming
Apache Storm
Apache Flink
Apache Apex

参考：

1.Real-Time Event Streaming What Are Your Options?
2.《Streaming Architecture》
3.Stream-based Architecture
4.Streaming Architecture: Ideal Platform for Microservices

2016-04-18

技术

[116]简报：大数据Hadoop动态 - 2016Q2

Apache Storm

Storm发布1.0.0版本，关键特性：

Developer Productivity
New Storm Connectors Storm-Kafka Spout using new client APIs
Distributed Log Search Dynamic Worker Profiling
Enterprise Readiness
Improved nimbus HA Automatic Back pressure
Distributed cache Windowing and State Management
** Storm Performance improvements
Operational Simplicity
Storm Topology Event inspector Resource Aware Scheduling
Dynamic Log Levels Pacemaker Storm Daemon
http://hortonworks.com/blog/announcing-apache-storm-1-0-0/
https://storm.apache.org/2016/04/12/storm100-released.html

HDP 2.4.2版本中APACHE SPARK & APACHE ZEPPELIN的增强

Certified SparkSQL with ODBC (ODBC driver available from Hortonworks).
Bug fixes in Spark Oozie action for a Kerberos enabled cluster.
Spark Streaming with Apache Kafka support in a Kerberos enabled cluster.
SparkSQL & ORC performance improvements.
Final technical preview of Apache Zeppelin that includes Kerberos support, LDAP Authentication, and identity propagation.
http://hortonworks.com/blog/apache-spark-apache-zeppelin-whats-coming-in-hdp-2-4-2/

Cloudera Engineering

Cloudera Vision

How GoPro uses Apache Hadoop in the Cloud
https://vision.cloudera.com/gopro-hadoop-cloud/
SQL-on-Apache Hadoop – Choosing the right tool for the right job
https://vision.cloudera.com/sql-on-apache-hadoop-choosing-the-right-tool-for-the-right-job/
New Open-Source Service Enables Apache Spark Development
https://vision.cloudera.com/new-open-source-service-enables-apache-spark-development/
Tuning Hive on Spark
http://www.cloudera.com/documentation/enterprise/latest/topics/admin_hos_tuning.html
http://www.cloudera.com/documentation/enterprise/latest/topics/admin_performance.html
Faster Batch Processing with Hive-on-Spark
https://vision.cloudera.com/faster-batch-processing-with-hive-on-spark/
Beyond ETL: Real-time, Streaming Architectures
https://vision.cloudera.com/beyond-etl-real-time-streaming-architectures/
The One Platform Initiative Delivers
https://vision.cloudera.com/the-one-platform-initiative-delivers/

Hortonworks

Databricks

Preview of Apache Spark 2.0 now on Databricks Community Edition
https://databricks.com/blog/2016/05/11/spark-2-0-technical-preview-easier-faster-and-smarter.html
Spark Trending in the Stack Overflow Survey
https://databricks.com/blog/2016/03/22/spark-trending-in-the-stack-overflow-survey.html
http://stackoverflow.com/research/developer-survey-2016
Continuous Integration and Delivery of Spark Applications at Metacog
https://databricks.com/blog/2016/04/06/continuous-integration-and-delivery-of-spark-applications-at-metacog.html

MapR

IoT Spotlight: Sensor to Dashboard – Real-Time Stream Processing for Oil and Gas
https://www.mapr.com/blog/iot-spotlight-sensor-dashboard-real-time-stream-processing-oil-and-gas
Using MapR, Mesos, Marathon, Docker, and Apache Spark to Deploy and Run Your First Jobs and Containers
https://www.mapr.com/blog/using-mapr-mesos-marathon-docker-and-apache-spark-deploy-and-run-your-first-jobs-and-containers
Apache Apex on MapR Converged Platform
https://www.mapr.com/blog/apache-apex-mapr-converged-platform
Monitoring a MapR Cluster with Elasticsearch + Kibana
https://www.mapr.com/blog/monitoring-mapr-cluster-elasticsearch-kibana
Real Time Credit Card Fraud Detection with Apache Spark and Event Streaming
https://www.mapr.com/blog/real-time-credit-card-fraud-detection-apache-spark-and-event-streaming
Fast, Scalable, Streaming Applications with the Kafka API (MapR Streams), Spark Streaming, and the HBase API (MapR-DB)
https://www.mapr.com/blog/fast-scalable-streaming-applications-kafka-api-mapr-streams-spark-streaming-and-hbase-api-mapr

2016-04-01

技术

[115]简报：大数据Hadoop动态 - 2016Q1

Cloudera

Hortonworks

MapR

Databricks

参考

On The Open Way

自信人生二百年，会当水击三千里！

[125]简报：TalkingData产品分析 - 2016Q2

[124]简报：机器学习 & 深度学习 & 人工智能 & BI & 数据挖掘 - 2016Q2

机器学习 & 深度学习 & 人工智能

数据挖掘

[123]简报：开源流计算框架 - 201605

[122]大数据之电信运营商案例

美国AT&T

西班牙电信智慧足迹(smart steps)

美国威瑞森(Verizon) Precision Market Insights

[121]敏捷之如何切分用户故事

[120]学习《Thoughtworks技术雷达201604》

[118]HDFS Snapshot

[117]学习《Real-Time Event Streaming What Are Your Options》

一个典型的流式架构(a typical streaming architecture)

流式架构三组件

技术选型(Options)

参考：

[116]简报：大数据Hadoop动态 - 2016Q2

Apache Storm

HDP 2.4.2版本中APACHE SPARK & APACHE ZEPPELIN的增强

Cloudera Engineering

Cloudera Vision

Hortonworks

Databricks

MapR

[115]简报：大数据Hadoop动态 - 2016Q1

Cloudera

Hortonworks

MapR

Databricks

机器学习 & 深度学习 & 人工智能

数据挖掘

美国AT&T

西班牙电信 智慧足迹(smart steps)

美国威瑞森(Verizon) Precision Market Insights

一个典型的流式架构(a typical streaming architecture)

流式架构三组件

技术选型(Options)

参考：

Apache Storm

HDP 2.4.2版本中APACHE SPARK & APACHE ZEPPELIN的增强

Cloudera Engineering

Cloudera Vision

Hortonworks

Databricks

MapR

Cloudera

Hortonworks

MapR

Databricks

西班牙电信智慧足迹(smart steps)