NBD-Requirements Meeting Minutes July 23 2013

We spent the majority of the time discussing the use case template, finally arriving at the uploaded draft.

We expect other WGs to comment on and probably edit this. The final part of the discussion concerned how we could analyse use cases to generalize input. We added a new field to the template to support this.

We got volunteers to collect use cases:

● Yuri Demchenko (Use case (UvA1): LifeWatch – European Infrastructure for Biodiversity and Ecosystem Research; Use case (UvA2): Humanities and language research infrastructure)

● William Miller (Cargo Shipping)

● Gary Mazzaferro sent the template to OOI (Ocean Observatories Initiative)

There are 3 existing use cases:

● Particle Physics

● Netflix

● NIST/Genome in a Bottle Consortium

These existing use cases need minor updates to match the changed template.

Log of Chat Session

(11:03 AM) Geoffrey Fox: I'm on computer

(11:04 AM) William Miller (MaCT USA) joined.

(11:04 AM) Sanjay Mishra(Verizon) joined.

(11:04 AM) Yuri Demchenko (UvA) joined.

(11:06 AM) guest joined.

(11:07 AM) guest disconnected.

(11:07 AM) guest joined.

(11:08 AM) Tim Zimmerlin (Automation Technologies) joined.

(11:08 AM) William Miller (MaCT USA): 40 TB is rather small

(11:08 AM) William Miller (MaCT USA): only 40 hard drives

(11:09 AM) William Miller (MaCT USA): I will submit one for cargo shipping

(11:10 AM) William Miller (MaCT USA): systems like FedEx, DHL, UPS, USPS are huge

(11:10 AM) William Miller (MaCT USA): they are in common use today and can benefit from big data

(11:10 AM) Alicia Zuniga-Alvarado/AZA joined.

(11:11 AM) Nancy Grady (SAIC) joined.

(11:11 AM) William Miller (MaCT USA): cargo tracking is important and requires real-time updates (i.e. GPS, date, time, etc).

(11:12 AM) William Miller (MaCT USA): need to add bi-directionality

(11:12 AM) William Miller (MaCT USA): no indication of real-time updates

(11:12 AM) William Miller (MaCT USA): in use case template

(11:12 AM) William Miller (MaCT USA): roughly ok

(11:13 AM) William Miller (MaCT USA): netflix is not real-time

(11:13 AM) William Miller (MaCT USA): yes

(11:13 AM) Gary Mazzaferro: yes

(11:14 AM) William Miller (MaCT USA): real-time add a dash

(11:15 AM) Gary Mazzaferro: in taxonomy, velocity is being discussed as rate of change

(11:16 AM) Gary Mazzaferro: Overlay in this sense is technical issue

(11:30 AM) Bob Marcus: I have to drop off. Please keep in mind the goal of delivering Big Data Requirements that will help other subgroups and be part of the final deliverable.

(11:38 AM) _Cherry Tom_(_IEEE-SA_) joined.

(11:42 AM) Tim Zimmerlin (Automation Technologies): Not me!

(11:42 AM) Gary Mazzaferro: I can send one or two out to groups

(11:43 AM) Alicia Zuniga-Alvarado/AZA disconnected.

(11:44 AM) Gary Mazzaferro: Sending out the template OOI

(11:45 AM) William Miller (MaCT USA): i have a couple of new items

(11:45 AM) William Miller (MaCT USA): the use cases do not address mobility

(11:45 AM) William Miller (MaCT USA): it should be asked if the data will be used under mobility

(11:46 AM) William Miller (MaCT USA): do the end devices have constrained memories

(11:46 AM) William Miller (MaCT USA): there are special requirements for accessing data under mobility

(11:47 AM) William Miller (MaCT USA): put in the use case that the supplier should not include any confidential or classified information

(11:48 AM) William Miller (MaCT USA): also we need to ask the question can the data set be decentralized

(11:48 AM) Gary Mazzaferro: Sent a message to Matt Arrot @Ocean Observatory Initiative

(11:49 AM) William Miller (MaCT USA): under additional comments

(11:49 AM) Yuri Demchenko (UvA): Use case (UvA1): LifeWatch – European Infrastructure for Biodiversity and Ecosystem Research; Use case (UvA2): Humanities and language research infrastructure

(11:49 AM) William Miller (MaCT USA): any device such as smart phones, ipads, notebooks

(11:50 AM) William Miller (MaCT USA): these have special requirements for access to data

(11:50 AM) Gary Mazzaferro: have to drop off

(11:50 AM) Yuri Demchenko (UvA): These are quite specific usecases not from the high end science but with growing demand for data storage and complex data structures

(11:50 AM) Gary Mazzaferro disconnected.

(11:50 AM) William Miller (MaCT USA): thee types of devices each have a different way to access the data

(11:50 AM) William Miller (MaCT USA): there will be stress from mobile devices especially when millions of devices try to access the big data application

(11:52 AM) _Cherry Tom_(_IEEE-SA_): data source can be mobile also

(11:52 AM) William Miller (MaCT USA): there has to be an adapter

(11:52 AM) William Miller (MaCT USA): for different mobile devices

(11:55 AM) William Miller (MaCT USA): the use case form will need a definition of the words used in the use case

(11:56 AM) William Miller (MaCT USA) disconnected.

(12:02 PM) William Miller (MaCT USA) joined.

(12:02 PM) William Miller (MaCT USA): There is an additional item that can be added to the Use Case template

(12:03 PM) William Miller (MaCT USA): Is the data compressible? or what methods are used? but this can be a problem for accessing the data

(12:04 PM) William Miller (MaCT USA): does the current approach use metadata to define access to the resource

(12:05 PM) William Miller (MaCT USA): does the big data system offer a means to define access to the resource, is it encrypted, is it compressed; these are all important. Under mobility this is also a problem since some type of data will need to be converted or will not be accessible.

(12:05 PM) _Cherry Tom_(_IEEE-SA_): data contents i.e sources are part of current solns?

(12:08 PM) William Miller (MaCT USA): packet size will be very important

(12:09 PM) William Miller (MaCT USA): large files will clog the system and possibly cause a lockup

(12:09 PM) William Miller (MaCT USA): this is particularly a problem if web servers are used

(12:11 PM) Alicia Zuniga-Alvarado/AZA joined.

(12:19 PM) Wo Chang (Host, NIST): testing

(12:38 PM) Alicia Zuniga-Alvarado/AZA disconnected.

(12:39 PM) Alicia Zuniga-Alvarado/AZA joined.

(12:54 PM) _Cherry Tom_(_IEEE-SA_): need to separate solns from requirements? requirement is to process x amount of data in y time?

(1:00 PM) William Miller (MaCT USA) disconnected.

(1:00 PM) _Cherry Tom_(_IEEE-SA_): need to unmute me

(1:02 PM) guest disconnected.

(1:07 PM) Tim Zimmerlin (Automation Technologies) disconnected.

(1:07 PM) _Cherry Tom_(_IEEE-SA_) disconnected.

(1:07 PM) Nancy Grady (SAIC) disconnected.

(1:07 PM) Sanjay Mishra(Verizon) disconnected.

(1:08 PM) Alicia Zuniga-Alvarado/AZA disconnected.

(1:15 PM) Yuri Demchenko (UvA) disconnected.