Slide 1: Cloud Computing
정보통신실험실 임건길
Slide 2: Index
• • • • • • What is Cloud Computing? Why there is Cloud Computing? The Problems of Cloud Computing Outside of Our Book Conclusion & Future Cloud Computing in Korea
05/11/10
2
Slide 3: What is Cloud Computing? -- What is a Cloud?
• Cloud = On Demand + Utility Computing + SOA + ASP + SaaS + DaaS + PaaS + …
– On Demand refers to a service or feature which addresses the user's need for instant gratification and immediacy of use. As a user, you can use it at anytime and anywhere. – Utility computing is the packaging of computing resources, such as computation and storage, as a metered service similar to a traditional public utility (such as electricity, water, natural gas, or telephone network). – Service-oriented architecture (SOA) provides methods for systems development and integration where systems group functionality around business processes and package these as interoperable services. – ASP = Application Service Provider – SaaS = Software-as-a-Service – DaaS = Data-as-a-Service – PaaS = Platform-as-a-Service
05/11/10
3
Slide 4: Service-Oriented Architecture
05/11/10
4
Slide 5: Microsoft Live Platform
05/11/10
5
Slide 6: Why there is Cloud Computing?
• • • • • • Hardware Price High-speed Internet Development of Web Programming Development of Digital Equipment Development of Education Development of Distributed Computing and Parallel Computing
6
05/11/10
Slide 7: Hardware Price
• With the development of technology, the hardware price keeps the low level. That low level price leads that the common family can offer a computer. Besides the companies which provide Cloud Computing Services also take more servers.
05/11/10
7
Slide 8: High-speed Internet
• Internet access—typically contrasted with dial-up access over a modem. Dial-up modems are generally only capable of a maximum bitrate of 56 kbit/s (kilobits per second) and require the full use of a telephone line —whereas broadband technologies supply at least double this bandwidth and generally without disrupting telephone use. • Broadband is often called "high-speed" Internet, because it usually has a high rate of data transmission. In general, any connection to the customer of 256 kbit/ s (0.256 Mbit/s) or greater is more concisely considered broadband Internet.
05/11/10 8
Slide 9: Development of Web Programming
• HTML • XHTML • Web Programming
– – – – JSP PHP ASP Ajax
Ajax (asynchronous JavaScript and XML), or AJAX, is a group of interrelated web development techniques used for creating interactive web applications or rich Internet applications. With Ajax, web applications can retrieve data from the server asynchronously in the background without interfering with the display and behavior of the existing page. Data is retrieved using the XMLHttpRequest object or through the use of Remote Scripting in browsers that do not support it. Despite the name, the use of JavaScript and XML is not required, and they do not have to be used asynchronously.
05/11/10
9
Slide 10: Development of Digital Equipment
• In our life, the digital equipment mostly includes digital camera, digital Videocon and camera phone. Based those digital equipments, we can easily record our society. Those digital equipments can be used in any places. And there are some requests that how store those data, analysis those data, share those data. (YouTube, Flickr) • MapReduce is used to processing of satellite imagery data. (Google Map, Google Earth)
05/11/10 10
Slide 11: Development of Education
• English education has been more important than before. In China, there are many English Training Center in society. And in school English is required course since junior middle school until graduate school. Same with China, Korean English education also shows that the English education had been very important. • Based on the no-speak-English country English education level had promoted. The English only applications have the opportunities to be used over the World.
05/11/10 11
Slide 12: Development of Education - Example
• Figure shows that the twitter.com users over the world. Brazil, China and Germany are nospeak-English countries. But there still are many users to use the twitter.com.
05/11/10
12
Slide 13: Development of Distributed Computing and Parallel Computing
• Distributed computing deals with hardware and software systems containing more than one processing element or storage element, concurrent processes, or multiple programs, running under a loosely or tightly controlled regime. • Parallel computing is a form of computation in which many calculations are carried out simultaneously operating on the principle that large problems can often be divided into smaller ones, which are then solved concurrently ("in parallel"). • Google maintains over 450,000 servers over world.
05/11/10 13
Slide 14: The Problems of Cloud Computing
• • • • • • Data Backup Security Privacy Company Closure Make Money Copyright
05/11/10
14
Slide 15: Data Backup
• Cloud Computing is promising for its several favorable characteristics, such as store in cloud, computing in cloud and application in cloud. If we use Cloud Computing services, we have to store our data in cloud. In the other hand, Mostly Cloud Computing system is based on distributed system, failure always happen. So the data backup is very important. • Google backup its data in 3 different palaces. Those data are stored in more than 48000 hard disks.
05/11/10 15
Slide 16: Security
• Base on the internet’s character, there always be hackers. So the safety is another big problem especially in Cloud Computing. • If you care about this part, you can get more detail from http://cloudsecurity.org/.
05/11/10
16
Slide 17: Biggest Cloud Challenge: Security
05/11/10
17
Slide 18: Privacy ( 사생활 )
• Data in the Cloud can be a privacy problem. • Google’s business model, however, is advertising. So you get Gmail and it look like it’s free, but the price you pay is Google’s data collection. • Shared data makes individual experiences better • If you encrypt personally-identifying data, you make it very difficult to create those sites that crunch everyone’s data and make suggestions or recommendations.
05/11/10 18
– Google keeps your search for 18 months. – YouTube keep your IP address.
Slide 19: Company Closure
• Technologically, closure problem is same with data backup problem. If a provider which provide cloud computing services closed down. Our stored data have a highly risk to take back.
05/11/10
19
Slide 20: Make Money
• The best way to make sure the companies don’t close down is make money. So make money is also a problem. Google bought YouTube for $1.65 billion in 2006. But YouTube announce that they still havn’t found any idea to make money until now. YouTube need more new ideas. Developing a new technology, creating a new model, providing more good applications are the answer that how make money. Google is promising for above mentioned characteristics.
05/11/10 20
Slide 21: Copyright
• Copyright is the biggest problem to stop the development. Copyright protection became a rule to be written in law. Copyright is a legal concept, enacted by governments, giving the creator of an original work of authorship exclusive rights to control its distribution for a certain time period, after which the work enters the public domain. So the cloud computing provider should have the function of content checking to make sure the content which uploaded by user is legal.
05/11/10 21
Slide 22: Copyright -Examples
• Upload Burn-E to YouTube
– Burn-E is a short film by Pixar Animation Studios based on the movie WALL-E that is attached to the movie's DVD and Blu-ray release on November 18, 2008.
• Fans upload the fan wallpapers to Desktop Nexus Wallpapers
– Toyota’s lawyers are demanding the withdrawal of all wallpapers that feature a Toyota, Scion, or Lexus. The site’s All images featuring Toyota vehicles should be removed, even images with copyright belonging to others.
05/11/10
22
Slide 23: Cloud Computing Incidents
05/11/10
23
Slide 24: Outage of Cloud Computing
• Amazon S3 Outage • Flexiscale Outage • Gmail Outage
– 8 hours in July 20, 2008 (Affected: all) – Cause: Design fault (server-to-server communication) – 2 days in August 26, 2008 (Affected: all) – Cause: Engineer mistake – 2 hours in August 11, 2008 (Affected: many) – Cause: Change management – Several hours in July 10, 2008 (Affected: many) – Cause: Migration from .Mac to MobileMe
24
• Apple MobileMe Outage
05/11/10
Slide 25: Closure of Cloud Computing
• MediaMax/Linkup
– Cloud storage service – Data loss of half of user files in July 2007 – 20,000 paid users are affected – Finally, service closure in July 2008
• Zimki
– Early cloud platform service (from 2006) – Service closure in December 2007 – Caused by the cease of investment
05/11/10 25
Slide 26: Outside of Our Book
• Health 2.0
– Connecting to Health Professionals – Medical Records
• Some interesting Applications
– last.fm – linkedin.com – Google Mobile App – Google RSS Reader – Twitter
05/11/10 26
Slide 27: Health 2.0
• Health 2.0, web-based apps and services for the healthcare sector, is a nascent but potentially huge market for web 2.0. Probably the biggest change over the past year in health 2.0 has been the number - and quality of health web apps that have become available.
05/11/10
27
Slide 28: Connecting to Health Professionals
• What we're all really wanting in healthcare web apps is the ability to connect with health systems and manage our health online. • Then there are apps such as MyMedLab which enables consumers to order and pay for many routine lab tests online, then go to their local lab to get their blood drawn and have their results sent to them electronically.
05/11/10 28
Slide 29: Medical Records
• There is a lot of competition now in the area of online health records. Google Health and Microsoft's Healthvault are essentially doing the same thing - both are platforms, neither tries to be a healthcare provider or conduit between healthcare professionals and patients, and both have search as their business model.
05/11/10 29
Slide 30: Some interesting Applications
• • • • • last.fm linkedin.com Google Mobile App Google RSS Reader Twitter
05/11/10
30
Slide 31: last.fm
• Scrobbling a song means that when you listen to it, the name of the song is sent to Last.fm and added to your music profile. Once you’ve signed up and downloaded Last.fm, you can scrobble songs you listen to on your computer or iPod automatically. Start scrobbling yourself, and see what artists you really listen to the most. Songs you listen to will also appear on your Last.fm profile page for others to see. Millions of songs are scrobbled every day. This data helps Last.fm to organize and recommend music to people; we use it to create personalized radio stations, and a lot more besides. You can see what Last.fm users listened to the most last week by clicking the Charts button at the top of the page. Add your musical taste into the mix and help us get some decent music into the top 10.
05/11/10 31
Slide 32: Cloud Computing @ last.fm
• Popular this week in Korea • Popular tags
– – – – – – – – – – – – – – – – – – – – All Blues Classical Country Electronic Hip-Hop Indie rock Jazz Latin Pop Punk RnB Reggae Rock Soul World 50s 60s 70s 80s
• Musical Compatibility
05/11/10 32
Slide 33: Popular this week in Korea @ last.fm
Map (Korea, Table) { } Map (All/Rock/etc., kTable) { } Reduce (Artist/Track, nTable) { } Sort () { }
05/11/10 33
Slide 34: linkedin.com ( 개인이력 )
• LinkedIn’s simple philosophy:
– “Relationships Matter”
• What is LinkedIn?
– LinkedIn is an online network of more than 30 million experienced professionals from around the world, representing 150 industries.
• LinkedIn is free to join. We also offer paid accounts that give you more tools for finding and reaching the right people, whether or not they are in your network.
05/11/10 34
Slide 35: Google Mobile App
• Google Mobile App for iPhone now with Voice Search • Speech/Voice Recognition in Cloud
– After you speak your query, Google Mobile App will return search results formatted for your iPhone. – It’s easy to forget that the speech recognition isn’t happening on your phone. It’s happening on Google’s servers. It’s Google’s vast database of speech data that makes the speech recognition work so well. It would be hard to pack all that into a local device. – IBM ViaVioce
• Reverse example:
05/11/10
35
Slide 36: Google RSS Reader
• RSS is a family of Web feed formats used to publish frequently updated works—such as blog entries, news headlines, audio, and video —in a standardized format. • RSS feeds can be read using software called an “RSS reader”, “feed reader”, or “aggregator”, which can be web-based or desktop-based.
05/11/10
36
Slide 37: Twitter ( 말 한마디로 블로그 )
• Twitter is a privately funded startup with offices in the SoMA neighborhood of San Francisco, CA. Started as a side project in March of 2006, Twitter has grown into a real-time short messaging service that works over multiple networks and devices. • Andy Stanford –Clark, an IBM “Master Inventor”, has instrumented his house and hooked it up to Twitter. You can follow his house and see his power consumption, when the phone rings, when the motion-sensitive lights turn on and off, etc.
– https://twitter.com/andy_house
05/11/10 37
Slide 38: Conclusion & Future
• Cloud Computing is trend of development. Web is a black hole. Future media will be based on web. It will include TV, film, video, phone, podcasts, newspapers, books, music, email, blogs, websites, magazines, radio. Our life will move to cloud, move to database, move to share.
05/11/10
38
Slide 39: Talking Web
In 25 Nov 2008, IBM Reveals Five Innovations That Will Change Our Lives in the Next Five Years. It include * Energy saving solar technology will be built into asphalt, paint and windows * You will have a crystal ball for your health * You will talk to the Web . . . and the Web will talk back * You will have your own digital shopping assistants * Forgetting will become a distant memory My focus is on the third. You will talk to the Web . . . and the Web will talk back. Talk to the Web application: Google Mobile App - Speech/Voice Recognition in Cloud Web talk application: http://www.textvoz.com/en/common/tts.shtml
05/11/10
39
Slide 40: Cloud Computing in Korea
• Site http://www.hadoop.or.kr/ http://www.nexr.co.kr • 넥스알에 대해 간단히 소개 드리면 인터넷 서비스와 이를 위한 기반 플랫폼을 전문으로 하는 회사입니다 . 웹 2.0 의 등장과 함께 데이터의 저장과 처리에 대한 중요성이 높아가고 있고 이를 효율적으로 처리해 줄 분산 시스템이 서비스의 핵심 경쟁력으로 떠오르고 있 습니다 . 넥스알은 분산 플랫폼에 대한 개발과 기술 컨 설팅 등을 주 업무로 하고 있습니다 . • Personal Blog http://blog.udanax.org http://freesearch.pe.kr/
05/11/10 40
Slide 41: NexR VCC
• Hadoop on Virtualization • The first server virtualization tool in Korea • Features
– Xen-based(paravirtualization& HVM) – Web-based AJAX Interface – Multiple Virtual Clusters – – – – –
05/11/10
Automatic HadoopDeployment & Execution User/Group Management VM Image Store VM Migration Web Services API
41
• Allocated virtual clusters(VC) to teams • Each team can manage their VC
Slide 42: VCC: Hadoop on Virtualization
• Auto-Deployment & Execution • Integration with Hadoop management
• http://www.hadoop.or.kr:8080/vcc/js/index.html
05/11/10 42