Siri

Apple can credit much of its success to not only delivering good hardware, but using the hardware to develop new software experiences. With the 4S and iOS 5, most of the software experiences are available on older hardware - the sole exception being Siri.

The expectation that Apple will always deliver more than just a hardware upgrade is likely what made Siri a 4S exclusive (that and controlling the rollout as to not overly burden Apple's servers). The work carried out on the phone itself is minimal compared to what’s done server-side. A quick look at CPU utilization while running a simple Siri query (how many meters are in a mile) reveals the most stressful part of the process is animating the results on the screen. The display process is multithreaded but even then peak usage rarely steps above a single CPU core, in other words - this could have been done on the 4.

Siri is the logical next step from after the existing voice engine on the iPhone 4, which I doubt many people used on a day to day basis. Functionality was limited to a small laundry list of operations, and if you wandered outside the confines of those voice commands, you were left out of luck. I can count the number of times I did anything with the iPhone 4 voice commands on one hand.

Siri originally started as a standalone application, where it initially impressed with the ability to take a conversational approach to voice commands. Apple quickly acquired Siri, built it into iOS 5, and enabled the feature on just the iPhone 4S. At present, the feature only works for English (US, UK, Australia), French, and German, though in 2012 the language engine will also be compatible with Japanese, Chinese, Korean, Italian, and Spanish.

 

Hold the phone up to your head when not in a call, or long press the home button (which was the previous voice command action) and you’ll get Siri’s purple microphone and a circle button. If you’re familiar with Android’s voice dialogs, this visualization is pretty similar, though the functionality is very different.

Siri also works with bluetooth headsets and normal headsets, including the Apple-supplied button plus microphone earbuds. Normally Siri doesn’t read text messages or emails, but if you have a headset, it then reads everything back to you and becomes a much more powerful hands-free tool. Hold down the action button on a bluetooth earpiece or on the Apple earbud cable, and it will toggle Siri mode the same as holding the home button down. Bluetooth functionality works pretty well, though there’s added latency each time the earpiece audio link is established and torn down that can be unnerving.

Siri leverages Yelp for businesses/restaurants and Wolfram Alpha for any queries that require computation. It is interesting to think of Siri as Apple’s foray into search. Siri isn’t designed to monetize search, but any queries run through Siri definitely don’t line Google’s pockets. Given how much Google depends on search for its revenues and the growth of its businesses, Siri may be viewed as a competitive threat. Perhaps that’s why we don’t see Siri using Google for search?

There are two components to every Siri interaction: your query, and the results it returns. Where Siri is truly impressive is in its ability to understand your queries. You can ask it to tell you the closest Italian restaurants, or just utter the phrase “I want cake” and get a list of options in the area. For the former, Siri just looks for restaurants classified as Italian, for the latter it looks for restaurants whose Yelp reviews mention the word cake. Siri’s ability to dissect and handle the query is impressive, unfortunately its usefulness is gated by how active Yelp users are in your area. Yelp use in Raleigh is disappointing overall, which in turn hurts Siri’s ability to recommend what I’m looking for.

  

When it works, it works very well. There happen to be a lot of burger options where I live and simply asking Siri “what’s the best burger joint” returned a fairly accurate ranking of my options. The same goes for pizza, but my cake query didn’t really give me useful results either in Raleigh or Durham. Siri is only as smart as the databases that it relies on, and unfortunately simply looking for reviews that mention cake isn’t the best way to direct you to sugary goodness. 

  

The computation aspect of Siri is pretty useful. You can ask Siri simple conversion questions like “how many quarts are in a gallon” or something more complex like “how much is a 15% tip for $247 for five people”. In the case of the latter you’ll get the total tip amount, the per-person share as well as other tip amounts (e.g. 10%, 20%). Take it one step further and ask Siri to tell you when it’s customary to tip and you quickly run into a wall. In this case it’s not the data but Siri’s awareness of the question that’s limiting you.

  

Given how frequently I’m in meetings, one of the most useful aspects of Siri to me personally is its ability to schedule meetings. I can tell Siri to schedule a meeting with AMD at 2PM Pacific and it’ll offer to create the appointment in my calendar for me. If there’s a conflict, Siri offers resolution options. Combine that with seamless calendar syncing via iCloud and you’ve got a pretty powerful tool. Siri also handles US time zones very well. As you may have gathered from the line above, I can tell Siri to schedule a meeting in a different time zone without having to do the conversion myself. The time zone support breaks down once you go beyond the US unfortunately. 

Siri is similarly great for reminders. Using iOS 5’s location based triggers and by simply giving Siri some information ahead of time (e.g. telling it where home and work are in your address book), you can have Siri remind you to buy milk when you leave work or buy a cat when you leave home. 

Siri’s voice recognition engine works pretty well out of the box, but also learns individual voice characteristics and dialects on a larger scale. This individually trained language data can optionally be reset as well by just toggling Siri off and back on. I trained Siri for about a week, reset the data, and went through a few commands - it’s obvious that it does tailor a voice profile after a while.

Gallery: Siri Examples

Siri is surprisingly good at following along in conversations, but ultimately it’s still limited to a certain set of tasks, though the list is pretty big. Things like emailing a contact, sending text messages, responding to messages, setting alarms, reminders, and appointments, playing music (based on artist, album, or title), looking basic things up, and then features that are enabled through ties into Yelp and WolframAlpha. Much of what I ask Siri that isn’t an obvious phone function (text, call, email) ends up being directed into a Yelp search, and the depth of interesting redirects here are impressive - everything from telling Siri you’ve soiled yourself (which sends you clothing stores), that you need drugs (which finds addiction treatment centers) to specific things like wanting a certain genre of food nearby. It’s clear that the team responsible for Siri had a lot of fun, and by now the number of interesting easter eggs have been pretty well documented all over the web.

The A6: What's Next? WiFi, GPS, Audio, Speakerphone
Comments Locked

199 Comments

View All Comments

  • robco - Monday, October 31, 2011 - link

    I've been using the 4S from launch day and agree that Siri needs some work. That being said, it's pretty good for beta software. I would imagine Apple released it as a bonus for 4S buyers, but also to keep the load on their servers small while they get some real-world data before the final version comes in an update.

    The new camera is great. As for me, I'm glad Apple is resisting the urge to make the screen larger. The Galaxy Nexus looks nice, but the screen will be 4.65". I want a smartphone, not a tablet that makes phone calls. I honestly wouldn't want to carry something much larger than the iPhone and I would imagine I'm not the only one.

    Great review as always.
  • TrackSmart - Monday, October 31, 2011 - link

    I'm torn on screen size myself. Pocketable is nice. But I'm intrigued by the idea of a "mini-tablet" form factor, like the Samsung Galaxy Note with it's 5.3" screen (1280x800 resolution) and almost no bezel. That's HUGE for a phone, but if it replaces a tablet and a phone, and fits my normal pants pockets, it would be an interesting alternative. The pen/stylus is also intriguing. I will be torn between small form factor vs mini-tablet when I make my phone upgrade in the near future.

    To Anand and Brian: I'd love to see a review of the Samsung Galaxy Note. Maybe Samsung can send you a demo unit. It looks like a refined Dell Streak with a super-high resolution display and Wacom digitizer built in. Intriguing.
  • Rick83 - Wednesday, November 2, 2011 - link

    That's why I got an Archos 5 two years ago. And what can I say? It works.

    Sadly the Note is A) three times as expensive as the Archos
    and B) not yet on Android 4

    there's also C) Codec support will suck compared to the Archos, and I'm pretty sure Samsung won't release an open bootloader, like Archos does.

    I'm hoping that Archos will soon release a re-fresh of their smaller size tablets base on OMAP 4 and Android 4.
    Alternatively, and equally as expensive as the Note, is the Sony dual-screen tablet. Looks interesting, but same caveats apply....
  • kylecronin - Monday, October 31, 2011 - link

    > It’s going to be a case by case basis to determine which 4 cases that cover the front of the display work with the 4S.

    Clever
  • metafor - Monday, October 31, 2011 - link

    "Here we have two hypothetical CPUs, one with a max power draw of 1W and another with a max power draw of 1.3W. The 1.3W chip is faster under load but it draws 30% more power. Running this completely made-up workload, the 1.3W chip completes the task in 4 seconds vs. 6 for its lower power predecessor and thus overall power consumed is lower. Another way of quantifying this is to say that in the example above, CPU A does 5.5 Joules of work vs. 6.2J for CPU B."

    The numbers are off. 4 seconds vs 6 seconds isn't 30% faster. Time-to-complete is the inverse of clockspeed.

    Say a task takes 100 cycles. It would take 1 second on a 100Hz, 1 IPC CPU and 0.77 seconds on a 130Hz, 1 IPC CPU. This translates to 4.62 sec if given a task that takes 600 cycles of work (6 sec on the 100Hz, 1 IPC CPU).

    Or 1W * 6s = 6J = 1.3W * 4.62s

    Exactly the same amount of energy used for the task.
  • Anand Lal Shimpi - Monday, October 31, 2011 - link

    Err sorry, I should've clarified. For the energy calculations I was looking at the entire period of time (10 seconds) and assumed CPU A & B have the same 0.05W idle power consumption.

    Doing the math that way you get 1W * 6s + 0.05W * 4s = 6.2J (CPU B)

    and

    1.3W * 4s + 0.05W * 6s = 5.5J (CPU A)
  • metafor - Monday, October 31, 2011 - link

    Erm, that still presents the same problem. That is, a processor running at 130% the clockspeed will not finish in 4 seconds, it will finish in 4.62s.

    So the result is:

    1W * 6s + 0.05W * 4s = 6.2J (CPU B)
    1.3W * 4.62s + 0.05 * 5.38s = 6.275J (CPU A)

    There's some rounding error there. If you use whole numbers, say 200Hz vs 100Hz:

    1W * 10s + 0.05W * 10s = 10.5W (CPU B running for 20s with a task that takes 1000 cycles)

    2W * 5s + 0.05W * 15s = 10.75W (CPU A running for 10s with a task that takes 1000 cycles)
  • Anand Lal Shimpi - Monday, October 31, 2011 - link

    I wasn't comparing clock speeds, you have two separate processors - architectures unknown, 100% hypothetical. One draws 1.3W and completes the task in 4s, the other draws 1W and completes in 6s. For the sake of drawing a parallel to the 4S vs 4 you could assume that both chips run at the same clock. The improvements are entirely architectural, similar to A5 vs. A4.

    Take care,
    Anand
  • metafor - Tuesday, November 1, 2011 - link

    In that case, the CPU that draws 1.3W is more power efficient, as it managed to gain a 30% power draw for *more* than a 30% performance increase.

    I absolutely agree that this is the situation with the A5 compared to the A4, but that has nothing to do with the "race to sleep" problem.

    That is to say, if CPU A finishes a task in 4s and CPU B finishes a task in 6s. CPU A is more than 30% faster than CPU B; it has higher perf/W.
  • Anand Lal Shimpi - Tuesday, November 1, 2011 - link

    It is race to sleep though. The more power efficient CPU can get to sleep quicker (hurry up and wait is what Intel used to call it), which offsets any increases in peak power consumption. However, given the right workload, the more power efficient CPU can still use more power.

    Take care,
    Anand

Log in

Don't have an account? Sign up now