Script:
Owner:
Subdir: kristineiln
Blog ID: 97170145
Group ID: User ID: 96393985

    Data processing tools, unstructured data, HANA, and the SAP EIM Roadmap: Interview with Ina Felsheim (transcript)


    SAP Professional Journal editor Scott Priest (@SAPProJournal) recently spoke with SAP’s Ina Felsheim about the new book “Enterprise Information Management with SAP,"  recently published by SAP PRESS.

     In this second in a three-part podcast series with SAPexperts, Ina covered topics on how SAP EIM offerings can drive both  proactive and reactive approaches to data management, and offers  an overview of the tools and HANA integration, as well as some advice on setting timelines and leaving ample time to set business definitions of what makes “good” data in the first place.

    Listen to this interview or read the edited transcript below:

    Scott Priest:  Hello, everybody. Today I'm speaking with Ina Felsheim of SAP. She is a Director of Solution Management, and one of the coauthors of the book, "Enterprise Information Management with SAP,"  released this May from SAP PRESS.  

    This is part 2 of a podcast series that we at SAPexperts are doing on Enterprise Information Management, here on Insider Learning Network. Thanks for joining us today, Ina.

    Ina Felsheim:  Thanks for having me. It’s great to be here.

    Scott Priest:  Today we'll talk about how EIM deals with data processing, specifically with unstructured data. In the first podcast of this series, Ginger Gatling, one of your colleagues at SAP, introduced the topic of EIM generally.  Here, I wanted to ask specifically with you about EIM and data management.

    What are the big challenges right now, in terms of improving data processing? And how are organizations using EIM solutions to address these and other data management issues?

    Ina Felsheim:  Great question. First of all, some of the challenges:

    As Ginger outlined, people don't necessarily know that they have an information problem. If a truck is unable to ship a product out of a certain plant, they may immediately think that that's a people problem or a business problem.

    Part of what information management tackles is a discipline called Information Governance, root‑cause analysis of any of those business problems: In some cases, that is a business process problem. In some cases, that's a people training problem, and in some cases that's an information problem.  

    The challenge is to know when you have an information problem.  When you realize, all of a sudden, "I can't ship product out of this plant, and now that is costing me millions of dollars a day" – that tends to be how it starts.

    Once you have that base understanding of why information is valuable to you  -- and the kind of problems that you can run into if you're not managing that information from an enterprise standpoint, as a true asset ‑ once you have that base, then you can really do some proactive analysis and identify and stop potential problems before they start impacting your business.

    To work through that root‑cause analysis that I talked about, you're going to need people from the business, and from multiple disciplines inside your business. People from Business Processing, people from a specific function in the line of business, people from IT, probably someone from Sales or Marketing or Finance, especially if you're changing the foundational definition of, say, what a "customer" means or what "revenue" means.

    That's going to impact all of those different areas in your company, so you need extreme cross‑functional buy‑in across all those groups, as well as making sure that you can implement and facilitate a better solution from the IT side. It's a huge challenge.

    Scott Priest:  What specific solutions are a part of SAP and EIM offerings?

    Ina Felsheim: As Ginger outlined, we have five main pillars of our EIM technology.

    It starts with Information Steward to identify "What information do I have, is it even fit for use, and where is it used throughout the enterprise?"  

    Secondly, you have Data Services. Data Services then can move the information that you have to any of the systems that need it. Data Services can clean it, and make it good as it moves it. This is huge, supplying data quality technologies to make sure you're not moving 10 copies of the same customer, with slight variations -- that you have one, good customer.

    Then we have Master Data Governance, which acts as that central hub and repository for your best, cleaned records, and can provision them to systems that need it. We also have Information Lifecycle Management, which handles your retention and archiving policies on that master information, once they’re created.

    Then Enterprise Content Management really takes care of all the rich content that goes hand‑in‑hand with your business process, like a sales contract. It doesn't do you a lot of good to have just the master data on who you're doing a sale with, unless you have the sales contract associated with it. Enterprise Content Management lets you tie that rich content with your master data, to have a much better understanding of your whole information system.

    Those are the pillars of the Enterprise Information Management strategy at SAP.

    Scott Priest: How do these offerings relate to something like HANA, which obviously is a big player right now in this whole space?

    Ina Fleisham: HANA is, of course, about high‑volume analytics. HANA right now uses Data Services because high data volumes mean lots of information into HANA. Data Integrator, which is a part of Data Services, is the piece that is going to move high volumes of information, very quickly, with great performance, into HANA.

    Then you can also use Data Services to clean that information as it goes in. Because, of course, if you don't clean the information as it is going into HANA, you're just getting a quicker look at your really bad information.

    We really say you need to have this Data Quality piece. You have just spent all of this time loading and setting up your HANA system. You’ll want to make sure that that information is good quality so you're getting the best results from your analytics.

    Scott Priest:  Earlier, you talked about a proactive approach, as it pertains to EIM. Can you explain how EIM can offer both proactive and reactive approaches to managing data?

    Ina Felsheim:  First there's the reactive approach I talked about earlier. We see this all the time. Maybe a company is getting a lot of returned mail. So there’s someone whose job it is to call someone else, gets the correct address into your source system, then resend the mail. This is just part of their job, part of what they do.

    That’s when you have to do that root‑cause analysis, peeling back the onion to say, "Where, exactly, does this problem happen?" and not just addressing the symptom.

    That's the reactive approachm and these tools can help. Data Services can run a huge, massive cleanup on your ERP data, for example. Information Lifecycle Management can run a massive retention and archiving policy against your system. Those things will never go away.

    But as organizations mature their information management strategy, they start to also implement some proactive processes. As information becomes defined -- as you say, "This is the sort of information I'm going to need" -- you're defining what makes that information good. What does a good customer element look like? Does it include a phone number? An email address? Can it include multiple addresses? Maybe it works if you're in marketing, but does it work if you're a utilities company, for example?

    Once you’ve gathered buy‑in about what makes a “good” customer, how can you automate systems to make sure, whenever a new customer is entered, that data is complying with those rules?

    Usually, companies have multiple points of entry for adding a new customer - maybe web forms, mobile applications, SAP systems. At any of those points of entry, when somebody adds a new customer, let's check if that customer already exists in our system. Let's make sure that that customer information is well‑formed enough for use in the business process or in analytics.

    That's the more proactive approach that we're talking about.

    The same tools that we use for the reactive solutions ‑ Data Services, Information Lifecycle Management, Master Data Governance, all of those tools ‑ can also be used proactively.   

    Whether that's proactive or reactive scenario, there’s an underlying "Design these policies once, then use them in multiple‑use scenarios" approach - you don't have to rewrite them. You're just plugging them into where you need them next.

    Information Steward is also a great help proactively, because it can take a look at your information landscape and say, "What do I have, and what is complete enough to use? What is mostly null values or system default values? How many people are actually looking at this element in their analytics system?"

    If you have an element that is never showing up on a report at all, maybe it's being used by the business process at some other point to investigate, but that's good information to have when you start trying to fix this information.

    Scott Priest: With this boom of unstructured data, can you offer some examples of how companies are using unstructured data to answer their business questions?

    Ina Felsheim: Whenever I present at a conference, people are always saying, "I'm interested in social media." When we ask what they’re trying to do with it, they say, "I don't know, but I know it's important to my CIO."

    We do run into companies who are a little bit further along, who have really tried to figure out how they can use social media to help inform their master data.

    That's how those two relate: Social media is the base form of that data, out in Twitter or Facebook. Then Text Data Processing pulls that information in, tries to parse it and make sense of it so you can make some decisions based on it. That's the Text Data Processing piece.

    We've talked to customers who are trying to identify how a specific product is being received in the market. They'll monitor Twitter or Facebook posts about how people are talking about the product.

    I also talked to one customer who's using the Text Data Processing to parse through the huge amount of resumes that they get every year. In another global company, where they do lots of hiring, they get resumes in from job sites, from emails to a friend. They want to be able to look through what are often  multi‑page documents, pick out key words that are important to them, and understand at a very high level the qualifications and main quality of some of these resumes and store that information. They're using Text Data Processing to bring in those huge amounts of information, and then try to understand the specific pieces of those resumes.

    A third common example of Text Data Processing is around notes during repair calls. The technicians in the field are entering data, and a lot of that gets captured in unstructured ways, in a notes field. Companies are using Text Data Processing to parse through the information for some high‑level analytics, for example: "Of the service calls on refrigerators, how many of them had a problem with the compressor? Or with the ice maker? Was it a warranty problem? Was it an improper use problem?"

    Then they can start to identify how they're going to stock and market their own materials. In all cases, Text Data Processing takes a large chunk of data to parse through it and gather  the key elements to try to figure out what decisions can be made based on the information.

    Scott Priest:  My last question is for organizations that are evaluating their data processing -  maybe they're looking to move to data services. Do you have any advice on the steps that they should be taking?

    Ina Felsheim:  Nearly all companies have some sort of data movement tool in place, somewhere. Maybe they're using SSIS because a guy in the IT department knew SQL really well, for example.

    We would say that, in general, it's great to standardize on a data movement technology for exactly the reason I mentioned earlier: Once I establish my policy of what "good" information looks like, then I can implement it in many different use scenarios.

    That is where you really start to see the benefit of a centralized, standardized, EIM tool like Data Services. We would say, start in one smaller use case. Maybe you're doing a data migration into SAP. Maybe you have a HANA project going on.

    Start in that specific use case. Use Data Services and Data Quality or  Information Steward, for example, so you can get a feel for what the tools can do for you. Once you have used it in one use case, it's going to be really easy for you to extend it into other scenarios.  

    On a side note, I would say make sure that you're talking to the business and to your stakeholders, because in nearly every case, people are spending way more time on the corporate alignment and decision‑making phase. Streamworks is a great help for helping you structure those decisions and making sure they're entirely auditable and captured. Make sure you're spending time in that phase. That can be up to 80% of your entire project time -- getting that corporate alignment, documenting those decisions, and refining your decisions as you look at the resultant information. Then really only maybe 20% of time might be in the actual implementation of the tool.

    I would just say make sure that you're spending time on both sides of that fence, and be aware that a lot of it is going to be on corporate alignment.

    Scott Priest:  Great. Thanks a lot, Ina.

    Ina Felsheim:  Thank you.

    Scott Priest:  For those of you looking for a big‑picture overview of EIM, check out the previous podcast that we mentioned on Insider Learning Network, with Ginger Gatling. And look for their new book "Enterprise Information Management with SAP” from SAP PRESS.

    Be sure to look for upcoming podcasts from Insider Learning Network around EIM. There will also be an opportunity to post your own questions to the authors of this book in the coming months.

    Thanks a lot, Ina.

    Ina:  Thanks, Scott

    0 (0 Ratings)

    A conversation with Anurag Barua, BI Expert author, on uncovering 3 little programs with big impact on BW


    Jacquelyn Howard, senior managing editor for BI Expert, sits down with Anurag Barua, author of the BI Expert article  'Three Hidden Gems that Boost Your Use of SAP NetWeaver BW' (as well as numerous other articles). Go behind the scenes to find out how Anurag discovered the "gems" he writes about in his article.

    Jacquelyn Howard:  I am talking today with Anurag Barua, author of the BI Expert article 'Three Hidden Gems that Boost Your Use of SAP NetWeaver BW'.  Today, Anurag, I'd like to talk a little bit about the stuff behind the scenes.

    What led you to choose the three programs that you highlight in this article?

    Anurag Barua:  I am glad to share that with you. In SAP BW, not to mention every application of SAP, is a whole host of what we call utility programs. These often lurk behind the scenes, but they perform very important functions. And the user is not necessarily aware of them. There are quite a lot of these programs that are available today in NetWeaver BW 7.0 and older versions, also. And SAP keeps adding on all these new programs.

    So, the reason I chose these three? Although there is a lot to choose from, I thought that these are three that really, really do specific things that make the user’s life a lot easier.

    And we'll get into some more details on them later on. But for example, let's talk about the first one that I chose. It's very convenient to be able to look at the components of your query in one screen, as opposed to knowing how to go into the BEx Query Designer and look at everything and all things. And the same holds true for the other two, which I'll talk about in a moment.

    Jacquelyn: Could you talk a little bit more about the second program?

    Anurag:  Sure. But the other thing I'd actually like to add is the fact that the documentation for such a utility program is usually very minimal, if it exists at all. And there is a reason behind that, too; a lot of these programs perform very specific functions and the user interface is very basic. It is not something that SAP typically advertises. So we don't hear or see a lot of information on these programs, which is a reason I highlight some of the key ones.

    So, to answer your question about the second one, which is called 'cube sample create': I think that is particularly useful for creating sample data. Now I should also put a big disclaimer here: we don't want to be using this program or running this program in any production system, because essentially what we are doing is filling cubes with sample data.

    So we don't want to do that with actual data. But it comes in very handy for prototype or for demonstration or proof of concepts, where users want to see actual data. And that's the best way to get them to believe that your design works.

    So what this helps you do is this: Now you just specify the name of the InfoCube, as you can see in one of the screenshots. And you need to put in the number of records that you want the program to populate.

    And that's it. It will go ahead and just populate it with some dummy generated value. Now, the value of all that is that what you see will at least give you some sense of how your data looks. And if this is a prototype you have put in place, then users are going to be able to connect to your model in a much better fashion.

    Jacquelyn:   These little programs are just great. I know we had the article up online, and people have really liked just how quickly they can implement the programs that you talked about, how quickly they can access them and see what they do.

    Anurag:  In terms of accessing these programs, there is always a security aspect right? So different companies have different models and obviously no one is going to have access to everything in a system. For these utility programs, you'd have to first talk to your NetWeaver BW administrator in case you are not able to execute them. And often times, given the nature of these programs, you might have to provide a rationale for requesting access to these programs.

    But if you are successful in doing that -- sometimes that's the tough one --   running these programs,  accessing them and running them, is extremely easy.  

    So the user interface, as I said, is very simple. You just need to fill in the information, if that's relevant, and then just execute it. It is as simple as that.

    Jacquelyn:   Now before we let you go, can you just tell us a little bit about the third program that you highlight?

    Anurag:   The third program is more technical in nature. The first two, I would say, have a lot more user connection and user interest. The last one would be more in the architects’ or developers’ realm, if you will. And the reason I say that is that we are talking here about update rules. As most of us know here, these are transformations that we apply to extract data from SAP and non‑SAP systems to BW before it is moved to the data target, such as the InfoCubes, then DSOs and whatnot.

    But if you have a big team and you a have lot of complex transformations that you are putting in, often times you don't have a good handle, a good way of managing these updates or tracking ultimate changes. And this becomes particularly troublesome when you have resources that are dispersed geographically.

    And sometimes, you are running into issues and can defuse those. It is good to be able for you as an administrator or as a developer, the technical person, to be able to see what all these update or transformations are, when this happened, who may need it. So this information in the aggregate will help you quickly zoom in to the one that could have caused the problem, if you are facing an issue with any of these update rules.

    So, as you can see in the screenshot that I provided in the article (shown below), it has this very clear layout where you can say, OK, these are the InfoCubes, and these have these update rules. And these were the dates these were turned off.

     

     

    And of course you will get into more detail. But the information that you get is going to be very helpful because if you look at the initial selection screen, you can also get it by the user itself. So you know that a particular user has made all these changes. So I think it is a very useful little utility program.

    Jacquelyn:  Great. I appreciate your talking with us for a little behind the scenes view on this article. Will you be attending any of the fall conferences this year?

    Anurag:  I'll try my best as I have tried in the past. I have spoken quite frequently at a lot of the WIS conferences and I hope if not this year, but definitely the BI conference for 2011.

    Jacquelyn:  Excellent. And people can also be looking forward to your BusinessObjects article that should be coming up very soon.

    Anurag:  Thank you. I am looking forward to providing my views on that one, too.

    For more BI tips and information, join the BI-BW Group on Insider Learning Network.

    4.1 (2 Ratings)
    Loading...